DOI QR코드

DOI QR Code

Optimizing Wavelet in Noise Canceler by Deep Learning Based on DWT

DWT 기반 딥러닝 잡음소거기에서 웨이블릿 최적화

  • Received : 2023.11.16
  • Accepted : 2024.02.17
  • Published : 2024.02.29

Abstract

In this paper, we propose an optimal wavelet in a system for canceling background noise of acoustic signals. This system performed Discrete Wavelet Transform(DWT) instead of the existing Short Time Fourier Transform(STFT) and then improved noise cancellation performance through a deep learning process. DWT functions as a multi-resolution band-pass filter and obtains transformation parameters by time-shifting the parent wavelet at each level and using several wavelets whose sizes are scaled. Here, the noise cancellation performance of several wavelets was tested to select the most suitable mother wavelet for analyzing the speech. In this study, to verify the performance of the noise cancellation system for various wavelets, a simulation program using Tensorflow and Keras libraries was created and simulation experiments were performed for the four most commonly used wavelets. As a result of the experiment, the case of using Haar or Daubechies wavelets showed the best noise cancellation performance, and the mean square error(MSE) was significantly improved compared to the case of using other wavelets.

본 논문에서는 음향신호의 배경잡음을 소거하기 위한 시스템에서 최적의 wavelet을 제안한다. 이 시스템은 기존의 단구간 푸리에변환(STFT: Short Time Fourier Transform) 대신 이산 웨이블릿변환(DWT: Discrete Wavelet Transform)을 수행한 후 심층학습과정을 통하여 잡음소거 성능을 개선하였다. DWT는 다해상도 대역통과필터 기능을 하며 각 레벨에서 모 웨이블릿을 시간 이동시키고 크기를 스케일링한 여러 웨이블릿을 이용하여 변환 파라미터를 구한다. 여기서 음성을 분석하는데 가장 적합한 모(mother) 웨이블릿을 선정하기 위해 여러 웨이블릿에 대한 잡음소거 성능을 실험하였다. 본 연구에서 여러 웨이블릿에 대한 잡음소거시스템의 성능을 검증하기 위하여 Tensorflow와 Keras 라이브러리를 사용한 시뮬레이션 프로그램을 작성하고 가장 많이 사용되는 4개의 wavelet에 대해 모의실험을 수행하였다. 실험 결과, Haar 또는 Daubechies 웨이블릿을 사용하는 경우가 가장 우수한 잡음소거 성능을 나타냈으며 타 웨이블릿을 사용하는 경우보다 평균자승오차(MSE: Mean Square Error)가 크게 개선되는 것을 볼 수 있었다.

Keywords

References

  1. S. F. Boll, "Suppression of acoustic noise in speech using spectral subtraction," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-29, Apr. 1979, pp. 113-120. https://doi.org/10.1109/TASSP.1979.1163209
  2. J. Hansen and M. Clements, "Constrained iterative speech enhancement with to speech recognition," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-39, no. 4, Apr. 1989, pp. 21-27.
  3. H. Lee, "Nonlinear noise attenuator by adaptive Wiener filter with neural network," J. of the Korea Institute of Electronic Communication Sciences, vol. 18, no. 1, 2023, pp. 71-76.
  4. J. Lim, A. V. Oppenheim, and L. D. Braida, "Evaluation of an adaptive comb filtering method for enhancing speech degraded by white noise addition," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-26, no. 4, Apr. 1991, pp. 354-358.
  5. W. A. Harrison, J. Lim, and E. Singer, "A new application of adaptive noise cancellation," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-34, Feb. 1986, pp. 21-27.
  6. V. Justin, S. Saudia, and T. Nasser, "Fourier transform-based windowed adaptive switching minimum filter for reducing periodic noise from digital images," IET image processing, vol. 10, no. 9, 2016, pp. 646-656. https://doi.org/10.1049/iet-ipr.2015.0750
  7. I. Daubechies, "The Wavelet Transform Time-Frequency Localization and Signal Analysis," IEEE Trans. on Information Theory, vol. 36, no. 5, 1990, pp. 961-1005. https://doi.org/10.1109/18.57199
  8. C. Lee and D. Kim, "Adaptive Noise Reduction of Speech Using Wavelet Transform," J. of the Korea Institute of Electronic Communication Sciences, vol. 4, no. 3, 2009, pp. 190-196.
  9. D. L. Dondio, "De-Noising by Soft-Thresholding," IEEE Trans. on Information Theory, vol. 41, no. 3, 1995, pp. 613-627. https://doi.org/10.1109/18.382009
  10. J. Schmidhuber, "Deep learning in neural networks: An overview," Neural Networks, vol. 61, 2015, pp. 85-117. https://doi.org/10.1016/j.neunet.2014.09.003
  11. H. Lee, "Optimization of the number of filter in CNN noise attenuator," J. of the Korea Institute of Electronic Communication Sciences, vol. 16, no. 4, 2021, pp. 625-632.