DOI QR코드

DOI QR Code

Two-Microphone Generalized Sidelobe Canceller with Post-Filter Based Speech Enhancement in Composite Noise

  • Park, Jinsoo (Department of Biomicrosystem Technology, Korea University) ;
  • Kim, Wooil (School of Computer Science Engineering, Incheon National University) ;
  • Han, David K. (Office of Naval Research) ;
  • Ko, Hanseok (School of Electrical Engineering, Korea University)
  • 투고 : 2015.05.23
  • 심사 : 2015.12.09
  • 발행 : 2016.04.01

초록

This paper describes an algorithm to suppress composite noise in a two-microphone speech enhancement system for robust hands-free speech communication. The proposed algorithm has four stages. The first stage estimates the power spectral density of the residual stationary noise, which is based on the detection of nonstationary signal-dominant time-frequency bins (TFBs) at the generalized sidelobe canceller output. Second, speech-dominant TFBs are identified among the previously detected nonstationary signal-dominant TFBs, and power spectral densities of speech and residual nonstationary noise are estimated. In the final stage, the bin-wise output signal-to-noise ratio is obtained with these power estimates and a Wiener post-filter is constructed to attenuate the residual noise. Compared to the conventional beamforming and post-filter algorithms, the proposed speech enhancement algorithm shows significant performance improvement in terms of perceptual evaluation of speech quality.

키워드

참고문헌

  1. R. Martin, "Statistical Methods for the Enhancement of Noisy Speech," Int. Workshop Acoust. Echo Noise Contr., Kyoto, Japan, Sept. 8-11, 2003, pp. 1-6.
  2. J. Beh and H. Ko, "Spectral Subtraction Using Spectral Harmonics for Robust Speech Recognition in Car Environments," Lecture Notes Comput. Sci., vol. 2660, June 2003, pp. 1109-1116.
  3. J. Benesty, J. Chen, and Y. Huang, "Microphone Array Signal Processing," Berlin, Germany: Springer-Verlag, 2008, pp. 1-222.
  4. J. Beh, R.H. Baran, and H. Ko, "Dual Channel Based Speech Enhancement Using Novelty Filter for Robust Speech Recognition in Automobile Environment," IEEE Trans. Consum. Electron., vol. 52, no. 2, May 2006, pp. 583-589. https://doi.org/10.1109/TCE.2006.1649683
  5. L.J. Griffiths and C.W. Jim, "An Alternative Approach to Linearly Constrained Adaptive Beamforming," IEEE Trans. Antennas Propag., vol. 30, no. 1, Jan. 1982, pp. 27-34. https://doi.org/10.1109/TAP.1982.1142739
  6. O. Hoshuyama, A. Sugiyama, and A. Hirano, "A Robust Adaptive Beamformer for Microphone Arrays with a Blocking Matrix Using Constrained Adaptive Filters," IEEE Trans. Signal Process., vol. 47, no. 10, Oct. 1999, pp. 2677-2684. https://doi.org/10.1109/78.790650
  7. W. Herbordt and W. Kellermann, "Analysis of Blocking Matrices for Generalized Sidelobe Cancellers for Non-stationary Broadband Signals," IEEE Int. Conf. Acoust. Speech Signal Process., Orlando, FL, USA, May 13-17, 2002, pp. IV-4187.
  8. S. Gannot, D. Burshtein, and E. Weinstein, "Signal Enhancement Using Beamforming and Nonstationarity with Applications to Speech," IEEE Trans. Signal Process., vol. 49, no. 8, Aug. 2001, pp. 1614-1626. https://doi.org/10.1109/78.934132
  9. S. Gannot, D. Burshtein, and E. Weinstein, "Theoretical Analysis of the General Transfer Function GSC," Int. Workshop Acoust. Echo Noise Contr., Darmstadt, Germany, Sept. 10-13, 2001, pp. 103-106.
  10. J. Park et al., "Pre-filtering Algorithm for Dual-Microphone Generalized Sidelobe Canceller Using General Transfer Function," IEICE Trans. Inf. Syst., vol. E97-D, no. 9, Sept. 2014, pp. 2533-2566. https://doi.org/10.1587/transinf.2014EDL8026
  11. I.A. McCowan and H. Bourlard, "Microphone Array Post-Filter Based on Noise Field Coherence," IEEE Trans. Speech Audio Process., vol. 11, no. 6, Nov. 2003, pp. 709-716. https://doi.org/10.1109/TSA.2003.818212
  12. H. Yoon and H. Ko, "Microphone Array Post-Filter Using Input Output Ratio of Beamformer Noise Power Spectrum," Electron. Lett., vol. 43, no. 18, Aug. 2007, pp. 1003-1005. https://doi.org/10.1049/el:20071534
  13. O. Yilmaz and S. Rickard, "Blind Separation of Speech Mixtures via Time-Frequency Masking," IEEE Trans. Signal Process., vol. 52, no. 7, July 2004, pp. 1830-1847. https://doi.org/10.1109/TSP.2004.828896
  14. S. Jeong, S. Lee, and M. Hahn, "Dual Microphone-Based Speech Enhancement by Spectral Classification and Wiener Filtering," Electron. Lett., vol. 44, no. 3, July 2008, pp. 253-254. https://doi.org/10.1049/el:20083327
  15. S.V. Vaseghi, "Wiener Filters," in Advanced Digital Signal Processing and Noise Reduction, Chichester, UK: John Wiley & Sons Ltd., 2002, pp. 178-202.
  16. K.W. Baugh and K.R. Hardwicke, "On the Detection of Transient Signals Using Spectral Correlation," Circuits Syst. Signal Process., vol. 13, no. 4, Dec. 1994, pp. 467-479. https://doi.org/10.1007/BF01183742
  17. R. Martin, "Spectral Subtraction Based on Minimum Statistics," Proc. European Signal Process. Conf., Edinburgh, UK, Sept. 13-16, 1994, pp. 1182-1185.
  18. S.V. Vaseghi, "Spectral Subtraction," in Advanced Digital Signal Processing and Noise Reduction, Chichester, UK: John Wiley & Sons Ltd., 2002, pp. 333-352.
  19. ITU-T Recommendation P.862, Perceptual Evaluation of Speech Quality (PESQ): An Objective Method for End-to-End Speech Quality Assessment of Narrowband Telephone Networks and Speech Codecs, Feb. 2001.

피인용 문헌

  1. Generalized Sidelobe Canceller Beamforming with Combined Postfilter and Sparse NMF for Speech Enhancement vol.20, pp.2, 2016, https://doi.org/10.1142/s0219477521500140