DOI QR코드

DOI QR Code

Speech Enhancement Based on IMCRA Incorporating noise classification algorithm

잡음 환경 분류 알고리즘을 이용한 IMCRA 기반의 음성 향상 기법

  • Received : 2012.08.23
  • Accepted : 2012.11.28
  • Published : 2012.12.01

Abstract

In this paper, we propose a novel method to improve the performance of the improved minima controlled recursive averaging (IMCRA) in non-stationary noisy environment. The conventional IMCRA algorithm efficiently estimate the noise power by averaging past spectral power values based on a smoothing parameter that is adjusted by the signal presence probability in frequency subbands. Since the minimum of smoothing parameter is defined as 0.85, it is difficult to obtain the robust estimates of the noise power in non-stationary noisy environments that is rapidly changed the spectral characteristics such as babble noise. For this reason, we proposed the modified IMCRA, which adaptively estimate and updata the noise power according to the noise type classified by the Gaussian mixture model (GMM). The performances of the proposed method are evaluated by perceptual evaluation of speech quality (PESQ) and composite measure under various environments and better results compared with the conventional method are obtained.

Keywords

References

  1. R. Martin, "Spectral subtraction based on minimum statistics," Proceeding of 7th EUSIPCO'94, Edinburgh, U.K., pp.1182-1185, Sep. 1994.
  2. I. Cohen and B. Berdugo, "Spectral enhancement by tracking speech presence probability in subbands," Proc. IEEE Workshop on Hands Free Speech Communication, HSC'01, Kyoto, Japan, pp.95-98, Apr. 2001.
  3. Y. H. Son, S. M. Lee, "Improved speech absence probability estimation based on environmental noise classification," Journal of Central South University., Vol. 19, No. 9, pp 2548-2553, September. 2012. https://doi.org/10.1007/s11771-012-1309-6
  4. S. F. Boll, "Suppression of acoustic noise in speech using spectral subtraction," IEEE Transactions on Acoustics, Speech and Signal Processing, pp.113-120, Apr. 1979.
  5. I. Cohen and B. Berdugo, "Speech enhancement for non-stationary noise environment," Signal Processing, pp.2403-2418, Nov. 2001.
  6. G. Doblinger, "Computationally efficient speech enhancement by spectral minima tracking in subbands," Proc. 4th European Conf. Speech, Communication and Technology, EUROSPEECH'95, pp.1513-1516, Sep. 1995.
  7. I. Cohen and B. Berdugo, " Noise estimation by minima controlled recursive averaging for robust speech enhancement," IEEE Signal Processing Letters, pp.12-15, Jan. 2002
  8. I. Cohen, "Noise spectrum estimation in adverse environments : improved minima controlled recursive averaging," IEEE Transactions on Speech and Audio Processing, pp.466-475, Sep. 2003.
  9. P. Renevey and A, Drygajlo, "Entropy based voice activity detection in very noisy conditions." In Eurospeech 2001, pp. 1887-1890, Sep. 2001.
  10. M. Asgari, A. Sayadian, M. Farhadloo and E. A. Mehrizi, "Voice activity detection using entropy in spectral domain," Telecommunication Networks and Applications Conference 2008, pp. 407-410, Dec. 2008.
  11. J. Lei, J. Wang and Z, Yang, "A robust voice activity detection algorithm in nonstationary noise," International Conference on Industrial and Information Systems 2009, pp. 195-198, Dec. 2009.
  12. Y. Ephraim and D. Malah, "Speech enhancement using a minimum mean-square error log-spectral amplitude estimator," IEEE Transactions on Acoustics, Speech and Siganl Processing, ASSP-32(2), pp.443-445, Apr. 1985.
  13. Y. Ephraim and D. Malah, "Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator," IEEE Transactions on Acoustics, Speech and Siganl Processing, ASSP-32(6), pp.1109-1121, Dec. 1984.
  14. Y. Hu and P. C. Loizou, "Evaluation of objective quality measures for speech enhancement," IEEE Transactions on Audio, Speech and Language Processing, pp.229-238 Jan. 2008.