DOI QR코드

DOI QR Code

A Generalized Subspace Approach for Enhancing Speech Corrupted by Colored Noise Using Whitening Transformation

유색 잡음에 오염된 음성의 향상을 위한 백색 변환을 이용한 일반화 부공간 접근

  • Received : 2011.03.10
  • Accepted : 2011.07.13
  • Published : 2011.08.31

Abstract

In this paper, we proposed an algorithm for speech enhancement of speeches corrupted by colored noise. When there is no correlation between colored noise and speech signal, the colored noise turns into white noise through whitening transformation. This transformed signal has been applied to the generalized subspace approach for speech enhancement. The speech spectral distortion, produced by the whitening transformation as pre-processing, has been restored by using the inverse whitening transformation as post-processing of the proposed algorithm. The performance of the proposed algorithm for speech enhancement has been confirmed by computer simulation. The colored noises used in this experiment were car noise and multi-talker babble. It is confirmed that the proposed algorithm shows better performance from SNR and SSD viewpoint over the previous approach with the data from the AURORA and TIMIT data base.

본 논문에서는 유색잡음에 의해 오염된 음성신호의 음성향상 알고리즘을 제안한다. 유색잡음과 음성신호가 서로 상관이 없을 경우 유색잡음은 백색화 변환을 통해 무색잡음으로 변환된다. 이 변환된 신호를 음성신호 향상을 위한 일반화된 부공간 접근법에 적용한다. 전처리 과정에서의 백색화 변환으로 발생되는 음성 스펙트럼 왜곡은 제안한 알고리즘의 후처리를 통해 역 백색화하여 복구한다. 제안한 알고리즘의 성능을 컴퓨터 시뮬레이션으로 확인하였다. 사용한 유색잡음은 자동차 잡음과 멀티 토커 배블 잡음이다. AURORA 및 TIMIT 데이터 베이스에서 취득한 데이터로 실험했을 때 제안하는 방법이 신호대잡음비 및 스펙트럼 왜곡 측면에서 기존 접근법보다 개선됨을 확인하였다.

Keywords

Acknowledgement

Supported by : 부산대학교

References

  1. M. Berouti, R. Schwartz, and J. Makhoul, "Enhancement of speech corrupted by acoustic noise," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, 1979, pp. 208-211.
  2. P. Lockwood and J. Boudy, "Experiments with a nonlinear spectral subtractor (NSS), Hidden Markov Models and the projection, for robust speech recognition in cars," Speech Commun., vol. 11, no. 2, pp.215-228, 1992. https://doi.org/10.1016/0167-6393(92)90016-Z
  3. Y. Ephraim and H. L. Van Trees, "A signal subspace approach for speech enhancement," IEEE Trans. Speech Audio Processing, vol. 3, pp.251-266, 1995. https://doi.org/10.1109/89.397090
  4. U. Mittal and N. Phamdo, "Signal/noise KLT based approach for enhancing speech degraded by colored noise," IEEE Trans. Speech Audio Processing, vol. 8, pp. 159-167, Mar. 2000. https://doi.org/10.1109/89.824700
  5. A. Rezayee and S. Gazor, "An adaptive KLT approach for speech enhancement," IEEE Trans. Speech Audio Processing, vol. 9, pp. 87-95, Feb. 2001. https://doi.org/10.1109/89.902276
  6. Y. Hu and P. C. Loizou, "A subspace approach for enhancing speech corrupted by colored noise," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, vol. 1, Orlando, FL, May 2002, pp. 573-576.
  7. M. Dendrinos, S. Bakamidis, and G. Carayannis, "Speech enhancement from noise: A regenerative approach," Speech Commun., vol. 10, pp. 45-57, 1991. https://doi.org/10.1016/0167-6393(91)90027-Q
  8. S. H. Jensen, P. C. Hansen, S. D. Hansen, and J. A. Sorensen, "Reduction of broad-band noise in speech by truncated QSVD," IEEE Trans. Speech Audio Processing, vol. 3, pp. 439-448, Nov. 1995. https://doi.org/10.1109/89.482211
  9. Yi Hu and Philipos C. Loizou, A Generalized Subspace Approach for Enhancing Speech Corrupted by Colored Noise IEEE Trans. Speech Audio Processing, vol. 11, no. 4, pp. 334-341. July 2003. https://doi.org/10.1109/TSA.2003.814458
  10. S. B. Searle, Matrix Algebra Useful for Statistics. New York: Wiley, 1982.
  11. G. Strang, Linear Algebra and Its Applications, 3rd ed. New York: Harcourt Brace Jovanonich, 1988.

Cited by

  1. A Generalized Subspace Approach for Enhancing Speech Corrupted by Colored Noise Using Voice Activity Detector(VAD) vol.17, pp.8, 2013, https://doi.org/10.6109/jkiice.2013.17.8.1769