DOI QR코드

DOI QR Code

A Generalized Subspace Approach for Enhancing Speech Corrupted by Colored Noise Using Voice Activity Detector(VAD)

음성활동영역검색을 사용하는 유색잡음에 오염된 음성의 향상을 위한 일반화 부공간 접근

  • Son, Kyung-Sik (Department of Electronics Engineering, Pusan National University) ;
  • Kim, Hyun-Tae (Department of Multimedia Engineering, Dongeui University)
  • Received : 2013.04.09
  • Accepted : 2013.05.22
  • Published : 2013.08.31

Abstract

In this paper, we proposed the modified YL(Yi and Loizou) algorithm, using a VAD(voice activity detector) for enhancing speech corrupted by colored noise. The performance of the proposed algorithm has been compared to the YL algorithm and LS(Lee and Son, etc.) algorithm by computer simulation. The colored noises used in the experiment were a car noise and multi-talker babble from the AURORA data base and the used voices from the TIMIT data base. It is confirmed that the proposed algorithm shows better performance from SNR(signal to noise ratio) and SSD(speech spectral distortion) viewpoint over the previous two approach.

본 논문에서는 유색잡음에 의해 오염된 음성신호의 음성향상 알고리즘인 YL 접근법에 VAD(voice activity detector)를 구현하는 수정된 알고리즘을 제안한다. 제안한 알고리즘을 YL 접근법 및 LS 접근법과 컴퓨터 시뮬레이션으로 성능을 비교하였다. 사용한 유색잡음은 자동차 잡음과 다중화자 배블 잡음으로 AURORA 데이터베이스로 부터 각각 발췌하였고, 음성신호는 TIMIT 데이터 베이스로부터 발췌하였다. 제안한 알고리즘을 실험했을 때 제안하는 방법이 신호대잡음비 및 스펙트럼 왜곡 측면에서 기존의 두 알고리즘 보다 개선됨을 확인하였다.

Keywords

References

  1. Y. Ephraim and H. L. Van Trees, "A signal subspace approach for speech enhancement," IEEE Trans. Speech Audio Processing, vol. 3, pp.251-266, 1995. https://doi.org/10.1109/89.397090
  2. U. Mittal and N. Phamdo, "Signal/noise KLT based approach for enhancing speech degraded by colored noise," IEEE Trans. Speech Audio Processing, vol. 8, pp. 159-167, Mar. 2000. https://doi.org/10.1109/89.824700
  3. A. Rezayee and S. Gazor, "An adaptive KLT approach for speech enhancement," IEEE Trans. Speech Audio Processing, vol. 9, pp. 87-95, Feb. 2001. https://doi.org/10.1109/89.902276
  4. Yi Hu and Philipos C. Loizou, A Generalized Subspace Approach for Enhancing Speech Corrupted by Colored Noise IEEE Trans. Speech Audio Processing, vol. 11, no. 4, pp. 334-341. July 2003. https://doi.org/10.1109/TSA.2003.814458
  5. J. W. Lee, K. S. Son, J. S. Park, H. T. Kim, "A Generalized Subspace Approach for Enhancing Speech Corrupted by Colored Noise Using Whitening Transformation", The Journal of the Korean Institute of Information and Communication Engineering, vol. 15, no. 8, pp. 1665-1674, 2011. https://doi.org/10.6109/jkiice.2011.15.8.1665
  6. S. B. Searle, Matrix Algebra Useful for Statistics. New York: Wiley, 1982.
  7. G. Strang, Linear Algebra and Its Applications, 3rd ed. New York: Harcourt Brace Jovanonich, 1988.
  8. Y. Hu and P. C. Loizou, "A subspace approach for enhancing speech corrupted by colored noise," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, vol. 1, Orlando, FL, May 2002, pp. 573-576.