- Volume 4 Issue 4
DOI QR Code
Microphone Array Based Speech Enhancement Using Independent Vector Analysis
마이크로폰 배열에서 독립벡터분석 기법을 이용한 잡음음성의 음질 개선
- Received : 2012.08.14
- Accepted : 2012.11.14
- Published : 2012.12.31
Speech enhancement aims to improve speech quality by removing background noise from noisy speech. Independent vector analysis is a type of frequency-domain independent component analysis method that is known to be free from the frequency bin permutation problem in the process of blind source separation from multi-channel inputs. This paper proposed a new method of microphone array based speech enhancement that combines independent vector analysis and beamforming techniques. Independent vector analysis is used to separate speech and noise components from multi-channel noisy speech, and delay-sum beamforming is used to determine the enhanced speech among the separated signals. To verify the effectiveness of the proposed method, experiments for computer simulated multi-channel noisy speech with various signal-to-noise ratios were carried out, and both PESQ and output signal-to-noise ratio were obtained as objective speech quality measures. Experimental results have shown that the proposed method is superior to the conventional microphone array based noise removal approach like GSC beamforming in the speech enhancement.
Supported by : 경북대학교
- Ephraim, Y. & Malah, D. (1984). Speech enhancement using a minimum-mean square error short-time spectral amplitude estimator, Acoustics, Speech and Signal Processing, IEEE Transactions on. vol. 32, 1109-1121. https://doi.org/10.1109/TASSP.1984.1164453
- Benesty, J. et al. (2008). Microphone array signal processing. Berlin: Springer Verlag.
- Hyvarinen, A., Karhunen, J. & Oja, E. (2001). Independent component analysis. New Youk: Wiley.
- Haykin, S. (2009). Neural networks and learning machines. New Jersey: Pearson Prentice Hall.
- Asano, F., Ikeda, S., Ogawa, M., Asoh, H. & Kitawaki, N. (2001). A combined approach of array processing and independent component analysis for blind separation of acoustic signals, in Proc.IEEE Int. Conf. Acoust., Speech Signal Process., 2729-2732.
- Kim, T., Attias, H., Lee, S. & Lee, T. (2007). Blind source separation exploiting higher-order frequency dependencies, Audio, Speech, and Language Processing, IEEE Transactions on, vol. 15, 70-79. https://doi.org/10.1109/TASL.2006.872618
- PESQ, http://www.itu.int/rec/T-REC-P.862/en
- Donohue, K. D. Systems Array Processing Toolbox. http://www.engr.uky.edu/-donohue/.
- NoiseX-92, http://www.speech.cs.cmu.edu/comp.speech/ Section1/ Data/noisex.html