Vocal Tract Normalization Using The Power Spectrum Warping

파워 스펙트럼 warping을 이용한 성도 정규화

  • 유일수 (성균관대학교 정보통신공학부) ;
  • 김동주 (성균관대학교 정보통신공학부) ;
  • 노용완 (성균관대학교 정보통신공학부) ;
  • 홍광석 (성균관대학교 정보통신공학부)
  • Published : 2003.11.21

Abstract

The method of vocal tract normalization has been known as a successful method for improving the accuracy of speech recognition. A frequency warping procedure based low complexity and maximum likelihood has been generally applied for vocal tract normalization. In this paper, we propose a new power spectrum warping procedure that can be improve on vocal tract normalization performance than a frequency warping procedure. A mechanism for implementing this method can be simply achieved by modifying the power spectrum of filter bank in Mel-frequency cepstrum feature(MFCC) analysis. Experimental study compared our Proposal method with the well-known frequency warping method. The results have shown that the power spectrum warping is better 50% about the recognition performance than the frequency warping.

Keywords