MCE Training Algorithm for a Speech Recognizer Detecting Mispronunciation of a Foreign Language

외국어 발음오류 검출 음성인식기를 위한 MCE 학습 알고리즘

  • 배민영 (대전대학교 정보통신공학과) ;
  • 정용주 (계명대학교 전자공학과) ;
  • 권철홍 (대전대학교 정보통신공학과)
  • Published : 2004.12.01

Abstract

Model parameters in HMM based speech recognition systems are normally estimated using Maximum Likelihood Estimation(MLE). The MLE method is based mainly on the principle of statistical data fitting in terms of increasing the HMM likelihood. The optimality of this training criterion is conditioned on the availability of infinite amount of training data and the correct choice of model. However, in practice, neither of these conditions is satisfied. In this paper, we propose a training algorithm, MCE(Minimum Classification Error), to improve the performance of a speech recognizer detecting mispronunciation of a foreign language. During the conventional MLE(Maximum Likelihood Estimation) training, the model parameters are adjusted to increase the likelihood of the word strings corresponding to the training utterances without taking account of the probability of other possible word strings. In contrast to MLE, the MCE training scheme takes account of possible competing word hypotheses and tries to reduce the probability of incorrect hypotheses. The discriminant training method using MCE shows better recognition results than the MLE method does.

Keywords