The Comparison of Speaker Adaptation Methods

;

The Journal of the Acoustical Society of Korea (한국음향학회지)

Volume 18 Issue 1
/
Pages.61-66
/
1999
/
1225-4428(pISSN)
/
2287-3775(eISSN)

The Acoustical Society of Korea (한국음향학회)

The Comparison of Speaker Adaptation Methods

화자 적응 방법들의 비교

황영수 (관동대학교 전자공학과)

Published : 1999.01.01

PDF

Download PDF

⟨ Previous Next ⟩

Abstract

In this paper, we proposed various speaker adaptation methods and studied the performance of these methods. Methods which were studied in this paper are MAPE(Maximum A Posteriori Probability Estimation), Linear Spectral Estimating, Multi-Layer Perceptron and ARTMAP. In order to evaluate the performance of these methods, we used Korean isolated digits as the experimental data, the hybrid speaker adaptation method, which unified MAPE, linear spectral estimating and output probability of SCHMM, showed the better recognition result than those which performed other methods. And the method using ARTMAP showed the similar result to above hybrid method.

본 논문은 화자 적응 방법 제안과 그 방법들의 성능을 검토한 것이다. 본 논문에서 제안 검토한 방법들은 최대사후확률추정(MAPE)방법, 음성 선형 특성을 이용한 방법, 다층 퍼셉트론(MLP)을 이용한 방법과 ARTMAP을 이용한 방법들이다. 각 방법들의 성능 평가를 위하여 한국어 숫자음으로 실험한 결과, 최대사후확률추정 방법과 반연속 HMM의 출력 확률적응, 음성 선형 특성 등 3방법을 결합한 방법이 가장 우수한 결과를 보였으며, 이와 비슷한 실험 결과를 ARTMAP을 이용한 화자 적응 방법에서 보였다.

Keywords

References

Speech Comm. v.5 no.2 Vowel Normalization by Frequency Warped Spectral Matching H.Matsumoto,(et al.)
IEEE Trans.Acoust.Speech Signal Processing v.ASSP-28 no.2 A Training Procedure for Isolated Word Recognition Systems S.Furui,
新學論 J67-A v.6 セット化音韻テンプレット二基つくん不特定話者單語音聲認識ツステム木下
Proc.ICASSP 87 v.15.5 Speaker Adaptation through Vector Quantization K.Shikano,(et.al.)
Proc.ICASSP 87 v.15.5 Fuzzy Vector Quantization Applied to Hidden Markov Modeling H.Tseng,(et.al.)
전자정보통신공학회논문집 v.J81-D-II no.3 Speaker Adaptation Using Maximum a Posteriori Probability Estimation Estimation and data Size Dependent Parameter Smoothing M.Tonomura,;T.Kosaka,;S.Matsunaga,
The Journal of the Acoustical Society of Korea v.15 no.3 A Study on the Speaker Adaptation of SCHMM Young Soo,Hwang.
IEEE Neural Networks v.NN-3 Fuzzy ARTMAP: A neural network architecture for incremental supervised learning of analog multidimensional maps G.A.Carpenter,;Grossberg,;N.Markuzon,;J.H.Reynolds,;D.B.Rosen,
제15회 음성 통신 및 신호 처리 워크샵 논문집 v.15 no.1 관찰 확률 최대화를 이용한 화자 적응 양태영;윤대희;차일환 외 6인.

The Journal of the Acoustical Society of Korea (한국음향학회지)

The Comparison of Speaker Adaptation Methods

화자 적응 방법들의 비교

Abstract

Keywords

References

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)