Prosodic Break Index Estimation using LDA and Tri-tone Model

LDA와 tri-tone 모델을 이용한 운율경계강도 예측

  • Published : 1999.10.01

Abstract

In this paper we propose a new mixed method of LDA and tri-tone model to predict Korean prosodic break indices(PBI) for a given utterance. PBI can be used as an important cue of syntactic discontinuity in continuous speech recognition(CSR). The model consists of three steps. At the first step, PBI was predicted with the information of syllable and pause duration through the linear discriminant analysis (LDA) method. At the second step, syllable tone information was used to estimate PBI. In this step we used vector quantization (VQ) for coding the syllable tones and PBI is estimated by tri-tone model. In the last step, two PBI predictors were integrated by a weight factor. The proposed method was tested on 200 literal style spoken sentences. The experimental results showed 72% accuracy.

본 논문에서는 발화된 문장으로부터 운율 경계 강도를 효과적으로 예측하기 위해 LDA와 tri-tone 모델을 혼합한 방법을 제안하였다. 이 방법은 기존의 LDA 방법을 사용하여 음절과 휴지기의 길이 정보를 운율경계강도 예측에 적용하고 피치정보를 벡터양자화에 적용하여 tri-tone이란 개념을 도입한 혼합형 모형이다. 제안된 방법은 주어진 200문장의 운율경계 강도를 예측하는 실험에서 72%의 정확성을 나타내었다.

Keywords