Isolated-Word Speech Recognition using Variable-Frame Length Normalization

Sin, Chan-Hu;Lee, Hui-Jeong;Park, Byeong-Cheol;

한국음향학회지 (The Journal of the Acoustical Society of Korea)

제6권4호
/
Pages.21-30
/
1987
/
1225-4428(pISSN)
/
2287-3775(eISSN)

한국음향학회 (The Acoustical Society of Korea)

가변프레임 길이정규화를 이용한 단어음성인식

Isolated-Word Speech Recognition using Variable-Frame Length Normalization

신찬후 (성균관대학교 공과대학 전자공학과) ;
이희정 (성균관대학교 공과대학 전자공학과) ;
박병철 (성균관대학교 공과대학 전자공학과)

발행 : 1987.12.01

PDF

PDF 다운로드

⟨ 이전 논문 다음 논문 ⟩

초록

단어음성인식에서 발성속도의 차이에 따른 단어음성 길이의 비선형적 변화는 정확한 인식을 어렵게 하는 주요한 원인이 되어 왔다. DP매칭은 시간축의 비선형 신축에 의해 시간정규화를 행함으로써 인식결과에 대한 신뢰성을 상당히 높였으나 시간정규화 파정에 요구되는 과도한 계산부담이 문제로 되어 있다. 본 논문에서는 시간정규화가 필요없는 방법으로 멀티섹션벡터양자화에 새로운 길이정규화법을 적용하는 방법을 제안한다. 이 방법은 종래의 고정프레임 길이정규화에 의해 멀티섹션코드북을 작성할 때보다. 정규화길이의 실정에 훨씬 융통성을 가질 수 있으므로 분석 및 거리계산의 양면에서 시간 단축을 가능케 하여 좀더 신속히 인식결과를 얻을 수 있는 장점이 있다

Length normalization by variable frame size is proposed as a novel approach to length normalization to solve the problem that the length variation of spoken word results in a lowing of recognition accuracy. This method has the advantage of curtailment of recognition time in the recognition stage because it can reduce the number of frames constructing a word compared with length normalization by a fixed frame size. In this paper, variable frame length normalization is applied to multisection vector quantization and the efficiency of this method is estimated in the view of recognition time and accuracy through practical recognition experiments.

한국음향학회지 (The Journal of the Acoustical Society of Korea)

가변프레임 길이정규화를 이용한 단어음성인식

Isolated-Word Speech Recognition using Variable-Frame Length Normalization

초록

키워드

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)