A Study on Speaker Identification Parameter Using Difference and Correlation Coeffieicent of Digit_sound Spectrum

숫자음의 스펙트럼 차이값과 상관계수를 이용한 화자인증 파라미터 연구

  • 이후동 (서경대학교 컴퓨터학과) ;
  • 강선미 (서경대학교 컴퓨터학과) ;
  • 장문수 (서경대학교 소프트웨어학과) ;
  • 양병곤 (동의대학교 영어영문학과)
  • Published : 2004.09.01

Abstract

Speaker identification system basically functions by comparing spectral energy of an individual production model with that of an input signal. This study aimed to develop a new speaker identification system from two parameters from the spectral energy of numeric sounds: difference sum and correlation coefficient. A narrow-band spectrogram yielded more stable spectral energy across time than a wide-band one. In this paper, we collected empirical data from four male speakers and tested the speaker identification system. The subjects produced 18 combinations of three-digit numeric. sounds !en times each. Five productions of each three-digit number were statistically averaged to make a model for each speaker. Then, the remaining five productions were tested on the system. Results showed that when the threshold for the absolute difference sum was set to 1200, all the speakers could not pass the system while everybody could pass if set to 2800. The minimum correlation coefficient to allow all to pass was 0.82 while the coefficient of 0.95 rejected all. Thus, both threshold levels can be adjusted to the need of speaker identification system, which is desirable for further study.

Keywords