A Temporal Decomposition Method Based on a Rate-distortion Criterion

Rate-Distortion-Based Temporal Decomposition of Speech Signals

  • Ki-Seung Lee (Department of Electronic Engineering, College of Information and Communication, Konkuk University)
  • Published : 2002.04.01

Abstract

In this paper, a new temporal decomposition method for speech signals is proposed that takes into consideration not only spectral distortion but also bit rate. The interpolation functions, which are among the parameters required for temporal decomposition, are obtained from a training speech corpus. Since the interval between two targets uniquely defines the interpolation function, the interpolation function can be represented without additional information. The locations of the targets are determined by minimizing the bit rate while keeping the maximum spectral distortion below a given threshold. The proposed method has been applied to compressing the LSP coefficients, which are widely used as spectral parameters in speech coders. Simulation results show that an average spectral distortion of about 1.4 dB can be achieved at an average bit rate of about 8 bits/frame.
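
The target-selection step described in the abstract can be illustrated with a small sketch. This is not the paper's algorithm. It assumes linear interpolation weights in place of the trained interpolation functions, Euclidean distance in place of spectral distortion, and the number of targets as a stand-in for bit rate. The names `interpolate`, `max_distortion`, `choose_targets`, and `max_gap` are hypothetical.

```python
from typing import List, Sequence

Frame = Sequence[float]


def interpolate(a: Frame, b: Frame, length: int, k: int) -> List[float]:
    """Estimate the frame k steps after target a, with target b `length` steps away.

    Assumption: linear weights stand in for the trained interpolation
    functions. As in the abstract, the weights depend only on the interval.
    """
    w = k / length
    return [(1.0 - w) * x + w * y for x, y in zip(a, b)]


def max_distortion(frames: Sequence[Frame], i: int, j: int) -> float:
    """Largest Euclidean error when the frames between targets i and j are interpolated."""
    worst = 0.0
    for k in range(1, j - i):
        est = interpolate(frames[i], frames[j], j - i, k)
        err = sum((e - f) ** 2 for e, f in zip(est, frames[i + k])) ** 0.5
        worst = max(worst, err)
    return worst


def choose_targets(frames: Sequence[Frame], threshold: float, max_gap: int = 8) -> List[int]:
    """Pick the fewest target frames such that every interpolated frame
    stays within `threshold` (dynamic programming over frame indices)."""
    n = len(frames)
    if n == 0:
        return []
    inf = float("inf")
    cost = [inf] * n  # cost[j]: fewest targets covering frames 0..j, with j a target
    prev = [-1] * n   # prev[j]: the target chosen just before j
    cost[0] = 1
    for j in range(1, n):
        for i in range(max(0, j - max_gap), j):
            if cost[i] + 1 < cost[j] and max_distortion(frames, i, j) <= threshold:
                cost[j] = cost[i] + 1
                prev[j] = i
    # Walk back from the last frame, which is always a target.
    targets = []
    j = n - 1
    while j != -1:
        targets.append(j)
        j = prev[j]
    return targets[::-1]
```

For a one-dimensional track that rises from 0 to 3 and falls back to 0 over seven frames, a tight threshold keeps the first frame, the peak, and the last frame as targets, while a loose threshold keeps only the two end frames. Fewer targets mean fewer parameters to transmit, which is the trade-off the threshold controls.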
