비음수 행렬 분해와 디코릴레이터를 이용한 모노-스테레오 블라인드 업믹스 기법

Mono-To-Stereo Blind Upmix Using Non-Negative Matrix Factorization and Decorrelator

  • 최근우 (서울대학교 전기컴퓨터공학부 음향공학연구실) ;
  • 전상배 (서울대학교 전기컴퓨터공학부 음향공학연구실) ;
  • 이석진 (서울대학교 전기컴퓨터공학부 음향공학연구실) ;
  • 성굉모 (서울대학교 전기컴퓨터공학부 음향공학연구실)
  • 투고 : 2010.09.06
  • 심사 : 2010.11.15
  • 발행 : 2010.10.30

초록

본 논문은 충분한 음원 너비 (Apparent Source Width)와 스테레오 이미지 품질 (Stereophonic Image Quality)을 확보하는 모노-스테레오 업믹스 기법을 제안한다. 모노 신호의 분석을 위해 높은 계수의 비음수 행렬 분해가 사용된다. 그 결과로\ 나온 분해된 음원들은 음조성 (Tonality)에 의하여 타악기 (Percussive)와 음조 (Tonal) 그룹으로 분류된다. 두 그룹 중 하나는 바로 스테레오 채널로 들어가는 반면 나머지 하나는 디코릴레이터를 통과하여 들어가게 된다. 청취 평가 결과 제안한 방법은 충분한 음원 너비와 스테레오 음상을 제공할 뿐만 아니라 기존의 방법에 비해 음색 변화도 감소하는 종합적으로 향상된 성능을 보여주었다.

This paper presents a new method for upmixing mono signal to stereo signal with guaranteeing high stereophonic image quality (SIQ) and large apparent source width (ASW). The proposed method consists of analysis phase and synthesis phase. In analysis phase, a mono signal is first decomposed into multiple sound sources by the use of high-rank nonnegative matrix factorization. Then the multiple sources are clustered into two groups based on tonality criterion. In synthesis phase, one group is directly fed into left and right channels while the other group is decorrelated before being fed into each channel. Subjective tests reveals that the proposed method gives listener high SIQ and large ASW with minimizing timbral distortions.

키워드

참고문헌

  1. M. R. Schroeder and B. F. Logan, "Colorless artificial reverberation," J. of AES, no. 3, pp. 192-197, 1961.
  2. D. Lee and H. Seung, "Algorithms for non-negative matrix factorization," in Proc. NIPS, 2001.
  3. C. Uhle, A. Walther, and M. Ivertowski, "Blind one-to-N upmixing," AudioMostly 2nd Conference, pp. 110-115, September, 2007.
  4. M. Lagrange, L. G. Martins, and G. Tzanetakis, "Semi-automatic mono to stereo up-mixing using sound source formation", AES 125th Convention, paper no. 7042, May, 2007.
  5. M. Helen, "Separation of drums from polyphonic music using non-negative matrix factorization and support vector machine," EURASIP 13th Conference, September, 2005.
  6. E. Benetos, "Musical instrument classification using non-negative matrix factorization algorithms and subset feature selection," in Proc. IEEE Conference on Acoustics, Speech, and Signal Processing, May, 2006.
  7. K. Brandenburg and J. D. Johnston, "Second generation perceptual audio coding : The hybrid coder," AES 88th Convention, March, 1990.
  8. M. O. J. Hawksford and N. Harris, "Diffuse signal processing and acoustic source characterization for applications in synthetic loudspeaker arrays," AES 122nd Convention, April. 2002.
  9. C. Uhle, "Ambience separation from mono recordings using non-negative matrix factorization," AES 30th Conference, March, 2007.
  10. P. Smaragdis, "Non-negative matrix factorization for polyphonic music transcription," in Proc. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, October, 2003.
  11. ITU-R (1997). Recommendation BS. 1116-1 : Recommendation BS. 1116: Methods for subjective assessment of small impairments in audio systems including multichannel sound systems, International communication union.
  12. M. Morimoto, "The role of rear loudspeaker in spatial impression", AES 103th Convention, paper no. 4554, September, 1997.
  13. F. Rumsey, "Subject assessment of the spatial attributes of reproduced sound," AES 15th Conference, October, 1998.
  14. F. Rumsey, S. ZielinCski, and R. Kassier, "On the relative importance of spatial and timbral fidelities in judgments of degraded multichannel audio quality," J. of ASA, vol. 118, Issue 2, pp. 968-976, August, 2005.
  15. F. Rumsey, "Spatial audio and sensory evaluation techniques-context, history and aims," Spatial audio and sensory evaluation techniques conference, April, 2006.
  16. ITU-R (2001). Recommendation BS. 1534-1 : Method for the subjective assessment of intermediate quality level of coding systems, International communication union.
  17. ITU-T (1996). Recommendation P.800 : Method for object and subject assessment of quality, International communication union.