DOI QR코드

DOI QR Code

Detection of Music Mood for Context-aware Music Recommendation

상황인지 음악추천을 위한 음악 분위기 검출

  • Received : 2009.10.12
  • Accepted : 2010.06.07
  • Published : 2010.08.31

Abstract

To provide context-aware music recommendation service, first of all, we need to catch music mood that a user prefers depending on his situation or context. Among various music characteristics, music mood has a close relation with people‘s emotion. Based on this relationship, some researchers have studied on music mood detection, where they manually select a representative segment of music and classify its mood. Although such approaches show good performance on music mood classification, it's difficult to apply them to new music due to the manual intervention. Moreover, it is more difficult to detect music mood because the mood usually varies with time. To cope with these problems, this paper presents an automatic method to classify the music mood. First, a whole music is segmented into several groups that have similar characteristics by structural information. Then, the mood of each segments is detected, where each individual's preference on mood is modelled by regression based on Thayer's two-dimensional mood model. Experimental results show that the proposed method achieves 80% or higher accuracy.

상황인지 음악추천 서비스를 제공하기 위해서는 무엇보다 상황 또는 문맥에 따라 사용자가 선호하는 음악의 분위기를 파악할 필요가 있다. 음악 분위기 검출에 대한 기존 연구의 대부분은 수작업으로 대표구간을 선정하고, 그 구간의 특징을 이용하여 분위기를 판별한다. 이러한 접근 방법은 분류 성능이 좋은 반면 전문가의 간섭을 요구하기 때문에 새로운 음악에 대해서는 적용하기 어렵다. 더욱이, 곡의 진행에 따라 음악 분위기가 달라지기 때문에 음악의 대표 분위기를 검출하는 것이 더욱 어려워진다. 본 논문에서는 이러한 문제점들을 보완하기 위해 음악 분위기를 자동으로 판별하는 새로운 방법을 제안하였다. 먼저 곡 전체를 구조적 분석 방법을 통하여 비슷한 특성을 갖는 세그먼트들로 분리한 후 각각에 대해 분위기를 판별한다. 그리고 세그먼트별 분위기 파악 시 Thayer 의 2차원 분위기 모델에 기초한 회귀분석 방법으로 개인별 주관적 분위기 성향을 모델링하였다. 실험결과, 제안된 방법이 80% 이상의 정확도를 보였다.

Keywords

References

  1. T. Li and M. Ogihara, "Detecting Emotion in Music," Proc. of the International Symposium on Music Information Retrieval(ISMIR), pp.239-240, Washington D.C., USA, 2003.
  2. L. Lu, D. Liu and H. Zhang, "Automatic Mood Detection and Tracking of Music Audio Signals," IEEE Trans. on Audio, Speech, and Language Processing, Vol..14, pp.5-18, 2006. https://doi.org/10.1109/TSA.2005.860344
  3. Y. Feng, Y. Zhang and Y. Pan, "Popular Music Retrieval by Detecting Mood," Proc. of ACM SIGIR 2003, pp.375-376, 2003.
  4. Y.H. Yang, C.C. Liu and H.H. Chen, "Music Emotion Classification: a Fuzzy Approach," Proc. of ACM Multimedia 2006 (ACM MM'06), pp.81-84, Santa Barbara, CA, USA, 2006.
  5. R.E. Thayer, "The Biopsychology of Mood and Arousal", Oxford University Press, 1989.
  6. H. Katayose, M. Imai and S. Inokuchi, "Sentiment Extraction in Music," Proc. of International Conference Pattern Recognition, Vol.2, pp.1083-1087, 1998.
  7. D. Liu, N. Zhang and H. Zhu, "Form and Mood Recognition of Johann Strauss's Waltz Centos," Chinese Journal of Electronics, Vol.12, Part.4, pp.587-593, 2003.
  8. D. Hevner, "Experimental Studies of the Elements of Expression in Music," American Journal of Psychology, Vol.48, pp.246-268, 1936. https://doi.org/10.2307/1415746
  9. P.R. Farnsworth, "The Social Psychology of Music", The Dryden Press, 1958.
  10. T. Li and M. Ogihara, "Content-based Music Similarity Search and Emotion Detection," Proc. of ICASSP '04, Vol.5, pp.705-708, 2004.
  11. Y.H. Yang, Y.F. Su, Y.C. Lin and H.H. Chen, "Music Emotion Recognition: the Role of Individuality," Proc. of ACM SIGMM International Workshop on Human-centered Multimedia 2007, pp.13-21, Augsburg, Germany, 2007.
  12. Y.H. Yang, C.C. Liu and H.H. Chen, "A Regression Approach to Music Emotion Recognition," IEEE Trans. on Audio, Speech, and Language Processing, Vol.16, pp.448-457, 2008. https://doi.org/10.1109/TASL.2007.911513
  13. G. Tzanetakis and P. Cook, "Musical Genre Classification of Audio Signals," IEEE Trans. on Speech and Audio Processing, Vol.10, No.5, pp.293-302, 2002. https://doi.org/10.1109/TSA.2002.800560
  14. J. J. Burred and A. Lerch, "A Hierarchical Approach to Automatic Musical Genre Classification," Proc. of the 6th International Conference on Digital Audio Effects (DAFx-03), 2003.
  15. J. J. Burred and A. Lerch, "Hierarchical Automatic Audio Signal Classification," Journal of the Audio Engineering Society, Vol.52, No.7/8, pp.357-365, 2004.
  16. D. Jiang, L. Lu, H. Zhang, J. Tao and L. Cai, "Music Type Classification by Spectral Contrast Feature," Proc. of ICME `02, Vol.1, pp.113-116, 2002.
  17. T. Tolonen and M. Karjalainen, "A Computationally Efficient Multipitch Analysis Model," IEEE Trans. on Speech Audio Processing, Vol.8, pp.708-716, Nov. 2000. https://doi.org/10.1109/89.876309
  18. T. Li, M. Ogihara and Q. Li, "A Comparative Study on Content-based Music Genre Classification," Proc. of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp.282-289, 2003.
  19. Y. K. Kim and Y. Brian, "Singer Identification in Popular Music Recordings Using Voice Coding Features," Proc. of International Conference on Music Information Retrieval, 2002.
  20. T. Zhang, "Automatic Singer Identification," Proc. of IEEE International Conference on Multimedia and Expo, IEEE CS Press, 2003.
  21. X. Shao, N.C. Maddage, C. Xu and M.S. Kankanhalli, "Automatic Music Summarization Based on Music Structure Analysis," Proc. of ICASSP'05, Vol.2, pp.1169-1172, 2005.
  22. Y. Shiu, H. Jeong and C.-C. J. Kuo, "Musical Structure Analysis Using Similarity Matrix and Dynamic Programming," Proc. of SPIE, Multimedia systems and applications, Vol.3, pp.398-409, 2005.
  23. J. Paulus and A. Klapuri, "Music Structure Analysis by Finding Repeated Parts," Proc. of ACM AMCMM'06, pp.59-67, 2006.
  24. M. Goto, "SmartMusicKIOSK: Music Listening station with Chorus-Search Function," Proc. of the 16th annual ACM symposium on User Interface Software and Technology, pp.31-40, 2003.
  25. S. Abdallah, K. Nolad, M. Sandler, M. Casey and C. Rhodes, "Theory and Evaluation of a Bayesian Music Structure Extractor," Proc. of 6th International Conference on Music Information Retrieval London, UK, Sept. 2005.
  26. M. Levy, M. Sandier and M. Casey, "Extraction of High-Level Musical Structure From Audio Data and Its Application to Thumbnail Generation," Proc. of ICASSP'06, Vol.5, pp.13-16, Toulouse, France, May 2006.
  27. M. Levy, M. Sandier and M. Casey, "Structural Segmentation of Musical Audio by Constrained Clustering," IEEE Trans. on Audio, Speech, and Language Processing, Vol.16, pp.318-326, 2008. https://doi.org/10.1109/TASL.2007.910781
  28. L. Lu and H. Zhang, "Automated Extraction of Music Snippets," Proc. of the 11'th ACM International Conference on Multimedia, pp.140-147, 2003.
  29. T. Zhang and R. Samadani, "Automatic Generation of Music Thumbnails," Proc. of IEEE International Conference on Multimedia and Expo, pp.228-231, 2007.
  30. G. Peeters, "Deriving Musical Structure from Signal Analysis for Music Audio Summary Generation: "Sequence" and "State" Approach," In Lecture Notes in Computer Science, Vol.2771, pp.143-166. Springer-Verlag, 2004. https://doi.org/10.1007/978-3-540-39900-1_14