DOI QR코드

DOI QR Code

An acoustical analysis of emotional speech using close-copy stylization of intonation curve

억양의 근접복사 유형화를 이용한 감정음성의 음향분석

  • 이서배 (창원대학교, 영어영문학과)
  • Received : 2014.08.18
  • Accepted : 2014.09.22
  • Published : 2014.09.30

Abstract

A close-copy stylization of intonation curve was used for an acoustical analysis of emotional speech. For the analysis, 408 utterances of five emotions (happiness, anger, fear, neutral and sadness) were processed to extract acoustical feature values. The results show that certain pitch point features (pitch point movement time and pitch point distance within a sentence) and sentence level features (pitch range of a final pitch point, pitch range of a sentence and pitch slope of a sentence) are affected by emotions. Pitch point movement time, pitch point distance within a sentence and pitch slope of a sentence show no significant difference between male and female participants. The emotions with high arousal (happiness and anger) are consistently distinguished from the emotion with low arousal (sadness) in terms of these acoustical features. Emotions with higher arousal show steeper pitch slope of a sentence. They have steeper pitch slope at the end of a sentence. They also show wider pitch range of a sentence. The acoustical analysis in this study implies the possibility that the measurement of these acoustical features can be used to cluster and identify emotions of speech.

Keywords

References

  1. Bachorowski, J.& Owren, M. (1995) Vocal expression of emotion: acoustic properties of speech are associated with emotional intensity and context. Psychological Science, 6, 219-224. https://doi.org/10.1111/j.1467-9280.1995.tb00596.x
  2. Banziger, T. & Scherer, K.R. (2005). The role of intonation in emotional expressions. Speech Communication. 46, 252-267. https://doi.org/10.1016/j.specom.2005.02.016
  3. Boersma, P. (2001). Praat, a system for doing phonetics by computer. Glot International. 5:9/10, 341-345.
  4. Burkhardt, F., Paeschke, A., Rolfes, M.,. Sendlmeier, W. and Weiss, B. (2005). A database of German emotional speech. Proceedings of Interspeech 2005, 1517-1520.
  5. Fontaine, J., Scherer, K., Roesch, E., and Ellsworth, P. (2007). The world of emotions is not two-dimensional. Psychological Science, 18(12),1050-1057. https://doi.org/10.1111/j.1467-9280.2007.02024.x
  6. Forsell, M. (2007). Acoustic correlates of perceived emotions in speech. MS Thesis, KTH, Royal Institute of Technology, Stockholm, Sweden.
  7. Goudbeek, M. & Scherer, K. (2010). Beyond arousal: valence and potency/control cues in the vocal expression of emotion. Journal of the Acoustical Society of America, 128(3),1322-36. https://doi.org/10.1121/1.3466853
  8. Jun, Sun-Ah. (1993). The phonetics and phonology of Korean prosody. Ph. D. Dissertation, The Ohio State University.
  9. Lee. Ho-Young. (1990). The structure of Korean prosody. Ph. D. Dissertation, University College London. (Published by Hanshin Publishing.)
  10. Mozziconacci, S. (2002). Prosody and emotions. Proceedings of Speech Prosody, Aix-en-Provence, 1-9.
  11. Oh, J. H. (2014a). A study of methods of standardization for Korean intonation curve. Korean Linguistics. 62, 395-420. (오재혁 (2014a). 한국어 억양 곡선의 정규화 방안에 대한 연구. 한국어학, 62, 395-420.)
  12. Oh, J. H. (2014b). A study of intonation curve slopes in Korean spontaneous speech. Phonetics and Speech Sciences. 6(1), 21-30.(오재혁 (2014b). 자유 발화 자료에서 나타나는 한국어 억양 곡선의 기울기 특성에 대한 연구. 말소리와 음성과학, 6(1), 21-30.) https://doi.org/10.13064/KSSS.2014.6.1.021
  13. Pell, M., Paulmann, M., Dara, S., Alasseri, A. and Kotzb, S. (2009) Factors in the recognition of vocally expressed emotions: a comparison of our languages. Journal of Phonetics, 37, 417-435. https://doi.org/10.1016/j.wocn.2009.07.005
  14. Rodero, E. (2011). Intonation and emotion: influence of pitch levels and contour type on creating emotions. Journal of Voice. 25(1), 25-34. https://doi.org/10.1016/j.jvoice.2010.02.002
  15. Yang, B. G. (2000). An acoustical analysis method of numeric sounds by Praat. Speech Science. 7(2), 127-137. (양병곤 (2000). Praat를 이용한 숫자음의 음향적 분석법, 음성과학, 7(2), 127-137.)
  16. Yi, S., and Kim, S. (2011). A study on low pitch accent produced in different locations in English sentences. Phonetics and Speech Sciences, 3(4), 63-70. (이서배, 김수정 (2011). 영어 문장 내 상이한 위치에 나타난 저성조 피치 액센트 연구, 말소리와 음성과학, 3(4), 63-70.)
  17. Yi, S. (2011). An analysis of formants extracted from emotional speech and acoustical implications for the emotion recognition system and speech recognition system. Phonetics and Speech Sciences, 3(1), 45-50. (이서배 (2011). 독일어 감정음성에서 추출한 포먼트의 분석 및 감정인식 시스템과 음성인식 시스템에 대한 음향적 의미, 말소리와 음성과학, 3(1), 45-50.)
  18. Williams, C. & Stevens, K. (1972) Emotions and speech: some acoustical correlates. Journal of the Acoustical Society of America, 52, 1238-1250. https://doi.org/10.1121/1.1913238

Cited by

  1. An acoustical analysis of speech of different speaking rates and genders using intonation curve stylization of English vol.6, pp.4, 2014, https://doi.org/10.13064/KSSS.2014.6.4.079
  2. Tonal Characteristics Based on Intonation Pattern of the Korean Emotion Words vol.13, pp.2, 2016, https://doi.org/10.21187/jmhb.2016.13.2.67
  3. An acoustical analysis of synchronous English speech using automatic intonation contour extraction vol.7, pp.1, 2015, https://doi.org/10.13064/KSSS.2015.7.1.097