DOI QR코드

DOI QR Code

An acoustical analysis of synchronous English speech using automatic intonation contour extraction

영어 동시발화의 자동 억양궤적 추출을 통한 음향 분석

  • 이서배 (창원대학교, 영어영문학과)
  • Received : 2015.01.31
  • Accepted : 2015.03.11
  • Published : 2015.03.31

Abstract

This research mainly focuses on intonational characteristics of synchronous English speech. Intonation contours were extracted from 1,848 utterances produced in two different speaking modes (solo vs. synchronous) by 28 (12 women and 16 men) native speakers of English. Synchronous speech is found to be slower than solo speech. Women are found to speak slower than men. The effect size of speech rate caused by different speaking modes is greater than gender differences. However, there is no interaction between the two factors (speaking modes vs. gender differences) in terms of speech rate. Analysis of pitch point features has it that synchronous speech has smaller Pt (pitch point movement time), Pr (pitch point pitch range), Ps (pitch point slope) and Pd (pitch point distance) than solo speech. There is no interaction between the two factors (speaking modes vs. gender differences) in terms of pitch point features. Analysis of sentence level features reveals that synchronous speech has smaller Sr (sentence level pitch range), Ss (sentence slope), MaxNr (normalized maximum pitch) and MinNr (normalized minimum pitch) but greater Min (minimum pitch) and Sd (sentence duration) than solo speech. It is also shown that the higher the Mid (median pitch), the MaxNr and the MinNr in solo speaking mode, the more they are reduced in synchronous speaking mode. Max, Min and Mid show greater speaker discriminability than other features.

Keywords

References

  1. Banzina, E., Hewitt, L. & Dilley, L. (2014). Using synchronous speech to facilitate the acquisition of English rhythm: A small scale study. E-JournALL, EuroAmerican Journal of Applied Linguistics and Languages, 1(1), 69-84.
  2. Boersma, P. (2001). Praat, a system for doing phonetics by computer. Glot International. 5:9/10, 341-345.
  3. Campione, E. (2001). Etiquetage prosodique semi-automatique de corpus oraux : algorithmes et methodologie. These de doctorat. Aix-en-Provence: Universite de Provence.
  4. Cummins, F. (2000). Prosodic characteristics of synchronous speech in Puppel, S. and Demenko, G., editors, Prosody 2000: Speech Recognition and Synthesis, Adam Mickiewicz University, Krakow, Poland, 45-49.
  5. Cummins, F. & Roy, D. (2001). Using synchronous speech to minimize variability in pause placement. Proceedings of the Institute of Acoustics, Stratford-upon-Avon, 23 (3), 201-206.
  6. Cummins, F. (2003). Practice and performance in speech produced synchronously. Journal of Phonetics, 31(2), 139-148. https://doi.org/10.1016/S0095-4470(02)00082-7
  7. Cummins, F., Grimaldi, M., Leonard, T. & Simko, J. (2006). The CHAINS corpus: Characterizing individual speakers. Proceedings of SPECOM 2006, 431-435, St. Petersburg, RU.
  8. Cummins, F., Li, C. & Wang, B. (2013).Coupling among speakers during synchronous speaking in English and Mandarin. Journal of Phonetics. 41(6), 432-441. https://doi.org/10.1016/j.wocn.2013.07.001
  9. Forsell, M. (2007). Acoustic correlates of perceived emotions in speech. MS Thesis, KTH, Royal Institute of Technology, Stockholm, Sweden.
  10. Fougeron, C. & Jun, S. (1998). Rate effects on French intonation: prosodic organization and phonetic realization. Journal of Phonetics, 26, 45-69. https://doi.org/10.1006/jpho.1997.0062
  11. Hirst, D., Cristo, A. & Espesser, R. (2000). Levels of representation and levels of analysis for intonation. in M. Horne (ed) Prosody : Theory and Experiment. Kluwer Academic Publishers, Dordrecht. 51-87.
  12. Hirst, D. (2007). A Praat plugin for Momel and INTSINT with improved algorithms for modelling and coding intonation. Proceedings of the XVIth International Conference of Phonetic Sciences, Saarbrucken, 1233-1236.
  13. Kim, M. & Nam, H. (2012). Speech Rate Variation in Synchronous Speech. Phonetics and Speech Sciences, 4(4), 19-27. (김미란, 남호성 (2012). 동시발화에 나타나는 발화 속도 변이분석. 말소리와 음성과학, 4(4), 19-27.) https://doi.org/10.13064/KSSS.2012.4.4.019
  14. Looze, C. & Hirst, D. (2008). Detecting changes in key and range for the automatic modelling and coding of intonation, Proceedings of Speech Prosody 2008, Campinas, Brazil, 135-138.
  15. Oh, J. H. (2014a). A Study of methods of standardization for Korean intonation curve. Korean Linguistics, 62, 395-420. (오재혁 (2014a). 한국어 억양 곡선의 정규화 방안에 대한 연구. 한국어학, 62, 395-420.)
  16. Oh, J. H. (2014b). A study of intonation curve slopes in Korean spontaneous speech. Phonetics and Speech Sciences, 6(1), 21-30. (오재혁, (2014b). 자유 발화 자료에서 나타나는 한국어 억양곡선의 기울기 특성에 대한 연구. 말소리와 음성과학, 6(1), 21- 30.) https://doi.org/10.13064/KSSS.2014.6.1.021
  17. Quene, H. (2007). On the just noticeable difference for tempo in speech. Journal of Phonetics. 35, 353-362. https://doi.org/10.1016/j.wocn.2006.09.001
  18. Yi, S., & Kim, S. (2011). A study on low pitch accent produced in different locations in English sentences. Phonetics and Speech Sciences, 3(4), 63-70. (이서배, 김수정 (2011). 영어 문장 내 상이한 위치에 나타난 저성조 피치 액센트 연구, 말소리와 음성과학, 3(4), 63-70.)
  19. Yi, S. (2014a). An acoustical analysis of emotional speech using close-copy stylization of intonation curve, Phonetics and Speech Sciences, 6(3), 131-138. (이서배, (2014a). 억양의 근접복사 유형화를 이용한 감정음성의 음향분석, 말소리와 음성과학, 6(3), 131-138.) https://doi.org/10.13064/KSSS.2014.6.3.131
  20. Yi, S. (2014b). An acoustical analysis of speech of different speaking rates and genders using intonation curve stylization of English, Phonetics and Speech Sciences, 6(4), 79-90. (이서배, (2014b). 영어의 억양 유형화를 이용한 발화 속도와 녀 화자에 따른 음향 분석, 말소리와 음성과학, 6(4), 79-90.) https://doi.org/10.13064/KSSS.2014.6.4.079