Developing a Korean standard speech DB (II)

한국인 표준 음성 DB 구축(II)

  • Received : 2017.03.08
  • Accepted : 2017.05.14
  • Published : 2017.06.30

Abstract

This paper reports the whole process of developing the Korean Standard Speech Database (KSS DB). The project was supported by a research grant from the Supreme Prosecutors' Office (SPO) for three years, from 2014 to 2016. KSS DB is designed to provide speech data for acoustic-phonetic and phonological studies and for speaker recognition systems. So that the samples would represent spoken Korean, sociolinguistic factors such as region (9 regional dialects), age (5 age groups over 20) and gender (male and female) were considered. The goal of the project was to collect speech from over 3,000 male and female speakers across the nine regional dialects and five age groups, employing direct and indirect methods. Speech samples from 3,191 speakers (2,829 collected by the direct method and 362 by the indirect method) were collected and entered into the database. KSS DB is designed to collect read and spontaneous speech samples from each speaker through 5 speech tasks: three (pseudo-)spontaneous speech tasks (producing prolonged simple vowels, 28 blanked sentences and spontaneous talk) and two read speech tasks (reading 55 phonetically and phonologically rich sentences and reading three short passages). For each speech task, KSS DB includes a 16-bit, 44.1 kHz speech waveform file and an orthographic file.
