Advanced SearchSearch Tips
Implement of Semi-automatic Labeling Using Transcripts Text
facebook(new window)  Pirnt(new window) E-mail(new window) Excel Download
 Title & Authors
Implement of Semi-automatic Labeling Using Transcripts Text
Won, Dong-Jin; Chang, Moon-soo; Kang, Sun-Mee;
  PDF(new window)
In transcription for spoken language research, labeling is a work linking text-represented utterance to recorded speech. Most existing labeling tools have been working manually. Semi-automatic labeling we are proposing consists of automation module and manual adjustment module. Automation module extracts voice boundaries utilizing G.Saha's algorithm, and predicts utterance boundaries using the number and length of utterance which established utterance text. For maintaining existing manual tool's accuracy, we provide manual adjustment user interface revising the auto-labeling utterance boundaries. The implemented tool of our semi-automatic algorithm speed up to 27% than existing manual labeling tools.
Transcription;Labeling;Utterance;Spoken Language;User Interface;
 Cited by
TalkBank, "TalkBank Transcript Browser," Available:, [Accessed: Feb 2, 2015].

CHILDES, "CHILDES Transcript Browser," Available:, [Accessed: Feb 2, 2015].

Bigi, Brigitte, "SPPAS: a tool for the phonetic segmentations of Speech," The eighth international conference on Language Resources and Evaluation, vol. 8, pp. 1748-1755, 2012.

Sharmistha S. Gray, et al., "Child Automatic Speech Recognition for US English: Child Interaction with Living-Room-Electronic-Devices," WOCCI 2014, poster session, 2014.

Jiyoung Shin et al., "Developing a Korean Standard Speech DB," Journal of the Korean society of speech sciences, vol. 7, no. 1, pp. 139-150, 2015.

CHILDES, "Using CLAN," Available:, [Accessed: Feb 2, 2015].

Claude Barras, et al., "Transcriber: a free tool for segmenting, labeling and transcribing speech." First international conference on language resources and evaluation (LREC). pp. 1373-1376, 1998.

Boersma, P. and Weenink, D., "Praat: doing phonetics by computer," Available:, 2009, [Accessed: Feb 2, 2015].

Jongmo Sung and Hyung Soon Kim, "Implemen- tation of the Automatic Speech Segmentation and Labeling System," The Journal of The Acoustical Society of Korea, vol. 16, no. 5, pp. 50-59, 1997.

Kang-Chun So, "A Study on the Method of Computational Processing of Dialectal Sound Data," The Society of Korean Language and Literature, vol. 142, pp. 7-30, 2006.

Sun-dong Kwak and Moon-soo Chang, "CosmoScriBe 2.0 : The development of Korean transcription tools," Journal of Korean Institute of Intelligent Systems, vol. 24, no. 3, pp. 323-329, 2014. crossref(new window)

G. Saha, Sandipan Chakroborty, and Suman Senapati, "A new silence removal and endpoint detection algorithm for speech and speaker recognition applications," Proceedings of the 11th National Conference on Communications (NCC), pp. 291-295, 2005.

Dong-jin Won and Moon-soo Chang, "An Improvement of Audio controller in Transcription Tool," Proceedings of KIIS Spring Conference, Vol. 22, no. 2, pp. 121-122, 2012.

Donald A. Norman, The Design of Everyday Things, Basic Books, 2002.

Tekla S. Perry, and John Voelcker, "Of mice and menus: designing the user-friendly interface," IEEE Spectrum, vol. 26, no. 9, pp. 46-51, 1989.