• Title/Summary/Keyword: Utterance

Search Result 382, Processing Time 0.047 seconds

A Study on the Declination According to Length of Utterance, Clause Boundary and Focus in Korean (한국어의 발화 길이 및 절 경계와 초점에 의한 점진하강(declination) 연구)

  • Kwak, Sook-Young
    • Phonetics and Speech Sciences
    • /
    • v.2 no.3
    • /
    • pp.11-22
    • /
    • 2010
  • The present study attempts to investigate declination in Korean and its relevant aspects to the length of utterance, the clause boundary, and focus. More specifically, I examine the relation of declination with the length of utterance, the declination reset at the clause boundary, and the effect of focus on declination. Results showed that the length of utterance had no relation with the first and last pitch values of the utterance but that they were consistent regardless of the length of utterance. However, the declination slope changed to be relatively gentle from the fourth accentual phrase to the end of the whole intonational phrase. There was a reset of declination in such a way that the first pitch in the second phrase was always lower than that of the first phrase, but the first pitch in the third phrase was not always lower than that of the second phrase when the whole utterance was composed of three phrases. Finally, the pitch values of the focusing words decreased as their position went back in a sentence. One declination line was formed in the case of focused utterance, but in the case of an utterance that contained a clause boundary, a new declination line was formed at the start of each new clause. These findings can be applied to developing a Korean speech synthesizer that contains natural prosody; they can be also utilized for teaching Korean prosody.

  • PDF

Effects of stuttering severity on articulation rate in fluent and dysfluent utterances of preschool children who stutter (취학 전 말더듬 아동의 말더듬 중증도에 따른 발화 형태 별 조음속도 비교)

  • Chon, HeeCheong;Lee, SooBok
    • Phonetics and Speech Sciences
    • /
    • v.8 no.3
    • /
    • pp.79-90
    • /
    • 2016
  • The purpose of this study was to investigate the effects of stuttering severity on articulation rate measured from different types of utterances in preschool children who stutter. Participants were 40 boys who stutter (CWS) and age-matched 10 boys who do not stutter (CWNS). CWS were sub-grouped based on the severity of their stuttering: 15 mild, 13 moderate, and 12 severe. Utterances were categorized as "overall utterance" including all utterances that children spoke and "fluent utterance" which did not contain any disfluencies. Utterances containing abnormal disfluencies were categorized as "SLD utterance" for CWS. The results revealed no significant difference among groups in any type of utterance. There were significant positive correlations in articulation rates between utterance types. Stuttering severity was not a factor for characterizing the articulation rate of each type of utterance. Also, current findings suggest that articulation rate may not predict speech motor control ability in preschool CWS.

The Role of Prosody in Dialect Synthesis and Authentication

  • Yoon, Kyu-Chul
    • Phonetics and Speech Sciences
    • /
    • v.1 no.1
    • /
    • pp.25-31
    • /
    • 2009
  • The purpose of this paper is to examine the viability of synthesizing Masan dialect with Seoul dialect and to examine the role of prosody in the authentication of the synthesized Masan dialect. The synthesis was performed by transferring one or more of the prosodic features of the Masan utterance onto the Seoul utterance. The hypothesis is that, given an utterance composed of the phonemes shared by both dialects, as more prosodic features of the Masan utterance are transferred onto the Seoul utterance, the Seoul utterance will be identified as more authentic Masan utterance. The prosodic features involved were the fundamental frequency contour, the segmental durations, and the intensity contour. The synthesized Masan utterances were evaluated by thirteen native speakers of Masan dialect. The result showed that the fundamental frequency contour and the segmental durations had main effects on the perceptual shift from Seoul to Masan dialect.

  • PDF

The Comparisons of GRBAS Perceptual Judgments according to Levels of Utterances

  • Pyo, Hwa-Young;Sim, Hyun-Sub
    • Speech Sciences
    • /
    • v.8 no.1
    • /
    • pp.135-142
    • /
    • 2001
  • The present study was performed to investigate adequate levels of utterances which can give essential as well as useful information about the patients' voice, by examining the degrees of correlation between the levels of utterances (vowels, words, and phrase paragraph reading) and the entire utterance including all of the levels. For this purpose, a total of 10 individual utterance samples (5 vowels, 3 words, 1 phrase, 1 paragraph reading) were collected from each of the 30 subjects with voice disorder patients, and four experienced voice therapists evaluated them using GRBAS. The results showed that four therapists highly agreed upon on 'G' parameter. The coefficient of the correlation between each level of utterance and entire utterance tended to be above 0.70. Judgements of the vowel /$\varepsilon$/ as well as /o/ highly correlated with the judgement of the entire utterance. Regardless of severity, the judgement of the entire utterance highly correlated with the judgements of the vowel /u/ and the paragraph reading. These results suggest that experienced voice therapists can precisely evaluate patients' voice quality with only one sustained vowel in the clinic field, as is done with the entire utterance evaluation.

  • PDF

The influence of utterance length on speech rate in spontaneous speech (자연발화 음성 코퍼스에서 발화 속도에 대한 발화 길이의 영향)

  • Kim, Jungsun
    • Phonetics and Speech Sciences
    • /
    • v.9 no.1
    • /
    • pp.9-17
    • /
    • 2017
  • The current study examined speech rate and its variance in spontaneous Seoul Korean speech. The current study focused on factors affecting the variance of speech rate such as utterance length, individual speakers, and gender. The results revealed that, first, utterance length has a significant influence on speech rate. Longer utterances were spoken at a faster rate. Second, regarding the effect of utterance length, individual speakers differed significantly in their speaking rate. The variation between speakers and within speakers tended to increase as utterance length increases. Third, there were speakers' gender differences, indicating that males produced considerably faster speaking rate than females. Additionally, the current study implied that non-linguistic factors in spontaneous speech can affect the variance of speakers' speaking rate.

An Utterance Verification using Vowel String (모음 열을 이용한 발화 검증)

  • 유일수;노용완;홍광석
    • Proceedings of the Korea Institute of Convergence Signal Processing
    • /
    • 2003.06a
    • /
    • pp.46-49
    • /
    • 2003
  • The use of confidence measures for word/utterance verification has become art essential component of any speech input application. Confidence measures have applications to a number of problems such as rejection of incorrect hypotheses, speaker adaptation, or adaptive modification of the hypothesis score during search in continuous speech recognition. In this paper, we present a new utterance verification method using vowel string. Using subword HMMs of VCCV unit, we create anti-models which include vowel string in hypothesis words. The experiment results show that the utterance verification rate of the proposed method is about 79.5%.

  • PDF

A Comparative Study on the Speech Rate of Advanced Korean(L2) Learners and Korean Native Speakers in Conversational Speech (자유 대화에서의 한국어 원어민 화자와 한국어 고급 학습자들의 발화 속도 비교)

  • Hong, Minkyoung
    • Journal of Korean language education
    • /
    • v.29 no.3
    • /
    • pp.345-363
    • /
    • 2018
  • The purpose of this study is to compare the speech rate of advanced Korean(L2) learners and Korean native speakers in spontaneous utterances. Specifically, the current study investigated the difference of the two groups' speech pattern according to utterance length. Eight advanced Korean(L2) learners and eight Korean native speakers participated in this study. The data were collected by recording their conversation and physical measurements (speaking rate, articulatory rates, pause and several types of speech disfluency) were taken on extracted 120 utterances from 12 out of the 16 participants. The findings show that advanced Korean learners' speech pattern is similar to that of Koreans in the short-length utterance. However, in the long-length utterance, two groups show different speech patterns; while the articulatory rate of Korean native speakers increased in the long-length utterance, that of Korean learners decreased. This suggests that the frequency of speech disfluency factors might affect this result.

A Study on DNN-based STT Error Correction

  • Jong-Eon Lee
    • International journal of advanced smart convergence
    • /
    • v.12 no.4
    • /
    • pp.171-176
    • /
    • 2023
  • This study is about a speech recognition error correction system designed to detect and correct speech recognition errors before natural language processing to increase the success rate of intent analysis in natural language processing with optimal efficiency in various service domains. An encoder is constructed to embedded the correct speech token and one or more error speech tokens corresponding to the correct speech token so that they are all located in a dense vector space for each correct token with similar vector values. One or more utterance tokens within a preset Manhattan distance based on the correct utterance token in the dense vector space for each embedded correct utterance token are detected through an error detector, and the correct answer closest to the detected error utterance token is based on the Manhattan distance. Errors are corrected by extracting the utterance token as the correct answer.

Implement of Semi-automatic Labeling Using Transcripts Text (전사텍스트를 이용한 반자동 레이블링 구현)

  • Won, Dong-Jin;Chang, Moon-soo;Kang, Sun-Mee
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.25 no.6
    • /
    • pp.585-591
    • /
    • 2015
  • In transcription for spoken language research, labeling is a work linking text-represented utterance to recorded speech. Most existing labeling tools have been working manually. Semi-automatic labeling we are proposing consists of automation module and manual adjustment module. Automation module extracts voice boundaries utilizing G.Saha's algorithm, and predicts utterance boundaries using the number and length of utterance which established utterance text. For maintaining existing manual tool's accuracy, we provide manual adjustment user interface revising the auto-labeling utterance boundaries. The implemented tool of our semi-automatic algorithm speed up to 27% than existing manual labeling tools.

Geophysics of Vowel Space in Bahasa Malaysia and Bahasa Indonesia (말레이시아어와 인도네시아어 모음 공간의 지형도)

  • Park Han-Sang;Park Jeong-Sook;Chun Tai-Hyun
    • MALSORI
    • /
    • no.58
    • /
    • pp.19-34
    • /
    • 2006
  • The present study investigates the vowels of Bahasa Malaysia and Bahasa Indonesia in terms of the first two formant frequencies and provides a three-dimensional formant chart of vowels by plotting F1, F2, and the frequency of datapoints on 4 different scales: Hz, mel, bark, and the number of ERB. For this study, we recruited 30 male native speakers of Bahasa Malaysia and Bahasa Indonesia (15 each) which include 6 vowels (i, e, a, o, u, e) in various contexts. The three-dimensional formant chart showed geophysics of vowel space, such that mountain peaks stand in particular locations with a higher frequency of occurrence of datapoints. The geophysics of vowel space may shed lights on the perceptual structure of vowel space. The results also showed that vowels in utterance-final positions have a significantly higher F1 and a significantly lower F2 than those in utterance-medial or utterance-initial positions, which means that vowels in utterance-final positions are more back and lower in vowel space than those in utterance-medial or utterance-initial positions.

  • PDF