• Title/Summary/Keyword: Effect of morphemes


Effects of orthographic and morphological frequency of a syllable in Korean word recognition (한국어 음절의 표기빈도와 형태소빈도가 단어인지에 미치는 효과)

  • Yi, Kwang-Oh;Bae, Sung-Bong
    • Korean Journal of Cognitive Science / v.20 no.3 / pp.309-333 / 2009
  • Two experiments were conducted to examine the role of Kulja and morpheme in processing two-syllable Sino-Korean words. In Experiment 1, the effects of morphemic frequency were not significant at the initial and final positions of a word, while Kulja frequency and Kulja-morpheme correspondence at both positions in a word had a significant impact on the processing of nonwords. Lexical decision times were longer for nonwords with high-frequency Kulja and for nonwords with ambiguous Kulja-morpheme correspondence, whose Kulja can go with many different morphemes. In Experiment 2, Kulja-morpheme correspondence was examined for words as well as nonwords. Lexical decisions were slower for stimuli with ambiguous Kulja-morpheme correspondence. The effect was more stable for nonwords, which replicated the result of Experiment 1. In sum, the results of this study suggest that words with ambiguous Kulja-morpheme correspondence activate many different morphemes, and that competition among these morphemic candidates slows down the lexical selection process. Kulja frequency, Kulja neighborhood, morphemic frequency, morphological neighborhood, and Kulja-morpheme correspondence in Korean word recognition are also discussed.


The Syllable Frequency Effect in Semantic Categorization Tasks in Korean

  • Kim, Ji-Hye;Kwon, You-An;Nam, Ki-Chun
    • KSII Transactions on Internet and Information Systems (TIIS) / v.5 no.10 / pp.1879-1890 / 2011
  • Previous studies of syllable frequency effects have proposed that the inhibitory effect of high first-syllable frequency is the product of competition among activated lexical candidates at the lexical level. However, these studies have primarily used lexical decision tasks to examine the nature of syllable frequency effects. This study investigates whether a syllable frequency effect can arise in semantic categorization tasks and whether phonologically or orthographically defined syllables interact with semantically related variables such as morphological family size. If the syllable frequency effect is created by activation and competition at the lexical level, it is highly likely that the effect also appears in semantic categorization tasks. To test this hypothesis, we conducted two experiments. In Experiment 1, morphological family size and phonological syllable frequency were factorially manipulated. In Experiment 2, morphological family size and orthographic syllable frequency were factorially manipulated. The results demonstrate that morphemes are related to orthographic syllables but not to phonological syllables. This suggests that phonological syllables and orthographic syllables play different roles in the syllable frequency effect in the visual word recognition process.

Grammatical morphemes' effect on Korean word vector generation (형식형태소가 한국어 단어 벡터 생성에 미치는 영향)

  • Youn, Junyoung;Kim, Dowon;Min, Tae Hong;Lee, Jae Sung
    • Annual Conference on Human and Language Technology / 2017.10a / pp.179-183 / 2017
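  • Word vectors not only make it possible to express relations between words through vector operations, but are also widely used as pre-training data for higher-level neural network programs. Because Korean eojeols (space-delimited word units) carry productive particles and endings, efficient word vector generation is difficult, so Korean word vectors are usually built from lexical morphemes only. This paper proposes a method that uses both lexical and grammatical morphemes, classifying the grammatical morphemes appropriately to improve word vector performance. In an evaluation of extraction performance on a word relation test set built by the authors, using grammatical morphemes in the proposed way improved performance.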

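The idea of expressing word relations through vector operations, mentioned in the abstract above, can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the two-dimensional vectors are hand-picked toy values rather than trained embeddings, the English words stand in for Korean morphemes, and `analogy` is a hypothetical helper, not the authors' evaluation code.

```python
import math

# Toy 2-D vectors chosen by hand for illustration; not trained embeddings.
VECTORS = {
    "man": (1.0, 0.0),
    "woman": (1.0, 1.0),
    "king": (3.0, 0.0),
    "queen": (3.0, 1.0),
    "apple": (0.0, 3.0),
}

def cosine(u, v):
    """Cosine similarity of two 2-D vectors (0.0 if either is the zero vector)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.hypot(*u) * math.hypot(*v)
    return dot / norm if norm else 0.0

def analogy(a, b, c, vectors=VECTORS):
    """Answer 'a is to b as c is to ?' using vec(b) - vec(a) + vec(c)."""
    target = tuple(y - x + z for x, y, z in zip(vectors[a], vectors[b], vectors[c]))
    # Exclude the three query words, then pick the nearest remaining word.
    candidates = (w for w in vectors if w not in (a, b, c))
    return max(candidates, key=lambda w: cosine(vectors[w], target))
```

A word relation test set of the kind the abstract describes would consist of many such (a, b, c, expected) tuples, scored by how often the nearest word matches the expected one.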

Predictive Morphological Analysis of Korean with Dynamic Programming (동적 프로그래밍기법에 근거한 예측중심의 한국어 형태소 분석)

  • 김덕봉;최기선
    • Korean Journal of Cognitive Science / v.4 no.2 / pp.145-180 / 1994
  • In this paper, we present an efficient morphological analysis model for Korean that produces, from an input word, all the feasible sequences of morphemes in the word. This model is deterministic in applying spelling rules and has few redundant computations in processing complex and ambiguous words. This is the effect of three new techniques: first, a new method for interpreting spelling rules; second, predictive rule application, which restricts processing to the spelling rules suitable for the input word; third, the use of dynamic programming, which enables the analyzer to avoid recomputing analyzed substrings when the input word is morphologically ambiguous. Our model was tested on 413,975 words randomly selected from a corpus of Korean elementary textbooks. Experimental results show that our model guarantees fast and reliable processing.
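The dynamic-programming point in the abstract above (do not re-analyze a substring that has already been analyzed) can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' model: the toy lexicon and the function name `segmentations` are hypothetical, the input is a romanized string, and spelling rules and predictive rule application are left out entirely.

```python
from functools import lru_cache

# Hypothetical toy lexicon (romanized strings); not taken from the paper.
LEXICON = frozenset({"ga", "gam", "gi", "mgi", "gamgi", "neun"})

def segmentations(word, lexicon=LEXICON):
    """Return every way to split `word` into a sequence of lexicon entries."""

    @lru_cache(maxsize=None)
    def analyze(start):
        # All analyses of word[start:]. The cache ensures each suffix is
        # analyzed once, even when several prefixes lead to the same position.
        if start == len(word):
            return ((),)
        results = []
        for end in range(start + 1, len(word) + 1):
            piece = word[start:end]
            if piece in lexicon:
                for rest in analyze(end):
                    results.append((piece,) + rest)
        return tuple(results)

    return [list(seq) for seq in analyze(0)]
```

For the ambiguous toy input `"gamgineun"`, three different prefixes all reach position 5, so the suffix `"neun"` is analyzed only once and reused for each of the three resulting segmentations.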

The exploration of the effects of word frequency and word length on Korean word recognition (한국어 단어재인에 있어서 빈도와 길이 효과 탐색)

  • Lee, Changhwan;Lee, Yoonhyoung;Kim, Tae Hoon
    • Journal of the Korea Academia-Industrial cooperation Society / v.17 no.1 / pp.54-61 / 2016
  • Because the word is the basic unit of language processing, studies of word recognition and of the variables that contribute to it are very important. Word frequency and word length are recognized as important factors in word recognition. This study examined the effects of these two variables on Korean word recognition. In Experiment 1, two types of Hangul words, pure Hangul words and Hangul words with Hanja counterparts, were used to explore frequency effects. A frequency effect was not observed for Hangul words with Hanja counterparts. In Experiment 2, word length was manipulated to determine whether a word length effect appears in Hangul words. Contrary to expectation, one-syllable words were processed more slowly than two-syllable words. Possible explanations for these results and future research directions are discussed.

A Comparison of Hospice Care Research Topics between Korea and Other Countries Using Text Network Analysis (텍스트네트워크분석을 활용한 국내·외 호스피스 간호 연구 주제의 비교 분석)

  • Park, Eun-Jun;Kim, Youngji;Park, Chan Sook
    • Journal of Korean Academy of Nursing / v.47 no.5 / pp.600-612 / 2017
  • Purpose: This study aimed to identify and compare hospice care research topics between Korean and international nursing studies using text network analysis. Methods: The study was conducted in four steps: 1) collecting abstracts of relevant journal articles, 2) extracting and cleaning keywords (semantic morphemes) from the abstracts, 3) developing co-occurrence matrices and text networks of keywords, and 4) analyzing network-related measures including degree centrality, closeness centrality, betweenness centrality, and clustering using the NetMiner program. Abstracts from 347 Korean and 1,926 international studies for the period 1998-2016 were analyzed. Results: Six of the most important core keywords ("hospice," "patient," "death," "RNs," "care," and "family") were common to Korean and international studies, whereas "cancer" in Korean studies and "palliative care" in international studies ranked more highly. Keywords such as "attitude," "spirituality," "life," "effect," and "meaning" for Korean studies and "communication," "treatment," "USA," and "doctor" for international studies uniquely emerged as core keywords in recent studies (2011-2016). Five subtopic groups each were identified from Korean and international studies. Two common subtopics were "hospice palliative care and volunteers" and "cancer patients." Conclusion: For a better quality of hospice care in Korea, it is recommended that nursing researchers focus on study topics of patients with non-cancer disease, children and family, communication, and pain and symptom management.
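Steps 3 and 4 of the method above (building keyword co-occurrence counts and computing degree centrality) can be sketched in plain Python. This is only an illustration under assumptions: the study used the NetMiner program, whose interface is not shown here, the keyword lists in the usage note are invented toy data, and the function names are hypothetical. Degree centrality is computed in its common normalized form, the number of distinct neighbors divided by the number of nodes minus one.

```python
from collections import Counter
from itertools import combinations

def cooccurrence(docs):
    """Count, for each unordered keyword pair, the documents containing both."""
    counts = Counter()
    for keywords in docs:
        # Deduplicate and sort so each pair has one canonical (a, b) order.
        for a, b in combinations(sorted(set(keywords)), 2):
            counts[(a, b)] += 1
    return counts

def degree_centrality(counts):
    """Normalized degree: distinct neighbors / (number of nodes - 1)."""
    neighbors = {}
    for a, b in counts:
        neighbors.setdefault(a, set()).add(b)
        neighbors.setdefault(b, set()).add(a)
    n = len(neighbors)
    if n < 2:
        return {node: 0.0 for node in neighbors}
    return {node: len(adj) / (n - 1) for node, adj in neighbors.items()}
```

With toy input such as `[["hospice", "patient", "care"], ["hospice", "family"], ["patient", "care", "cancer"]]`, the pair `("care", "patient")` co-occurs in two documents, and keywords that appear with more distinct partners receive a higher centrality.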

The Unsupervised Learning-based Language Modeling of Word Comprehension in Korean

  • Kim, Euhee
    • Journal of the Korea Society of Computer and Information / v.24 no.11 / pp.41-49 / 2019
  • We build an unsupervised machine learning-based language model that can estimate the amount of information needed to process words consisting of subword-level morphemes and syllables. We then investigate whether the reading times of words, reflecting their morphemic and syllabic structures, are predicted by an information-theoretic measure such as surprisal. Specifically, the proposed Morfessor-based unsupervised machine learning model is first trained on a large dataset of sentences from the Sejong Corpus and is then applied to estimate the information-theoretic measure for each word in the test data of Korean words. The reading times of the words in the test data are taken from the Korean Lexicon Project (KLP) Database. A comparison between the information-theoretic measures of the words in question and the corresponding reading times, using a linear mixed-effects model, reveals a reliable correlation between surprisal and reading time. We conclude that surprisal is positively related to processing effort (i.e., reading time), confirming the surprisal hypothesis.
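Surprisal, the measure named in the abstract above, is the negative log probability of a unit, and a word's surprisal can be taken as the sum over its segments. The sketch below shows that arithmetic only, under strong simplifying assumptions: it uses a unigram model over pre-segmented toy data, the romanized segments are invented, and it does not call Morfessor or use the Sejong Corpus or KLP data. The function names are hypothetical.

```python
import math
from collections import Counter

def unigram_model(segmented_corpus):
    """Estimate segment probabilities by relative frequency."""
    counts = Counter(seg for word in segmented_corpus for seg in word)
    total = sum(counts.values())
    return {seg: c / total for seg, c in counts.items()}

def word_surprisal(segments, probs):
    """Surprisal of a word in bits: sum of -log2 p(segment).

    Raises KeyError for a segment not seen in training; a real model
    would need smoothing, which this sketch omits.
    """
    return sum(-math.log2(probs[seg]) for seg in segments)

# Invented toy corpus: each word is a list of segments.
corpus = [["hak", "gyo"], ["hak", "gyo"], ["hak", "saeng"], ["hak", "hwal"]]
probs = unigram_model(corpus)
```

In this toy corpus `"hak"` has probability 1/2, `"gyo"` 1/4, and `"hwal"` 1/8, so `word_surprisal(["hak", "gyo"], probs)` is 3 bits and `word_surprisal(["hak", "hwal"], probs)` is 4 bits. Under the surprisal hypothesis the rarer combination would be expected to take longer to read.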