JOURNAL BROWSE
Search
Advanced SearchSearch Tips
A Recognition Method for Main Characters Name in Korean Novels
facebook(new window)  Pirnt(new window) E-mail(new window) Excel Download
 Title & Authors
A Recognition Method for Main Characters Name in Korean Novels
Kim, Seo-Hee; Park, Tae-Keun; Kim, Seung-Hoon;
  PDF(new window)
 Abstract
The main characters play leading roles in novels. In the previous studies, they recognize the main characters in a novel mainly based on dictionaries that built beforehand. In English, names begin with upper cases and are used with some words. In this paper, we propose a recognition method for main characters name in Korean novels by using predicates, rules and weights. We first recognize candidates for the characters name by predicates and propose some rules to exclude candidates that cannot be characters. We assign importances for candidates, considering weights that given by the number of candidates appeared in a sentence. Finally, if the importance of the character is more than a threshold, we decide that the character is one of main characters. The results from the experiments for 300 novels show that an average accuracy is 85.97%. The main characters name may be used to grasp relationships among characters, character`s action and tendency.
 Keywords
Data Mining;Korean Linguistic Feature;Korean Novels;Main Characters;Text Mining;
 Language
Korean
 Cited by
 References
1.
K. H. Lee, J. H. Lee, M. S. Choi, G. C. Kim, "Study on Named Entity Recognition in Korean Text", Proceedings of the 12th Annual Conference on Human and Cognitive Language Technology, pp. 292-299, October, 2000.

2.
K. M. Bae, S. H. Kim, Y. J. Ko, J. H. Kim, "An Efficient Named Entity and Topic Word Recognition Method Based on Named Entity Pattern in a Natural Language Interface", Journal of Korean Institute of Information Technology, Vol.12, No.1, pp. 121-129, Korean Institute Of Information Technology, January, 2014.

3.
S. K. Han, "A Comparative Study about Construction and the Service of the Domestic Biographical Database", Journal of Korean Library And Information Science Society (JKLISS), Vol.39, No.4, pp. 331-352, December, 2008. crossref(new window)

4.
G. M. Park, S. H. Kim, H. G. Cho, "Analysis of Social Network According to The Distance of Characters Statements", Journal of The Korea Contents Association, Vol.13, No.4, pp. 427-439, April, 2013.

5.
S. Morwal, N. Jahan, D. Chopra, "Named Entity Recognition using Hidden Markov Model(HMM)", International Journal on Natural Language Computing (IJNLC), Vol.1, No,4, pp. 15-23. December, 2012.

6.
E. Minkov, R. C. Wang, W. W. Cohen, "Extracting Personal Names from Email: Applying Named Entity Recognition to Informal Text", Proceedings of the conference on Human Language Technology and Empirical Method in Natural Language Processing, pp. 443-450, Association for Computational Linguistics, 2005.

7.
D. Maynard, V. Tablan, C. Ursu, H. Cunningham, Y. Wilks, "Named Entity Recognition from Diverse Text Types", In Recent Advances in Natural Language Processing 2001 Conference, 2001.

8.
R. Fu, B. Qin, T. Liu, "Generating Chinese named entity data from parallel corpora", IJCNLP, 2011.