• Title/Summary/Keyword: Person Name Retrieval

Korean-Chinese Person Name Translation for Cross Language Information Retrieval

  • Wang, Yu-Chun;Lee, Yi-Hsun;Lin, Chu-Cheng;Tsai, Richard Tzong-Han;Hsu, Wen-Lian
    • Proceedings of the Korean Society for Language and Information Conference / 2007.11a / pp.489-497 / 2007
  • Named entity translation plays an important role in many applications, such as information retrieval and machine translation. In this paper, we focus on translating person names, the most common type of named entity in Korean-Chinese cross language information retrieval (KCIR). Unlike other languages, Chinese uses characters (ideographs), which makes person name translation difficult because one syllable may map to several Chinese characters. We propose an effective hybrid person name translation method to improve the performance of KCIR. First, we use Wikipedia as a translation tool based on the inter-language links between the Korean edition and the Chinese or English editions. Second, we adopt the Naver people search engine to find the query name's Chinese or English translation. Third, we extract Korean-English transliteration pairs from Google snippets, and then search for the English-Chinese transliteration in the database of Taiwan's Central News Agency or in Google. The performance of KCIR using our method is over five times better than that of a dictionary-based system. The mean average precision is 0.3490 and the average recall is 0.7534. The method can handle the translation of Chinese, Japanese, Korean, and non-CJK person names from Korean to Chinese. Hence, it substantially improves the performance of KCIR.

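
The three-source approach described in the abstract above can be sketched as a fallback chain. This is only an illustration: it assumes the sources are consulted in order and that the first hit wins, which the abstract does not state. The dictionaries are toy stand-ins for Wikipedia inter-language links, the Naver people search, and the snippet-derived transliteration pairs, and every function name is hypothetical.

```python
from typing import Callable, Dict, List, Optional

# Toy stand-ins for the external resources (illustrative data only).
WIKI_LINKS: Dict[str, str] = {"김대중": "金大中"}
PEOPLE_SEARCH: Dict[str, str] = {"이영애": "李英愛"}
KO_EN_PAIRS: Dict[str, str] = {"오바마": "Obama"}
EN_ZH_PAIRS: Dict[str, str] = {"Obama": "歐巴馬"}


def via_wikipedia(name: str) -> Optional[str]:
    """Step 1: look up an inter-language link for the Korean name."""
    return WIKI_LINKS.get(name)


def via_people_search(name: str) -> Optional[str]:
    """Step 2: look up the name in a people-search resource."""
    return PEOPLE_SEARCH.get(name)


def via_transliteration(name: str) -> Optional[str]:
    """Step 3: Korean -> English pair, then English -> Chinese lookup."""
    english = KO_EN_PAIRS.get(name)
    if english is None:
        return None
    return EN_ZH_PAIRS.get(english)


RESOLVERS: List[Callable[[str], Optional[str]]] = [
    via_wikipedia,
    via_people_search,
    via_transliteration,
]


def translate_person_name(name: str) -> Optional[str]:
    """Return the first translation found, or None if every source misses."""
    for resolver in RESOLVERS:
        result = resolver(name)
        if result is not None:
            return result
    return None
```

With the toy data, `translate_person_name("오바마")` falls through the first two sources and is resolved by the transliteration step, while a name absent from all three yields `None`.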

Enhanced Method for Person Name Retrieval in Academic Information Service (학술정보서비스에서 인명검색 고도화 방법)

  • Han, Hee-Jun;Yae, Yong-Hee;You, Beom-Jong
    • The Journal of the Korea Contents Association / v.10 no.2 / pp.490-498 / 2010
  • Whether on the web or not, all academic information has a creator who produced it. The creator can be an individual, an organization, an institution, or a country. Most information consists of a title, an author, and content. An article is described by its title, author, keywords, abstract, publisher, ISSN (International Standard Serial Number), etc., and patent information consists of metadata such as invention title, applicant, inventors, agents, application number, and claim items. Most web-based academic information services provide search functions to users by processing these metadata, and search on the author field is especially important. In this paper, we propose an effective index management method for person name search, together with search techniques that use a boosting factor and a near operation based on phrase search to improve the precision of search results. We also describe person name retrieval results that include alternative forms of the name, co-authors, and persons in the same research field. The approach presented in this paper efficiently provides users with accurate data and additional search results.
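
A minimal sketch of how a boosting factor and a near operation might combine in author-field scoring. The abstract does not give the actual formula, so the weights, the `slop` window, and all function names below are assumptions made for illustration.

```python
from typing import List


def tokenize(text: str) -> List[str]:
    """Lowercase, drop commas, and split on whitespace."""
    return text.lower().replace(",", " ").split()


def phrase_match(query: List[str], field: List[str]) -> bool:
    """True if the query tokens occur contiguously and in order."""
    n = len(query)
    return any(field[i:i + n] == query for i in range(len(field) - n + 1))


def near_match(query: List[str], field: List[str], slop: int) -> bool:
    """True if every query token occurs and they lie within a small window.

    Uses the first occurrence of each token, which is enough for a sketch.
    """
    positions = []
    for token in query:
        if token not in field:
            return False
        positions.append(field.index(token))
    return max(positions) - min(positions) <= len(query) - 1 + slop


def score_author(query: str, author: str,
                 phrase_boost: float = 2.0,
                 near_boost: float = 1.0,
                 slop: int = 1) -> float:
    """Exact phrase hits get the higher boost; near hits get the lower one."""
    q, a = tokenize(query), tokenize(author)
    if not q:
        return 0.0
    if phrase_match(q, a):
        return phrase_boost
    if near_match(q, a, slop):
        return near_boost
    return 0.0
```

Under these assumed weights, the query `"Han Hee-Jun"` scores 2.0 against the author field `"Han, Hee-Jun"`, while the reordered query `"Hee-Jun Han"` still matches through the near operation and scores 1.0.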

Indexing and Retrieval of Human Individuals on Video Data Using Face and Speaker Recognition

  • Y.Sugiyama;N.Ishikawa;M.Nishida;Y.Ariki
    • Proceedings of the Korean Society of Broadcast Engineers Conference / 1998.06b / pp.122-127 / 1998
  • In this paper, we focus on retrieving information about human individuals recorded in a video database. Our purpose is to index persons by their faces or voices and to retrieve the time sections of the video in which they appear. The database system can track and extract the face or voice of a given person and construct a model of that individual in a self-organizing mode. If the person appears again at a different time, the system can mark the associated frames as belonging to the same person. In this way, the same person can be retrieved even if the system does not know his or her name. For face and speaker modeling, a subspace method is employed to improve indexing accuracy.


Personal Name Authority Control in Korean Public Libraries (국내 공공도서관의 인명 전거제어의 현황 및 발전 방향)

  • Shim, Kyung
    • Journal of the Korean Society for Library and Information Science / v.40 no.4 / pp.221-244 / 2006
  • This research analyzes the current status of personal name authority control and its impact on end-user searching of OPACs in public libraries in Korea. It also suggests ways to improve the recall ratio of author searches with minimal modification, both system-wise and authority-wise, on KOLISNET as a stepping stone for other public libraries. Finally, a long-term plan for establishing proper authority work in public libraries, including the National Library of Korea, is briefly proposed. To find out whether authority work is conducted, and to examine how variant written forms of the same foreign name and variant names of the same person are treated, the OPACs of the National Library of Korea, KOLISNET, and ten randomly selected public libraries were searched. Findings indicate that while the National Library of Korea was performing authority control, albeit incompletely, the rest did not appear to conduct any form of authority control. As a spinoff of the research, it was observed that in many public libraries the bibliographic records and retrieval methods are inaccurate, inconsistent, and incomplete. In sum, it is strongly recommended that (1) as a start for authority work among public libraries, personal name authority control should be conducted to enhance the identifying and collocating functions of OPACs, and (2) a shared authority database, for which the National Library of Korea's authority database might be used, should be built for public libraries.
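
To illustrate how authority control can raise recall in author searches, the sketch below expands a query to every recorded form of a name before searching. The authority file is a toy example, and the structure and function names are hypothetical; they are not taken from KOLISNET or the National Library of Korea.

```python
from typing import Dict, Set

# Toy authority file: authorized heading -> variant forms (illustrative only).
AUTHORITY_FILE: Dict[str, Set[str]] = {
    "Tolstoy, Leo": {"톨스토이", "똘스또이", "Tolstoi, Lev"},
}


def build_variant_index(authority_file: Dict[str, Set[str]]) -> Dict[str, str]:
    """Map every heading and variant (case-folded) to its authorized heading."""
    index: Dict[str, str] = {}
    for heading, variants in authority_file.items():
        index[heading.casefold()] = heading
        for variant in variants:
            index[variant.casefold()] = heading
    return index


def expand_author_query(query: str,
                        authority_file: Dict[str, Set[str]]) -> Set[str]:
    """Return all name forms to search for; unknown names pass through as-is."""
    index = build_variant_index(authority_file)
    heading = index.get(query.casefold())
    if heading is None:
        return {query}
    return {heading} | authority_file[heading]
```

A search for `"똘스또이"` is expanded to all four recorded forms, so records catalogued under any variant are collocated. A name with no authority record is searched literally, which corresponds to the situation the abstract reports for most public libraries.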

A Study on the Optimization of Semantic Relation of Author Keywords in Humanities, Social Sciences, and Art and Sport of the Korea Citation Index (KCI) (한국학술지인용색인(KCI)의 인문학, 사회과학, 예술체육 분야 저자키워드의 의미적 관계 유형 최적화 연구)

  • Ko, Young Man;Song, Min-Sun;Lee, Seung-Jun
    • Journal of the Korean Society for Library and Information Science / v.49 no.1 / pp.45-67 / 2015
  • The purpose of this study is to analyse the semantic relations of terms in STNet, a structured terminology dictionary based on author keywords in the humanities, social sciences, and art and sport in the Korea Citation Index (KCI), and to describe the procedure for optimizing the relation types and specifying the names of the relationships. The results indicate that four logical criteria are required to optimize the typing and naming of relationships: creating new names for relationships, limiting relationship types according to how frequently the same type appears, considering the direction of the relationship, and accepting existing relationship names where appropriate. We applied these criteria to the relationships in the class "real person" of STNet, and the result shows that 1,135 out of 1,743 uncertain relationships, such as RT, RT_X, or RT_Y, were specified and clarified. This optimization rate of about 65% demonstrates the usefulness of the criteria for database construction and retrieval.
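
The reported rate can be checked directly from the two counts given in the abstract. The variable names are chosen here for illustration.

```python
# Counts taken from the abstract above.
specified = 1135        # uncertain relationships that were specified and clarified
uncertain_total = 1743  # all uncertain relationships (RT, RT_X, RT_Y, ...)

rate = specified / uncertain_total
print(f"{rate:.1%}")  # prints 65.1%
```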

A Study on Developing Facets for Subject Headings in Korea (한국 주제명 표목의 패싯 유형 개발에 관한 연구)

  • Choi, Yoon Kyung;Chung, Yeon-Kyoung
    • Journal of the Korean Society for Library and Information Science / v.49 no.4 / pp.179-201 / 2015
  • The subject heading is an elaborate access tool for subject browsing and searching in an information retrieval environment. The purpose of this study is to suggest facets applicable to subject headings in Korea. First, the concepts of subject and the definitions of facets were investigated in a literature review. Second, six cases, namely OCLC's FAST, PRECIS, "Thesaurus construction and use", CC 7th edition, BC 2nd edition, and UDC 3rd edition, were analyzed as case studies, focusing on the configuration of their facets. Based on the results, twenty-two facets were proposed, with Topical, Event, Geography, Chronology, Personal and Corporate Name, Title, Form, Genre, Language, and Person facets as the 11 top facets. In addition, Topical-Thing/Entity, Topical-Action/Status, Part, Kind, Property, Whole, Material, Patient, Product, By-Product, and Agent facets were proposed as sub-facets of the Topical facet.
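
The facet scheme listed in the abstract can be written down as a small tree. This sketch assumes that "Personal and Corporate Name" counts as two facets, Personal Name and Corporate Name, since that reading gives the stated 11 top facets and 22 facets in total. The variable and function names are illustrative.

```python
from typing import Dict, List

TOP_FACETS: List[str] = [
    "Topical", "Event", "Geography", "Chronology",
    "Personal Name", "Corporate Name",  # assumed split, see the note above
    "Title", "Form", "Genre", "Language", "Person",
]

TOPICAL_SUB_FACETS: List[str] = [
    "Thing/Entity", "Action/Status", "Part", "Kind", "Property", "Whole",
    "Material", "Patient", "Product", "By-Product", "Agent",
]

# Each top facet maps to its sub-facets; only Topical has any in the abstract.
FACET_TREE: Dict[str, List[str]] = {facet: [] for facet in TOP_FACETS}
FACET_TREE["Topical"] = list(TOPICAL_SUB_FACETS)


def count_facets(tree: Dict[str, List[str]]) -> int:
    """Count top facets plus all sub-facets."""
    return len(tree) + sum(len(subs) for subs in tree.values())
```

Under that assumption, `count_facets(FACET_TREE)` returns 22, matching the total given in the abstract.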