Concept-based Question Analysis for Accurate Answer Extraction

  • Published : 2007.01.28

Abstract

This paper describes a concept-based approach to question analysis. It analyzes concepts rather than keywords, because concepts matter more for accurate answer extraction. Our idea is that well-defined concepts let us extract correct answers from paragraphs with differing structures, because the concepts that occur in questions of the same answer type are similar. To that end, we analyze the syntactic and semantic role of each word or phrase in a question in order to retrieve more relevant documents and extract more accurate answers from them. For each answer type, we define a concept frame composed of the concepts that commonly occur in questions of that type, and we analyze a user's question by filling the concept frame with words or phrases from the question. Empirical results show that our concept-based question analysis extracts more accurate answers than conventional approaches. The concept-based approach has two additional merits: it is a language-independent model, and it can be combined with arbitrary conventional approaches.
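The frame-filling idea in the abstract can be illustrated with a minimal Python sketch. Everything named here is an assumption made for illustration: the answer types, the slot names (`action`, `object`, `time`, `event`), the toy lexicon, and the functions `classify_answer_type` and `fill_concept_frame` do not come from the paper. The paper analyzes the syntactic and semantic role of each word or phrase; this sketch substitutes a simple dictionary lookup for that analysis.

```python
# Hypothetical concept frames: each answer type lists the concepts that
# questions of that type are assumed to share. Not the paper's inventory.
CONCEPT_FRAMES = {
    "PERSON": ["action", "object", "time"],
    "LOCATION": ["event", "time"],
}

# Toy lexicon mapping a word to a concept. It stands in for the syntactic
# and semantic role analysis described in the abstract.
CONCEPT_LEXICON = {
    "invented": "action",
    "telephone": "object",
    "1876": "time",
    "olympics": "event",
    "1988": "time",
}

# Assumed mapping from the leading question word to an answer type.
QUESTION_WORDS = {"who": "PERSON", "where": "LOCATION"}


def classify_answer_type(question: str) -> str:
    """Guess the answer type from the first word of the question."""
    tokens = question.lower().split()
    if not tokens:
        return "UNKNOWN"
    return QUESTION_WORDS.get(tokens[0], "UNKNOWN")


def fill_concept_frame(question: str):
    """Return (answer_type, frame), where frame maps each concept to a word."""
    answer_type = classify_answer_type(question)
    # Start with an empty frame: one unfilled slot per concept of this type.
    frame = {slot: None for slot in CONCEPT_FRAMES.get(answer_type, [])}
    for token in question.lower().rstrip("?").split():
        concept = CONCEPT_LEXICON.get(token)
        # Fill a slot only if the frame has it and it is still empty.
        if concept in frame and frame[concept] is None:
            frame[concept] = token
    return answer_type, frame


if __name__ == "__main__":
    print(fill_concept_frame("Who invented the telephone in 1876?"))
```

Under these assumptions, two differently worded questions of the same answer type fill the same slots, so a downstream retrieval or extraction step could match on the filled frame rather than on surface keywords.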

Keywords

Question Answering System; Question Analysis; Concept; Answer Extraction
