DOI QR코드

DOI QR Code

A Experimental Study on the Usefulness of Structure Hints in the Leaf Node Language Model-Based XML Document Retrieval

단말노드 언어모델 기반의 XML문서검색에서 구조 제한의 유용성에 관한 실험적 연구

  • Published : 2007.03.30

Abstract

XML documents format on the Web provides a mechanism to impose their content and logical structure information. Therefore, an XML processor provides access to their content and structure. The purpose of this study is to investigate the usefulness of structural hints in the leaf node language model-based XML document retrieval. In order to this purpose, this experiment tested the performances of the leaf node language model-based XML retrieval system to compare the queries for a topic containing only content-only constraints and both content constrains and structure constraints. A newly designed and implemented leaf node language model-based XML retrieval system was used. And we participated in the ad-hoc track of INEX 2005 and conducted an experiment using a large-scale XML test collection provided by INEX 2005.

XML웹 문서 포맷은 문헌 내에 내용과 의미있는 논리적인 구조 정보를 포함할 수 있어, 검색에서 문서의 내용뿐만 아니라 구조로 접근하는 것을 제공한다. 그래서 본 연구의 목적은XML검색에 있어 내용 검색에 추가적인 요소로 사용된 구조적인 제한이 얼마나 유용한지를 실험하기 위해 내용만으로 검색한 결과와 내용과 구조적인 제한을 가지고 검색한 결과간의 성능을 비교하였다. 이 실험은 자체 개발된 단말노드 언어모델기반의 XML 검색시스템을 사용하였고 INEX 2005의 ad-hoc track에 참여하여 모든 실험방법과 INEX 2005의 실험 문헌 집단을 사용하였다.

Keywords

References

  1. 김희섭. 2004. Retrieval Performance of XML document Using Object-Relational Database. 정보관리학회지, 22(2): 189-210. https://doi.org/10.3743/KOSIM.2004.21.2.189
  2. 박종관. 2001. XML 문서의 효율적인 구조검색을 위한 색인모텔. 한국정보처리학회논문지, 8(D): 451-460.
  3. 정영미, 김희섭. 2005. 정보검색에서의 언어모델 적용에 관한 분석. 한국도서관. 정보학회지 36(2): 49-68.
  4. 정영미 외. 2005. XQuery 기반 XML 검색시스템의 구조적인 질의 검색 성능평가. 제12회 한국정보관리학회 학술대회 논문집, 295-304
  5. Bos, B. 1997. XML representation INEX of a relational database [cited 2006. 11. 23] .
  6. INitiative for the Evaluation of XML Retrieval 2004 homepage.[cited 2006. 10. 28]
  7. Miller, D., T. Leek, and R. Schwartz. 1999. "A Hidden Markov Model Information Retrieval System." In Proceedings of the 22nd Annual International ACM SIGIR Conference, 214-221. https://doi.org/10.1145/312624.312680
  8. Ogilvie, Paul and J. Callan. 2003. "Language Models and Structured Document Retrieval." In Proceedings of the First Workshop of the INitiative for the Evaluation of XML Retrieval, 12-18
  9. Ponte, Jay M. and W. Bruce Croft. 1998. "A Language Modeling Approach to Information Retrieval." In Proceedings of the 21nd Annual International ACM SIGIR Conference, 275-281. https://doi.org/10.1145/290941.291008
  10. Robert, W. P. Luk et al. 2002. "A Survey in Indexing and Searching XML Document." Journal of The American Society for Information Science and Technology, 53(6): 415-437. https://doi.org/10.1002/asi.10056
  11. Sigurbjornsson, Borkur, Jaap Kamps, and Maarten de Rijke. 2004. "An Element-Based Approach to Retrieval." In Proceedings of the 2003 Workshop of the INitiative for the Evaluation of XML Retrieval, 19-26
  12. Turau, V. 1999. Making legacy data accessible for XML application. [cited 2006. 9. 1]
  13. Wiegand, Nancy. 2002. "lnvestigating XQuery for Querying Across Database Object Types." SIGMOD Record, 31(2): 28-33. https://doi.org/10.1145/565117.565122
  14. Zaragiza, H., D. Hiemstra, and M. Tipping. 2003. "Bayesian Extension to the Language Model for Ad Hoc Information Retrieval." In:Proceedings of the 26nd Annual International ACM SIGIR Conference,4-9 https://doi.org/10.1145/860435.860439
  15. Zhai, C. and J. Lafferty. 2001. 'Document Language Models, Query Models, and Risk Minimization for Information Retrieval.' In Proceedings of the 24nd Annual International ACM SIGIR Conference, 111-119 https://doi.org/10.1145/383952.383970
  16. Zhai, Chengxiang and John Lafferty. 2004. "A Study of Smoothing Methods for Language Models Applied to Information Retrieval." ACM Transactions on Information Systems, 22(2): 179-214. https://doi.org/10.1145/984321.984322