Improving the Performance of Web Search using Query Types

Korean title: 질의유형에 기반한 웹 검색의 성능 향상

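  • 강인호 (Samsung Advanced Institute of Technology, Computing Lab) ;
  • 안동언 (Chonbuk National University, Division of Electronics and Information Engineering)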
  • Published : 2004.08.01

Abstract

The Web is a rich source of many kinds of information. Because Web document collections are massive and heterogeneous, users look for different types of target pages. Each type of information used in Web search works well only for certain kinds of queries; when a query falls outside that set, the retrieved documents are poor. A search engine therefore needs different strategies to exploit the strengths of each type of information. If the properties of each type are known, candidate pages can be refined and ranked more precisely. We conduct experiments to characterize each type of information and, based on the results, present a combining formula that exploits those properties. In addition, for the service finding task, we propose Service Link Information, which exploits the presence of mechanisms for user interaction.

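With the growth of the Internet, the variety and amount of information available on the Web are increasing rapidly. Accordingly, the information users seek has expanded beyond documents to sites and services. The kinds of information used for Web search in previous work, and the uniform way in which they are combined, have difficulty satisfying these diverse user needs. To obtain better results, the characteristics of the information used in retrieval must be analyzed, and the information appropriate to each query must be used. In this study, we examine how useful each kind of information is for each user query type and analyze how it should be used. We also propose service link information for service search, a task that is gradually gaining importance.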
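The abstract does not give the combining formula itself, so the following is only a minimal sketch of the general idea, assuming a simple weighted sum whose weights depend on the query type. The evidence names (`content`, `link`, `url`), the query type labels, and every weight value are hypothetical and chosen for illustration; they are not taken from the paper.

```python
from dataclasses import dataclass

# Hypothetical weights per query type (illustrative values, not from the paper).
# Each row sums to 1 so that scores stay comparable across query types.
WEIGHTS = {
    "document": {"content": 0.8, "link": 0.1, "url": 0.1},
    "site": {"content": 0.4, "link": 0.4, "url": 0.2},
    "service": {"content": 0.5, "link": 0.2, "url": 0.3},
}


@dataclass
class Evidence:
    """Normalized scores in [0, 1] for one candidate page (assumed inputs)."""
    content: float  # text match between query and page
    link: float     # link-based evidence such as anchor text or in-links
    url: float      # URL-based evidence such as path depth


def combined_score(ev: Evidence, query_type: str) -> float:
    """Weighted sum of the evidence, with weights selected by query type."""
    w = WEIGHTS[query_type]
    return w["content"] * ev.content + w["link"] * ev.link + w["url"] * ev.url


def rank(candidates: dict, query_type: str) -> list:
    """Return candidate names ordered from highest to lowest combined score."""
    return sorted(
        candidates,
        key=lambda name: combined_score(candidates[name], query_type),
        reverse=True,
    )


candidates = {
    "essay": Evidence(content=0.9, link=0.1, url=0.1),
    "home": Evidence(content=0.5, link=0.9, url=0.9),
}
print(rank(candidates, "document"))
print(rank(candidates, "site"))
```

With these illustrative weights, the same two candidates swap places when the query type changes, which is the behavior a query-type-dependent combination is meant to produce. In practice the weights would have to be tuned on evaluation data for each query type.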
