JOURNAL BROWSE
Search
Advanced SearchSearch Tips
Topic Level Disambiguation for Weak Queries
facebook(new window)  Pirnt(new window) E-mail(new window) Excel Download
 Title & Authors
Topic Level Disambiguation for Weak Queries
Zhang, Hui; Yang, Kiduk; Jacob, Elin;
  PDF(new window)
 Abstract
Despite limited success, today`s information retrieval (IR) systems are not intelligent or reliable. IR systems return poor search results when users formulate their information needs into incomplete or ambiguous queries (i.e., weak queries). Therefore, one of the main challenges in modern IR research is to provide consistent results across all queries by improving the performance on weak queries. However, existing IR approaches such as query expansion are not overly effective because they make little effort to analyze and exploit the meanings of the queries. Furthermore, word sense disambiguation approaches, which rely on textual context, are ineffective against weak queries that are typically short. Motivated by the demand for a robust IR system that can consistently provide highly accurate results, the proposed study implemented a novel topic detection that leveraged both the language model and structural knowledge of Wikipedia and systematically evaluated the effect of query disambiguation and topic-based retrieval approaches on TREC collections. The results not only confirm the effectiveness of the proposed topic detection and topic-based retrieval approaches but also demonstrate that query disambiguation does not improve IR as expected.
 Keywords
Topic Detection;Query Disambiguation;Language Model;Information Retrieval;Natural Language Processing;
 Language
English
 Cited by
 References
1.
Baillie, M., Azzopardi, L., & Crestani, F. (2006). Adaptive query-based sampling of distributed collections. Lecture Notes in Computer Science, 4209, 316.

2.
Bendersky, M., Croft, W. B., & Smith, D. A. (2009). Twostage query segmentation for information retrieval. Paper presented at the Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval, Boston, MA.

3.
Buckley, C., & Harman, D. (2004). Reliable information access final workshop report. ARDA Northeast Regional Research Center Technical Report.

4.
Buckley, C., Salton, G., Allan, J., & Singhal, A. (1995). Automatic query expansion using SMART: TREC 3. Overview of the Third Text REtrieval Conference (TREC-3), 500-225.

5.
Gale, W. A., Church, K. W., & Yarowsky, D. (1992). A method for disambiguating word senses in a large corpus. Computers and the Humanities, 26(5), 415-439. crossref(new window)

6.
Guo, J., Xu, G., Li, H., & Cheng, X. (2008). A unified and discriminative model for query refinement. Paper presented at the Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval.

7.
Harman, D. (1992). Relevance feedback revisited. Proceedings of the 15th annual international ACM SIGIR conference on Research and development in information retrieval, 1-10.

8.
Jing, Y., & Croft, W. B. (1994). An association thesaurus for information retrieval. Proceedings of RIAO, 94(1994), 146-160.

9.
Klyuev, V., & Haralambous, Y. (2011). Query expansion: Term selection using the ewc semantic relatedness measure. Paper presented at the Computer Science and Information Systems (FedCSIS), 2011 Federated Conference on.

10.
Metzler, D., & Croft, W. B. (2004). Combining the language model and inference network approaches to retrieval. Information processing & management, 40(5), 735-750. crossref(new window)

11.
Mihalcea, R. (2003). Turning WordNet into an information retrieval resource: Systematic polysemy and conversion to hierarchical codes. International Journal of Pattern Recognition and Artificial Intelligence, 17(05), 689-704. doi: doi:10.1142/S0218001403002605 crossref(new window)

12.
Mihalcea, R., & Csomai, A. (2007). Wikify!: Linking documents to encyclopedic knowledge. Paper pre-sented at the Proceedings of the sixteenth ACM conference on Information and knowledge management, Lisbon, Portugal.

13.
Milne, D., & Witten, I. H. (2008). Learning to link with Wikipedia. Paper presented at the Proceedings of the 17th ACM conference on Information and knowledge management, Napa Valley, CA.

14.
Navigli, R. (2009). Word sense disambiguation: A survey. ACM Computing Surveys (CSUR), 41(2), 10.

15.
Prakash, R. S. S., Jurafsky, D., & Ng, A. Y. (2007). Learning to merge word senses. Computer Science Department, Stanford University.

16.
Qiu, Y., & Frei, H. P. (1993). Concept based query expansion. Proceedings of the 16th annual international ACM SIGIR conference on Research and development in information retrieval, 160-169.

17.
Ruthven, I. A. N., & Lalmas, M. (2003). A survey on the use of relevance feedback for information access systems. The Knowledge Engineering Review, 18(02), 95-145. crossref(new window)

18.
Salton, G., & Buckley, C. (1990). Improving retrieval performance by relevance feedback. Journal of the American Society for Information Science, 41(4), 288-297. crossref(new window)

19.
Sanderson, M. (1994). Word sense disambiguation and information retrieval. New York, NY: Springer-Verlag.

20.
Sanderson, M. (2000). Retrieving with good sense. Information Retrieval, 2(1), 49-69. crossref(new window)

21.
Selvaretnam, B., & Belkhatir, M. (2012). Natural language technology and query expansion: Issues, state-of-the-art and perspectives. Journal of Intelligent Information Systems, 38(3), 709-740. doi: 10.1007/s10844-011-0174-3 crossref(new window)

22.
Spink, A., Wolfram, D., Jansen, M. B. J., & Saracevic, T. (2001). Searching the web: The public and their queries. Journal of the American Society for Information Science and Technology, 52(3), 226-234. crossref(new window)

23.
Voorhees, E. M. (1993). Using WordNet to disambiguate word senses for text retrieval. Proceedings of the 16th annual international ACM SIGIR conference on Research and development in information retrieval, 171-180.

24.
Voorhees, E. M. (1994). Query expansion using lexicalsemantic relations. New York, NY: Springer-Verlag.

25.
Zhai, C., & Lafferty, J. (2004). A study of smoothing methods for language models applied to information retrieval. ACM Transactions on Information Systems, 22(2), 179-214. crossref(new window)

26.
Zhai, C. X. (2008). Statistical language models for information retrieval: A critical review. Foundations and Trends in Information Retrieval, 2(3), 137-213.