Subtopic Mining of Two-level Hierarchy Based on Hierarchical Search Intentions and Web Resources
 Authors
Kim, Se-Jong; Lee, Jong-Hyeok
 
 Abstract
Subtopic mining extracts and ranks the possible subtopics of an input query, which disambiguate and specify its search intentions, in terms of relevance, popularity, and diversity. This paper describes the limitations of previous studies in their use of web resources and, to overcome them, proposes a subtopic mining method with a two-level hierarchy based on hierarchical search intentions and web resources. Considering the characteristics of the resources provided by the official subtopic mining task, we extract various second-level subtopics that reflect hierarchical search intentions from web documents, and then expand and re-rank them using the other provided resources. Terms in subtopics with wider search intentions are used to generate first-level subtopics. Our method performed better than state-of-the-art methods in almost every aspect.
 Keywords
search intention; subtopic mining; popularity; diversity; hierarchical structure
 Language
Korean