DOI QR코드

DOI QR Code

Ontology-based User Customized Search Service Considering User Intention

온톨로지 기반의 사용자 의도를 고려한 맞춤형 검색 서비스

  • Kim, Sukyoung (Department of Computer Engineering, Hanbat National University) ;
  • Kim, Gunwoo (Dep. of Business and Accounting, Hanbat National University)
  • 김수경 (국립한밭대학교 컴퓨터공학과) ;
  • 김건우 (국립한밭대학교 경영회계학과)
  • Received : 2012.12.16
  • Accepted : 2012.12.20
  • Published : 2012.12.31

Abstract

Recently, the rapid progress of a number of standardized web technologies and the proliferation of web users in the world bring an explosive increase of producing and consuming information documents on the web. In addition, most companies have produced, shared, and managed a huge number of information documents that are needed to perform their businesses. They also have discretionally raked, stored and managed a number of web documents published on the web for their business. Along with this increase of information documents that should be managed in the companies, the need of a solution to locate information documents more accurately among a huge number of information sources have increased. In order to satisfy the need of accurate search, the market size of search engine solution market is becoming increasingly expended. The most important functionality among much functionality provided by search engine is to locate accurate information documents from a huge information sources. The major metric to evaluate the accuracy of search engine is relevance that consists of two measures, precision and recall. Precision is thought of as a measure of exactness, that is, what percentage of information considered as true answer are actually such, whereas recall is a measure of completeness, that is, what percentage of true answer are retrieved as such. These two measures can be used differently according to the applied domain. If we need to exhaustively search information such as patent documents and research papers, it is better to increase the recall. On the other hand, when the amount of information is small scale, it is better to increase precision. Most of existing web search engines typically uses a keyword search method that returns web documents including keywords which correspond to search words entered by a user. This method has a virtue of locating all web documents quickly, even though many search words are inputted. However, this method has a fundamental imitation of not considering search intention of a user, thereby retrieving irrelevant results as well as relevant ones. Thus, it takes additional time and effort to set relevant ones out from all results returned by a search engine. That is, keyword search method can increase recall, while it is difficult to locate web documents which a user actually want to find because it does not provide a means of understanding the intention of a user and reflecting it to a progress of searching information. Thus, this research suggests a new method of combining ontology-based search solution with core search functionalities provided by existing search engine solutions. The method enables a search engine to provide optimal search results by inferenceing the search intention of a user. To that end, we build an ontology which contains concepts and relationships among them in a specific domain. The ontology is used to inference synonyms of a set of search keywords inputted by a user, thereby making the search intention of the user reflected into the progress of searching information more actively compared to existing search engines. Based on the proposed method we implement a prototype search system and test the system in the patent domain where we experiment on searching relevant documents associated with a patent. The experiment shows that our system increases the both recall and precision in accuracy and augments the search productivity by using improved user interface that enables a user to interact with our search system effectively. In the future research, we will study a means of validating the better performance of our prototype system by comparing other search engine solution and will extend the applied domain into other domains for searching information such as portal.

웹 기술의 급속한 발전은 기업들이 관리해야 하는 정보량의 폭발적인 증가를 초래하였다. 이와 더불어 보다 정확한 정보를 찾기 위한 검색 엔진 솔루션 시장의 규모도 더불어 크게 증가하였다. 하지만 대부분의 검색엔진들은 사용자의 검색 의도를 고려하지 않고 사용자가 입력한 특정 키워드를 포함하는 문서들을 반환하는 방법을 채택하고 있어, 실제 사용자가 원하는 정보를 찾는데 까지는 부가적인 시간과 노력이 요구된다. 본 연구에서는 이러한 문제를 해결 하기 위한 중요 기술인 적합성을 만족시키기 위해 재현율과 정확율을 높일 수 있는 방법을 제안하였다. 우선 검색어의 재현율을 높일 수 있도록 유사어 관계 확장을 위한 온톨로지 스키마 모델을 제안하고 이를 기반으로 한 추론을 통해 검색어의 확장을 제시하였다. 확장된 검색어들을 이용하여 문서 검색을 하기 위한 다단계 유사도 검색 순위화 알고리즘을 제안하였다. 설계된 온톨로지 스키마와 온톨로지 저장소의 데이터를 기반으로 추론과 유사도 검색 순위화 엔진이 포함된 웹사이트 형식의 사용자 의도 적응형 검색 솔루션을 구현하였다. 구현된 검색 솔루션을 통해 다양한 검색어를 입력하여 제안 방법의 타당성을 입증하였고 사용자 의도를 고려한 맞춤형 검색 솔루션의 필요성을 설명하였다.

Keywords

Acknowledgement

Supported by : 한국연구재단

References

  1. Choi, B. G., S. J. Park, J. H. Lee, T. Park, and J. W. Song, "Design and Implementation of Web Directory Engine Using Dynamic Category Hierarchy", Journal of Korean Society for internet information, Vol.7, No.2(2005), 71-80.
  2. Cui, J., J. Liu, Y. Wu, and N. Gu, "An Ontology Modeling Method in Semantic Composition of Web Services", Proceedings of the IEEE International Conference on E-Commerce Technology for Dynamic E-Business, 2004.
  3. Cranefield, S., S. Haustein, and M. Purvis, The Information Science Discussion Paper Series, UML-Based Ontology Modeling for Software Agent, University of Otago, 2004. Available, at http://otago.ourarchive.ac.nz/bitstream/ handle/10523/1081/dp2001-07.pdf?...3 (Downloaded 17 September, 2011).
  4. Dean, M., "Semantic Web Rules : Covering the Use" Case, LNCS 3323(2004), 1-5.
  5. Djuric, D., D. Gasevic, and V. Devedzic, "Ontology Modeling and MDA", Journal of Object Technology, Vol.4, No.1(2005), 109-128. https://doi.org/10.5381/jot.2005.4.1.a3
  6. Gomez-Perez, "Evaluating Ontology Evaluation, in Why evaluate ontology technologies? Because it works", IEEE Intelligent Systems, Vol.19. No.4(2004), 74-76. https://doi.org/10.1109/MIS.2004.37
  7. Hong, G. C. and H. S. Chung, "An Implementation of Web-Based Korean Language Information Retrieval System", Electronics and Telecommunications Trends, Vol.14, No.6(2001), 9-21.
  8. Horrocks, I. and P. F. Patel-Schneider, H. Boley, S. Tabet, B. Grosof and M. Dean. SWRL : A Semantic Web Rule Language Combining OWL and RuleML, W3C. 2003. Available at http://www.w3.org/Submission/SWRL/ (Downloaded 12 June, 2012).
  9. Hwang, S. Y. and H. S. Yang, RDF/OWL Introduction Hong-Rung Publishing Company, Seoul: Korea, 2009.
  10. Jena, Apache Software Foundation, 2011. Available at http://jena.sourceforge.net/(Downloaded 8 June, 2012).
  11. Jeon, H. C. and J. M. Choi, "PIRS : Personalized Information Retrieval System using Adaptive User Profiling and Real-time Filtering for Search Results", Journal of Intelligence and Information Systems, Vol.16, No.4(2010), 21-41.
  12. Jin, B. S. and Y. G. Ji, "A Study about Search Engine Interface Design including User's Search Goal", The Journal of Society for e-Business Studies, Vol.13, No.4(2008), 111- 124.
  13. Kim, H. S., S. C. Park, and S. H. Kim, "Measurement of Document Similarity using Term/Termpair Features and Neural Network ", Journal of KIISE : Software and Application, Vol.31, No.12(2004), 1660-1671.
  14. Kim, S. K., "Using Description Logic and Rule Language for Web Ontology Modeling", Proceedings of Korea Intelligent Information System Society (KIISS) 2007 Spring Conference, 277-285.
  15. Kim, S. K. and K. H. Ahn "A Study of Dynamic Web Ontology for Comparison-shopping Agent based on Semantic Web", Journal of Intelligence and Information Systems, Vol.11, No.2(2005), 31-45.
  16. Klein, M. and U. Visser, "Semantic Web Challenge 2003", IEEE Intelligent Systems, Vol.19, No.3 (2004), 31-33. https://doi.org/10.1109/MIS.2004.11
  17. Korea Internet and Security Agency(KISA), Survey on the Internet Usage, (2011), 2-4. Available at http://www.itstat.go.kr/board/ boardDetailView.htm;jsessionid=6446942195 2064F65ED5334E3EAE320D?identifier=02-0 08-120222-000001&pub_code=6_5&page=1 (Downloaded 12 September, 2012).
  18. Management and Computer, Market Size of Domestic Search Engine, KyungCom, Seoul, 2008.
  19. National Information Society Agency (NIA), Development of Knowledge Representation and Inference Mechanism for Knowledge Exchange and Delivery on the Web, 2004, Available at http://www.nia.or.kr/bbs/board_ view.asp?BoardID=201111281321074458&id= 1217&Order=010200&search_target=&keywo rd=&Flag=010000/ (Downloaded 10 October, 2012).
  20. Park. J. S., N. W. Kim, M. J. Choi, C. Kim, and Y. S. Choi, "Semantic Search : A Survey", Journal of Intelligence and Information Systems, Vol.17, No.4(2011), 19-36.
  21. Pellet, Clark and Parsia, 2004. Available at http:// clarkparsia.com/pellet(Downloaded 8 June, 2012).
  22. Posada, J., C. Toro, S. Wundrak, and A. Stork, "Ontology Modelling of Industry Standards for Large Model Visualization and Desing Review using Protege", 8th Intl. Protege Conference, July, 2005.
  23. Prud'hommeaux, E. and A. Seaborne. Editors working draft. SPARQL Query Language for RDF, W3C, 2006. Available at http://www. w3.org/2001/sw/DataAccess/rq23/(Downloaded 7 July, 2012).