Rule Acquisition Using Ontology Based on Graph Search

그래프 탐색을 이용한 웹으로부터의 온톨로지 기반 규칙습득

  • Park, Sangun (The u-City Research Institute, Yonsei University) ;
  • Lee, Jae Kyu (Management Engineering, KAIST GraduateSchool of Management) ;
  • Kang, Juyoung (Department of E-Business, School of Business, Ajou University)
  • 박상언 (연세대학교 u-City 융합서비스 연구개발단) ;
  • 이재규 (한국과학기술원 테크노경영대학원) ;
  • 강주영 (아주대학교 e비즈니스학부)
  • Published : 2006.09.30

Abstract

To enhance the rule-based reasoning capability of Semantic Web, the XRML (eXtensible Rule Markup Language) approach embraces the meta-information necessary for the extraction of explicit rules from Web pages and its maintenance. To effectuate the automatic identification of rules from unstructured texts, this research develops a framework of using rule ontology. The ontology can be acquired from a similar site first, and then can be used for multiple sites in the same domain. The procedure of ontology-based rule identification is regarded as a graph search problem with incomplete nodes, and an A* algorithm is devised to solve the problem. The procedure is demonstrated with the domain of shipping rates and return policy comparison portal, which needs rule based reasoning capability to answer the customer's inquiries. An example ontology is created from Amazon.com, and is applied to the many online retailers in the same domain. The experimental result shows a high performance of this approach.

지능형 에이전트와 규칙기반 시스템을 이용해 보다 지능적인 웹 환경을 구축하고자 하는 노력이 시맨틱 웹의 발전과 함께 증가하고 있다. 이러한 에이전트와 규칙기반 시스템에 필요한 규칙들을 이미 많은 지식들이 산재해 있는 웹으로부터 습득할 수 있다면 보다 효율적으로 시스템을 구축하는 것이 가능하며, 이러한 응용시스템의 확장은 시맨틱 웹의 발전을 더욱 가속화하는 계기가 될 수 있을 것이다. XRML 방법론은 웹으로부터 규칙을 습득하기 위한 단계적 방법을 제시하고 있으며, 온톨로지를 이용함으로써 규칙의 구성요소들을 자동으로 추출할 수 있도록 지원한다. 그러나 추출된 규칙구성요소들을 조합하여 완전한 규칙을 만드는 과정이 규칙관리자의 수작업에 의존하고 있다. 본 연구는 온톨로지와 그래프 탐색을 사용함으로써 이 과정을 자동화하고자 하는 연구이다. 온톨로지에 있는 규칙의 일반적 패턴을 기반으로 하여 그래프 탐색을 이용해 규칙구성요소들을 조합함으로써 웹 페이지로부터 자동으로 규칙을 추출할 수 있다.

Keywords

References

  1. Alani, H., S. Kim, D. E. Millard, M. J. Weal, W. Hall, P. H. Lewis, and N. R. Shadbolt, "Automatic Ontology-Based Knowledge Extraction from Web Documents", IEEE Intelligent Systems, Vol.18, No.1(2003), 14-21.
  2. Beck J. C. and M. Fox, "A Generic Framework for Constraint Directed Search and Scheduling", AI Magazine, Vol.19, No.4 (1998), 101-130.
  3. Berners-Lee, T., J. Hendler, and O. Lassila, The Semantic Web, Scientific American, 2001.
  4. Brickley, D. and R. V. Guha, RDF Vocabulary Description Language 1.0 : RDF Schema, W3C Recommendation, , 2004.
  5. Chae S., W. Kim, and S. Park, Extracting Rule Components from the Web using Ontology, Working Paper, 2006.
  6. Crow, L. and N. Shadbolt, "Extracting Focused Knowledge from the Semantic Web, International", Journal of Human-Computer Studies, Vol.54(2001), 155-184. https://doi.org/10.1006/ijhc.2000.0453
  7. Decker, S., M. Erdmann, D. Fensel, and R. Studer, Ontobroker : Ontology based Access to Distributed and Semi-Structured Information, In R. Meersman et al. (eds.), Database Semantics, Semantic Issues in Multimedia Systems, Kluwer Academic Publisher, Boston, 1999.
  8. Donini, F. M., M. Lenzerini, D. Nardi, and A. Schaerf, "Reasoning in Description Logics, Principles of Knowledge Representative," Ed : G. Brweka,. CSLI Publications, Stanford, (1996), 191-236.
  9. Grosof, B. N., I. Horrocks, R. Volz, and S. Decker, Description Logic Programs : Combining Logic Programs with Description Logic, In Proceedings of 12th International Conference on the World Wide Web (WWW-2003), Budapest, Hungary, 2003.
  10. Hemnani, A. and S. Bressan, "Extracting Information from Semi-Structured Web Documents", Lecture Notes in Computer Science, Vol.2426(2002), 166-175.
  11. Horrocks, I., P. F. Patel-Schneider, H. Boley, S. Tabet, B. Grosof, and M. Dean, SWRL, A Semantic Web Rule Language Combining OWL and RuleML, W3C Member Submission, , 2004.
  12. Jones, K. S. and P. Willet (eds.), Readings in Information Retrieval, Morgan Kaufmann Publishers, San Francisco 1997.
  13. Kang, J. and J. K. Lee, "Rule Identification from Web Pages by the XRML Approach", Decision Support Systems, Vol.41, No.1, (2005), 205-227. https://doi.org/10.1016/j.dss.2005.01.004
  14. Lee, J. K. and M. Sohn, "Extensible Rule Markup Language - Toward Intelligent Web Platform", Communications of the ACM, Vol.46(2003), 59-64. https://doi.org/10.1145/769800.769802
  15. Manola, F. and E. Miller, Resource Description Framework (RDF) Primer, W3C Recommendation, , 2004.
  16. Miller, G. A., "WordNet a Lexical Database for English", Communications of the ACM, Vol.38, No.11(1995), 39-41.
  17. Park, S., J. K. Lee., and J. Y. Kang, "The Effect of Knowledge Acquisition through OntoRule : XRML Approach", Journal of Korea Intelligent Information Systems Society, Vol.11, No.2, 151-173.
  18. Pearl. J., Heuristics : Intelligent Search Strategies for Computer Problem Solving, Addison-Wesley, Reading, MA, 1984.
  19. Reynolds, D., Jena 2 Inference Support, 2005.
  20. RuleML, The Rule Markup Initiative, 2003.
  21. Schmidt, G. and T. Wetter, "Using Natural Language Sources in Model-Based Knowledge Acquisition", Data & Knowledge Engineering, Vol.26(1998), 327-356. https://doi.org/10.1016/S0169-023X(97)00038-4
  22. Smith, M. K., C. Welty, and D. McGuinness, OWL Web Ontology Language Guide, W3C Recommendation, 2004.
  23. Volz, R., D. Oberle, S. Staab, and B. Motik, KAON SERVER - A Semantic Web Management System, In Alternate Track Proceedings of the Twelfth International World Wide Web Conference, WWW2003, Budapest, Hungary, 20-24 May 2003. ACM, 2003.
  24. Yang, J., H. Oh, K. G. Doh, and J. Choi, "A Knowledge-Based Information Extraction System for Semi-structured Labeled Documents", Proceedings of the 4th Intelligent Data Engineering and Automated Learning, Lecture Notes in Computer Science, Vol.2412(2002), 105-110.
  25. Zou, Y., T. Finin, and H. Chen, F-OWL, "An Inference Engine for Semantic Web", Proc. 3rd NASA/IEEE Workshop on Formal Approaches to Agent Based Systems, FAABS III, 16-18 April 2004, Greenbelt MD, Lecture Notes in Computer Science, Vol.3228, Springer Verlag, 2004.