Semantic-Based Web Information Filtering Using WordNet

어휘사전 워드넷을 활용한 의미기반 웹 정보필터링

  • 변영태 (홍익대학교 전자계산학과) ;
  • 황상규 (홍익대학교 대학원 전자계산학과) ;
  • 오경묵 (숙명여자대학교 정보과학부)
  • Published : 1999.11.01

Abstract

Information filtering for internet search, in which new information retrieval environment is given, is different from traditional methods such as bibliography information filtering, news-group and E-mail filtering. Therefore, we cannot expect high performance from the traditional information filtering models when they are applied to the new environment. To solve this problem, we inspect the characteristics of the new filtering environment, and propose a semantic-based filtering model which includes a new filtering method using WordNet. For extracting keywords from documents, this model uses the SDCC(Semantic Distance for Common Category) algorithm instead of the TF/IDF method usually used by traditional methods. The world sense ambiguation problem, which is one of causes dropping efficiency of internet search, is solved by this method. The semantic-based filtering model can filter web pages selectively with considering a user level and we show in this paper that it is more convenient for users to search information in internet by the proposed method than by traditional filtering methods.

Keywords