DOI QR코드

DOI QR Code

Constructing the Semantic Information Model using A Collective Intelligence Approach

  • Lyu, Ki-Gon (Dept. of Computer Science Education, Korea University) ;
  • Lee, Jung-Yong (Dept. of Computer Science Education, Korea University) ;
  • Sun, Dong-Eon (Dept. of Computer Science Education, Korea University) ;
  • Kwon, Dai-Young (Dept. of Computer Science Education, Korea University) ;
  • Kim, Hyeon-Cheol (Dept. of Computer Science Education, Korea University)
  • Received : 2011.04.08
  • Accepted : 2011.09.26
  • Published : 2011.10.31

Abstract

Knowledge is often represented as a set of rules or a semantic network in intelligent systems. Recently, ontology has been widely used to represent semantic knowledge, because it organizes thesaurus and hierarchal information between concepts in a particular domain. However, it is not easy to collect semantic relationships among concepts. Much time and expense are incurred in ontology construction. Collective intelligence can be a good alternative approach to solve these problems. In this paper, we propose a collective intelligence approach of Games With A Purpose (GWAP) to collect various semantic resources, such as words and word-senses. We detail how to construct the semantic information model or ontology from the collected semantic resources, constructing a system named FunWords. FunWords is a Korean lexical-based semantic resource collection tool. Experiments demonstrated the resources were grouped as common nouns, abstract nouns, adjective and neologism. Finally, we analyzed their characteristics, acquiring the semantic relationships noted above. Common nouns, with structural semantic relationships, such as hypernym and hyponym, are highlighted. Abstract nouns, with descriptive and characteristic semantic relationships, such as synonym and antonym are underlined. Adjectives, with such semantic relationships, as description and status, illustration - for example, color and sound - are expressed more. Last, neologism, with the semantic relationships, such as description and characteristics, are emphasized. Weighting the semantic relationships with these characteristics can help reduce time and cost, because it need not consider unnecessary or slightly related factors. This can improve the expressive power, such as readability, concentrating on the weighted characteristics. Our proposal to collect semantic resources from the collective intelligence approach of GWAP (our FunWords) and to weight their semantic relationship can help construct the semantic information model or ontology would be a more effective and expressive alternative.

Keywords

References

  1. R. Davis, H. Shrobe, P. Szolovits, "What is a Knowledge Representation," AI Magazine, vol. 14, no. 1, pp. 17-33, 1993.
  2. G.A. Miller, "WordNet: A Lexical Database for English," Communications of the ACM, vol. 38, no. 11, pp. 39-41, 1995.
  3. N. Guarino, "Formal Ontology, Conceptual Analysis and Knowledge Representation," International Journal of Human Computer Studies, 1995.
  4. P. Spyns, A. de Moor, J. Vandenbussche, R. Meersman, "From Folksologies to Ontologies: How the Twain Meet," Lecture Notes in Computer Science, vol. 4275, pp. 735-755, 2006.
  5. L. von Ahn, M. Kedia, M. Blum, "Verbosity: a game for collecting common-sense facts," in Proc. of the SIGCHI conference on Human Factors in computing systems, pp. 75-78, 2006.
  6. L. von Ahn, "Games with a purpose," Computer, vol. 39, no. 6, pp. 92-94, 1999.
  7. E.L.M. Law, L. von Ahn, R.B. Dannenberg, M. Crawford. "Tagatune: A game for music and sound annotation," in Proc. of the International Conference on Music Information Retrieval, pp. 361-364, 2003.
  8. D.B. Lenat, "CYC: A Large-Scale Investment in Knowledge Infrastructure," ACM Communications, vol. 38, no. 11, pp. 32-28, 1995. https://doi.org/10.1145/226239.226247
  9. G.A. Miller, "WordNet: a lexical database for English," ACM Bibliometrics, vol. 38, no. 11, pp. 39-41, 1995.
  10. G.A. Miller, R. Beckwith, C. Fellbaum, D. Gross4, K.J. Miller, "Introduction to WordNet: An On-line Lexical Database," International Journal of Lexicography, vol. 24, no. 2, pp. 235-244, 1990.
  11. E. Navarro, F. Sajous, B. Gaume, L. Prévot, H. ShuKai, K. Tzu-Yi, P. Magistry, H. Chu-Ren, "Wiktionary and NLP: improving synonymy networks," in Proc. of the 2009 Workshop on The People's Web Meets NLP: Collaboratively Constructed Semantic Resources, pp. 19-27, 2009.
  12. T. Zesch, C. Müller, I. Gurevych, "Extracting Lexical Semantic Knowledge from Wikipedia and Wiktionary," in Proc. of the Conference on Language, pp. 1646-1652, 2008.
  13. L. von Ahn, L. Dabbish, "Designing games with a purpose," ACM Communications, vol. 51, no. 8, pp. 58-67, 2008.
  14. L. von Ahn, L. Dabbish, "Labeling images with a computer game," in Proc. of the SIGCHI Conference on Human factors in computing systems, pp. 319-326, 2004.
  15. B. Hladká, J. Mírovský, P. Schlesinger, "Play the Language: Play Coreference," in Proc. of the ACL-IJCNLP Conference, pp. 209-212, 2009.
  16. C.A. Bean, R. Green, "Relationships in the organization of knowledge," Kluwer Academic Publichers, 2001.
  17. R. Green, C.A. Bean, S.H. Myaeng, "The semantics of relationships: an interdisciplinary perspective," Kluwer Academic Publichers, 2001.
  18. B. Rosario, M. Hearst, "Classifying the Semantic Relations in Noun Compounds via a Domain-Specific Lexical Hierarchy," in Proc. of the Conference on Empirical Methods in Natural Language Processing, pp.82-90, 2001.