- Volume 21 Issue 4
Knowledge map is widely used to represent knowledge in many domains. This paper presents a method of integrating the national R&D data and assists of users to navigate the integrated data via using a knowledge map service. The knowledge map service is built by using a lightweight ontology and a topic modeling method. The national R&D data is integrated with the research project as its center, i.e., the other R&D data such as research papers, patents, and reports are connected with the research project as its outputs. The lightweight ontology is used to represent the simple relationships between the integrated data such as project-outputs relationships, document-author relationships, and document-topic relationships. Knowledge map enables us to infer further relationships such as co-author and co-topic relationships. To extract the relationships between the integrated data, a Relational Data-to-Triples transformer is implemented. Also, a topic modeling approach is introduced to extract the document-topic relationships. A triple store is used to manage and process the ontology data while preserving the network characteristics of knowledge map service. Knowledge map can be divided into two types: one is a knowledge map used in the area of knowledge management to store, manage and process the organizations' data as knowledge, the other is a knowledge map for analyzing and representing knowledge extracted from the science & technology documents. This research focuses on the latter one. In this research, a knowledge map service is introduced for integrating the national R&D data obtained from National Digital Science Library (NDSL) and National Science & Technology Information Service (NTIS), which are two major repository and service of national R&D data servicing in Korea. A lightweight ontology is used to design and build a knowledge map. Using the lightweight ontology enables us to represent and process knowledge as a simple network and it fits in with the knowledge navigation and visualization characteristics of the knowledge map. The lightweight ontology is used to represent the entities and their relationships in the knowledge maps, and an ontology repository is created to store and process the ontology. In the ontologies, researchers are implicitly connected by the national R&D data as the author relationships and the performer relationships. A knowledge map for displaying researchers' network is created, and the researchers' network is created by the co-authoring relationships of the national R&D documents and the co-participation relationships of the national R&D projects. To sum up, a knowledge map-service system based on topic modeling and ontology is introduced for processing knowledge about the national R&D data such as research projects, papers, patent, project reports, and Global Trends Briefing (GTB) data. The system has goals 1) to integrate the national R&D data obtained from NDSL and NTIS, 2) to provide a semantic & topic based information search on the integrated data, and 3) to provide a knowledge map services based on the semantic analysis and knowledge processing. The S&T information such as research papers, research reports, patents and GTB are daily updated from NDSL, and the R&D projects information including their participants and output information are updated from the NTIS. The S&T information and the national R&D information are obtained and integrated to the integrated database. Knowledge base is constructed by transforming the relational data into triples referencing R&D ontology. In addition, a topic modeling method is employed to extract the relationships between the S&T documents and topic keyword/s representing the documents. The topic modeling approach enables us to extract the relationships and topic keyword/s based on the semantics, not based on the simple keyword/s. Lastly, we show an experiment on the construction of the integrated knowledge base using the lightweight ontology and topic modeling, and the knowledge map services created based on the knowledge base are also introduced.
Ontology;Topic Modeling;Knowledgebase;Knowledge Map;Information Integration
- Ahmad, M. N. and R. M. Colomb, "Managing ontologies: a comparative study of ontology servers," Proceedings of the eighteenth conference on Australasian database, Vol.63 (2007), 13-22.
- Blei, D. M., A. Y. Ng and M. I. Jordan, "Latent dirichlet allocation," The Journal of machine Learning research, Vol.3(2003), 993-1022.
- Brickley, D. and R. V. Guha, RDF Schema 1.1, W3C, 2014, Available at http://www.w3.org/TR/rdf-schema/ (Downloaded 14 December, 2015).
- Businska, L., I. Supulniece and M. Kirikova, "On data, information, and knowledge representation in business process models," Information Systems Development, Springer New York, 2013, 613-627.
- Eppler, M. J., "Making knowledge visible through intranet knowledge maps: concepts, elements, cases," Proceedings of the 34th Annual Hawaii International Conference on System Sciences, (2001), 9-18.
- Hofmann, T., "Probabilistic latent semantic indexing," Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval, (1999), 50-57.
- Howard, R. A., "Knowledge maps," Management science, Vol.35, No.8(1989), 903-922. https://doi.org/10.1287/mnsc.35.8.903
- Kang, I., Y. Park, and Y. Kim, "A framework for designing a workflow-based knowledge map," Business process management journal, Vol.9, No.3(2003), 281-294. https://doi.org/10.1108/14637150310477894
- Klavans, R. and K. W. Boyack, "Toward a consensus map of science," Journal of the American Society for information science and technology, Vol.60, No.3(2009), 455-476. https://doi.org/10.1002/asi.20991
- Leydesdorff, L. and I. Rafols, "A global map of science based on the ISI subject categories," Journal of the American Society for Information Science and Technology, Vol.60, No.2(2009), 348-362. https://doi.org/10.1002/asi.20967
- McCagg, E. C. and D. F. Dansereau, "A convergent paradigm for examining knowledge mapping as a learning strategy," The Journal of Educational Research, Vol.84, No.6(1991), 317-324. https://doi.org/10.1080/00220671.1991.9941812
- Morbach, J., A. Wiesner, and W. Marquardt, "OntoCAPE-A (re) usable ontology for computer-aided process engineering," Computers & Chemical Engineering, Vol.33, No.10 (2009), 1546-1556. https://doi.org/10.1016/j.compchemeng.2009.01.019
- W3C RDF Working Group, Resource Description Framework (RDF) 1.1, W3C, 2014, Available at http://www.w3.org/RDF/(Downloaded14 December,2015).
- Prud'hommeaux, E. and A. Seaborne, SPARQL Query Language for RDF, W3C, 2008, Available at http://www.w3.org/TR/rdfsparql-query/ (Downloaded 14 December, 2015).
- Rao, L., G. Mansingh and K. M. Osei-Bryson, "Building ontology based knowledge maps to assist business process re-engineering," Decision Support Systems, Vol.52, No.3(2012), 577-589. https://doi.org/10.1016/j.dss.2011.10.014
- Salton, G. and M. J. Mcgill, Introduction to modern information retrieval, McGraw-Hill, New York, 1986.
- W3C OWL Working Group, OWL2 Web Ontology Language (Second Edition), W3C, 2012, Available at http://www.w3.org/TR/2012/RECowl2-overview-20121211/ (Downloaded 14 December, 2015).
- Blei, D. M., "Probabilistic topic models," Communications of the ACM, Vol.55, No.4(2012), 77-84. https://doi.org/10.1145/2133806.2133826