Advanced SearchSearch Tips
ABox Realization Reasoning in Distributed In-Memory System
facebook(new window)  Pirnt(new window) E-mail(new window) Excel Download
  • Journal title : Journal of KIISE
  • Volume 42, Issue 7,  2015, pp.852-859
  • Publisher : Korean Institute of Information Scientists and Engineers
  • DOI : 10.5626/JOK.2015.42.7.852
 Title & Authors
ABox Realization Reasoning in Distributed In-Memory System
Lee, Wan-Gon; Park, Young-Tack;
As the amount of knowledge information significantly increases, a lot of progress has been made in the studies focusing on how to reason large scale ontology effectively at the level of RDFS or OWL. These reasoning methods are divided into TBox classifications and ABox realizations. A TBox classification mainly deals with integrity and dependencies in schema, whereas an ABox realization mainly handles a variety of issues in instances. Therefore, the ABox realization is very important in practical applications. In this paper, we propose a realization method for analyzing the constraint of the specified class, so that the reasoning system automatically infers the classes to which instances belong. Unlike conventional methods that take advantage of the object oriented language based distributed file system, we propose a large scale ontology reasoning method using spark, which is a functional programming-based in-memory system. To verify the effectiveness of the proposed method, we used instances created from the Wine ontology by W3C(120 to 600 million triples). The proposed system processed the largest 600 million triples and generated 951 million triples in 51 minutes (696 K triple / sec) in our largest experiment.
ontology;large scale reasoning;realization;big data;spark;
 Cited by
동적 분산병렬 하둡시스템 및 분산추론기에 응용한 서버가상화 빅데이터 플랫폼,송동호;신지애;인연진;이완곤;이강세;

Journal of the Korean Data and Information Science Society, 2015. vol.26. 5, pp.1129-1139 crossref(new window)
An elastic distributed parallel Hadoop system for bigdata platform and distributed inference engines, Journal of the Korean Data and Information Science Society, 2015, 26, 5, 1129  crossref(new windwow)
Weaver, Jesse, and James A. Hendler, "Parallel materialization of the finite rdfs closure for hundreds of millions of triples," The Semantic Web-ISWC 2009. Springer Berlin Heidelberg, 2009, 682-697.

Oren, Eyal, et al., "Marvin: Distributed reasoning over large-scale Semantic Web data," Web Semantics: Science, Services and Agents on the World Wide Web7.4 (2009): 305-316. crossref(new window)

Oren, Eyal, et al., "Marvin: A platform for largescale analysis of Semantic Web data," (2009).

Klyne, Graham, and Jeremy J. Carroll, "Resource description framework (RDF): Concepts and abstract syntax," 2006.

Brickley, Dan, and Ramanathan V. Guha, "{RDF vocabulary description language 1.0: RDF schema}," 2004.

Jena, Apache, "Apache jena," jena. apache. org [Online]. Available: [Accessed: Mar. 20, 2014] (2013).

Broekstra, Jeen, Arjohn Kampman, and Frank Van Harmelen, "Sesame: A generic architecture for storing and querying rdf and rdf schema," The Semantic Web-ISWC 2002. Springer Berlin Heidelberg, pp. 54-68, 2002.

Urbani, Jacopo, et al., "WebPIE: A Web-scale parallel inference engine using MapReduce," Web Semantics: Science, Services and Agents on the World Wide Web 10, pp. 59-75, 2012. crossref(new window)

Urbani, Jacopo, et al., "WebPIE: a web-scale parallel inference engine," Third IEEE International Scalable Computing Challenge (SCALE2010), Held in Conjunction with the 10th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid 2010), Melbourne, Australia. 2010.

Zaharia, Matei, et al., "Spark: cluster computing with working sets," Proc. of the 2nd USENIX conference on Hot topics in cloud computing, 2010.

Zaharia, Matei, et al., "Resilient distributed datasets: A fault-tolerant abstraction for in-memory cluster computing," Proc. of the 9th USENIX conference on Networked Systems Design and Implementation, USENIX Association, 2012.