JOURNAL BROWSE
Search
Advanced SearchSearch Tips
Scalable Ontology Reasoning Using GPU Cluster Approach
facebook(new window)  Pirnt(new window) E-mail(new window) Excel Download
  • Journal title : Journal of KIISE
  • Volume 43, Issue 1,  2016, pp.61-70
  • Publisher : Korean Institute of Information Scientists and Engineers
  • DOI : 10.5626/JOK.2016.43.1.61
 Title & Authors
Scalable Ontology Reasoning Using GPU Cluster Approach
Hong, JinYung; Jeon, MyungJoong; Park, YoungTack;
 
 Abstract
In recent years, there has been a need for techniques for large-scale ontology inference in order to infer new knowledge from existing knowledge at a high speed, and for a diversity of semantic services. With the recent advances in distributed computing, developments of ontology inference engines have mostly been studied based on Hadoop or Spark frameworks on large clusters. Parallel programming techniques using GPGPU, which utilizes many cores when compared with CPU, is also used for ontology inference. In this paper, by combining the advantages of both techniques, we propose a new method for reasoning large RDFS ontology data using a Spark in-memory framework and inferencing distributed data at a high speed using GPGPU. Using GPGPU, ontology reasoning over high-capacity data can be performed as a low cost with higher efficiency over conventional inference methods. In addition, we show that GPGPU can reduce the data workload on each node through the Spark cluster. In order to evaluate our approach, we used LUBM ranging from 10 to 120. Our experimental results showed that our proposed reasoning engine performs 7 times faster than a conventional approach which uses a Spark in-memory inference engine.
 Keywords
scalable RDFS reasoning;GPGPU;spark;LUBM;
 Language
Korean
 Cited by
1.
GPGPU를 활용한 스파크 기반 공간 연산,손찬승;김대희;박능수;

정보처리학회논문지:컴퓨터 및 통신 시스템, 2016. vol.5. 8, pp.181-188 crossref(new window)
1.
Spatial Computation on Spark Using GPGPU, KIPS Transactions on Computer and Communication Systems, 2016, 5, 8, 181  crossref(new windwow)
 References
1.
Jeffrey Dean, Sanjay Ghemawat, "MapReduce: Simplified Data Processing on Large Clusters," Communications of the ACM, 2008.

2.
Matei Zaharia, Mosharaf Chowdhury, Michael J. Franklin, Scott Shenker, Ion Shoica, "Spark: Cluster Computing with Working Sets," HotCloud 2010, Jun. 2010.

3.
Matei Zaharia, Mosharaf Chowdhury, Tathagata Das, Ankur Dave, Justin Ma, Murphy McCauley, Michael J. Franklin, Scott Shenker, Ion Stoica, "Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-Memory Cluster Computing," NSDI 2012, Apr. 2012.

4.
Norman Heino, Jeff Z. Pan, "RDFS Reasoning on Massively Parallel Hardware," The Semantic Web-ISWC 2012, Vol. 7649, pp 133-148, 2012.

5.
Martin Peters, Christopher Brink, Sabine Sachweh, Albert Zundorf, "Rule-based Reasoning on Massively Parallel Hardware," SSWS 2013, Vol. 1046, pp. 33-48, 2013.

6.
Oren Segal, Philip Colangelo, Nashbeh Nasiri, Zhuo Qian, Martin Margala, "SparkCL: A Unified Programming Framework for Accelerators on Heterogeneous Clusters," 2015.

7.
Jacopo Urbani, et al., "WebPIE: A Web-scaleparallel inference engine using MapReduce," WebSemantics: Science, Services and Agents on theWorld Wide Web 10, pp. 59-75, 2012. crossref(new window)

8.
Thusoo, Ashish, et al., "Hive: a warehousing solution over a map-reduce framework," Proc. of the VLDB Endowment 2.2, pp. 1626-1629, 2009. crossref(new window)

9.
Wan-Gon Lee, Je-Min Kim, Young-Tack Park, "Distributed Table Join for Scalable RDFS Reasoning on Cloud Computing Environment," Journal of KIISE, Vol. 41, No. 9, pp. 674-685, Sep. 2014. crossref(new window)

10.
Kornacker, Marcel, and Justin Erickson, "Cloudera Impala: real-time queries in Apache Hadoop, for real," 2012-10 [2013-02]. [Online]. Available: http://blog.cloudera.com/blog/2012/10/cloudera-impalareal-time-queries-in-apache-hadoop-for-real, 2012.

11.
Jagvaral Batselem, Young-Tack Park, "Distributed scalable RDFS reasoning," Big Data and Smart Computing(BigComp), pp. 31-34, 2015.

12.
John E. Stone, David Gohara, Guochun Shi, "OpenCL: A parallel programming standard for heterogeneous computing systems," Computing in science and engineering, 2010.

13.
Yonghong Yan, Max Grossman, Vivek Sarkar, "JCUDA: A Programmer-Friendly Interface for Accelerating Java Programs with CUDA," Euro-Par 2009 Parallel Processing Lecture Notes in Computer Science, Vol. 5704, pp. 887-899, 2009.

14.
Shrinivas Joshi, "Leveraging Aparapi to Help Improve Financial Java Application Performance," AMD, 2012.

15.
Stephen Warshall, "A Theorem on Boolean Matrices," Journal of the ACM(JACM), 1962.

16.
Klyne, Graham, and Jeremy J. Carroll, "Resource Description Framework (RDF): Concepts and abstract syntax," 2006.

17.
Patric Hayes, Brain McBride, "RDF Semantics. Technical report, W3C Recommendation," 2004.