DOI QR코드

DOI QR Code

Development of a CUBRID-Based Distributed Parallel Query Processing System

  • Kim, Hyeong-Il (The 1st Missile Systems PMO, Agency for Defense Development) ;
  • Yang, HyeonSik (Dept. of Information and Technology, Chonbuk National University) ;
  • Yoon, Min (The 1st R&D Institute - 4th Directorate, Agency for Defense Development) ;
  • Chang, Jae-Woo (Dept. of Information and Technology, Chonbuk National University)
  • Received : 2015.01.29
  • Accepted : 2016.11.30
  • Published : 2017.06.30

Abstract

Due to the rapid growth of the amount of data, research on bigdata processing has been highlighted. For bigdata processing, CUBRID Shard is able to support query processing in parallel way by dividing the database into a number of CUBRID servers. However, CUBRID Shard can answer a user's query only when the query is required to gain accesses to a single CUBRID server, instead of multiple ones. To solve the problem, in this paper we propose a CUBRID based distributed parallel query processing system that can answer a user's query in parallel and distributed manner. Finally, through the performance evaluation, we show that our proposed system provides 2-3 times better performance on query processing time than the existing CUBRID Shard.

Keywords

References

  1. D. H. Lee, "Personalizing information using users' online social networks: a case study of CiteULike," Journal of Information Processing Systems, vol. 11, no. 1, pp. 1-21, 2015. https://doi.org/10.3745/JIPS.04.0014
  2. J. Lv, J. Guo, and H. Ren, "Efficient greedy algorithms for influence maximization in social networks," Journal of Information Processing Systems, vol. 10, no. 3, pp. 471-482, 2014. https://doi.org/10.3745/JIPS.04.0003
  3. D. Jiang, G. Chen, B. C. Ooi, K. L. Tan, and S. Wu, "epiC: an extensible and scalable system for processing big data," Proceedings of the VLDB Endowment, vol. 7, no. 7, pp. 541-552, 2014. https://doi.org/10.14778/2732286.2732291
  4. J. Dean and S. Ghemawat, "MapReduce: simplified data processing on large clusters," Communications of the ACM, vol. 51, no. 1, pp. 107-113, 2008. https://doi.org/10.1145/1327452.1327492
  5. H. C. Yang, A. Dasdan, R. L. Hsiao, and D. S. Parker, "Map-reduce-merge: simplified relational data processing on large clusters," in Proceedings of the ACM SIGMOD International Conference on Management of Data, Beijing, China, 2007, pp. 1029-1040.
  6. T. Rabl, S. Gomez-Villamor, M. Sadoghi, V. Muntes-Mulero, H. A. Jacobsen, and S. Mankovskii, "Solving big data challenges for enterprise application performance management," Proceedings of the VLDB Endowment, vol. 5, no. 12, pp. 1724-1735, 2012. https://doi.org/10.14778/2367502.2367512
  7. Apache Software Foundation, "Apache Hadoop," 2014 [Online]. Available: http://hadoop.apache.org/.
  8. K. Chodorow, MongoDB: The Definitive Guide, 2nd ed. Sebastopol, CA: O'Reilly Media Inc., 2013.
  9. A. Dietrich, S. Mohammad, S. Zug, and J. Kaiser, "ROS meets Cassandra: data management in smart environments with NoSQL," in Proceedings of the 11th International Baltic Conference on DB and IS, Tallinn, Estonia, 2014.
  10. CUBRID Shard [Online]. Available: http://www.cubrid.com/manual/91/shard.html.
  11. M. Stonebraker, "SQL databases v. NoSQL databases," Communications of the ACM, vol. 53, no. 4, pp. 10-11, 2010. https://doi.org/10.1145/1721654.1721659
  12. R. Cattell, "Scalable SQL and NoSQL data stores," ACM SIGMOD, vol. 39, no. 4, pp. 12-27, 2011. https://doi.org/10.1145/1978915.1978919
  13. J. Han, E. Haihong, and G. Le, "Survey on NoSQL database," in Proceedings of 2011 6th international conference on Pervasive computing and applications (ICPCA), Port Elizabeth, South Africa, 2011, pp. 363-366.
  14. CUBRID [Online]. Available: http://www.cubrid.com/.
  15. V. Saravanan, K. D. Pralhaddas, D. P. Kothari, and I. Woungang, "An optimizing pipeline stall reduction algorithm for power and performance on multi-core CPUs," Human-centric Computing and Information Sciences, vol. 5, no. 1, article no. 2, 2015.
  16. Y. Li, D. Kim, and B. S. Shin, "Geohashed spatial index method for a location-aware WBAN data monitoring system based on NoSQL," Journal of Information Processing Systems, vol. 12, no. 2, pp. 263-274, 2016. https://doi.org/10.3745/JIPS.04.0025
  17. M. Lee, Y. S. Park, M. H. Kim, and J. W. Lee, "A convergence data model for medical information related to acute myocardial infarction," Human-centric Computing and Information Sciences, vol. 6, no. 1, article no. 15, 2016.
  18. H. I. Kim, M. Yoon, M. Choi, and J. W. Chang, "A new middleware for distributed data processing in CUBRID DBMS," Procedia Computer Science, vol. 52, pp.654-658, 2015. https://doi.org/10.1016/j.procs.2015.05.066
  19. A. Thusoo, J. S. Sarma, N. Jain, Z. Shao, P. Chakka, S. Anthony, H. Liu, P. Wyckoff, and R. Murthy, "Hive: a warehousing solution over a map-reduce framework," Proceedings of the VLDB Endowment, vol. 2, no. 2, pp. 1626-1629, 2009. https://doi.org/10.14778/1687553.1687609
  20. D. J. DeWitt, "The Wisconsin benchmark: past, present, and future," University of Wisconsin, 1993.