DOI QR코드

DOI QR Code

Comparative Analysis of NoSQL Database's Activities and Scalability Investigation With Library Introspection

  • Seo, Chang-Ho (School of Computer Science and Engineering, Kyungpook National University) ;
  • Tak, Byungchul (Dept. of Computer Science and Engineering, Kyungpook National University)
  • Received : 2020.08.03
  • Accepted : 2020.08.31
  • Published : 2020.09.29

Abstract

In this paper, we propose a method of in-depth analysis of internal operation process by recording library calls and related information that occur in the operation process of NoSQL database. It observes and records the specified library calls, compares the internal behavior differences between the NoSQL databases through recorded library call information, and evaluates the characteristics and scalability of each database by observing changes in the number of input data. The development of computing performance and the activation of big data have led to the emergence of different types of NoSQL databases for recording and analyzing various and large amounts of data, and it is necessary to evaluate the scalability of each database in order to select a database suitable for each environment. However, it is difficult to analyze or predict how a database operates in traditional ways, such as benchmarking, observing external behavior through performance models, or analyzing structural features based on design. Therefore, it is necessary to utilize the techniques proposed in this paper to understand the scalability of NoSQL databases with high accuracy.

이 논문에서는 NoSQL 데이터베이스의 동작 과정에서 발생하는 라이브러리 콜과 관련 정보들을 기록하여 내부 동작 과정을 심층적으로 분석하는 방법을 제안한다. 이를 통해 지정한 라이브러리 콜을 관찰 및 기록하며, 기록된 라이브러리 콜 정보를 통해 NoSQL 데이터베이스 간 내부 동작 차이를 비교하고, 입력 데이터 개수의 변화에 따라 발생하는 라이브러리 콜의 변화를 관찰하여 각 데이터베이스의 특징 및 확장성을 평가한다. 컴퓨팅 성능의 발전과 빅테이터의 활성화에 따라 다양하고 많은 양의 데이터를 기록 및 분석하기 위한 여러 종류의 NoSQL 데이터베이스가 등장하였으며, 각 환경에 적합한 데이터베이스를 선택하기 위해 각 데이터베이스의 확장성을 평가할 필요가 있다. 그러나 벤치마크, 성능 모델을 통한 외부 동작 관찰 또는 설계에 따른 구조적 특징 분석과 같은 기존의 방식으로는 데이터베이스가 동작하는 과정을 분석 또는 예측하기 어렵다. 따라서, 더욱 심층적인 분석을 통해 동작 과정 및 확장성을 파악하는 본 논문에서 제안하는 기법의 활용이 필요하다.

Keywords

References

  1. G. DeCandia, D. Hastorun, M. Jampani, G. Kakulapati, A. Lakshman, A. Pilchin, S. Sivasubramanian, P. Vosshall and W. Vogels, "Dynamo: amazon's highly available key-value store," ACM SIGOPS operating systems review, Vol. 41, No. 6, pp. 205-220. Oct, 2007. DOI: 10.1145/1294261.1294281
  2. F. Chang, J. Dean, S. Ghemawat, W.C. Hsieh, D.A. Wallach, M. Burrows, T. Chandra, A. Fikes, and R.E. Gruber, "Bigtable: A Distributed Storage System for Structured Data," ACM Transactions on Computer Systems, Vol. 26, No. 2, pp. 1-26. Jun. 2008. DOI: 10.1145/1365815.1365816
  3. A. Lakshman and P. Malik, "Cassandra: a decentralized structured storage system," ACM SIGOPS Operating Systems Review, Vol. 44, No. 2, pp. 35-40 Apr. 2010. DOI: 10.1145/1773912.1773922
  4. R. Cattell, "Scalable SQL and NoSQL data stores," Acm SIGMOD Record, Vol. 39, No. 4, pp. 12-27, May. 2011. DOI: 10.1145/1978915.1978919
  5. R. Hecht and S. Jablonski. "NoSQL evaluation: A use case oriented survey." 2011 International Conference on Cloud and Service Computing, pp. 336-341, Hong Kong, Dec. 2011, DOI: 10.1109/CSC.2011.6138544
  6. A. Corbellini, C. Mateos, A. Zunino, D. Godoy, and S. Schiaffino, "Persisting big-data: The NoSQL landscape," Information Systems, Vol. 63, pp. 1-23, Jan. 2017. DOI: 10.1016/j.is.2016.07.009
  7. E. Dede, M. Govindaraju, D. Gunter, R.S. Canon, and L. Ramakrishnan, "Performance evaluation of a mongodb and hadoop platform for scientific data analysis." Proceedings of the 4th ACM workshop on Scientific cloud computing, pp. 13-20, NY, USA, Jun. 2013. DOI: 10.1145/2465848.2465849
  8. F. Karniavoura, and K. Magoutis, "A Measurement-Based Approach to Performance Prediction in NoSQL Systems," Proceedings of the 25th IEEE International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems, MASCOTS, pp. 255-262, Banff, AB, Canada, Sept. 2017. DOI: 10.1109/MASCOTS.2017.31
  9. J. Kuhlenkamp, M. Klems and O. Ross, "Benchmarking scalability and elasticity of distributed database systems," Proceedings of the VLDB Endowment, Vol. 7, No. 12, pp. 1219-1230, Aug. 2014. DOI: 10.14778/2732977.2732995
  10. A. Kamsky, "Adapting TPC-C Benchmark to Measure Performance of Multi-Document Transactions in MongoDB," Proceedings of the VLDB Endowment, Vol. 12, No. 12, pp. 2254-2262, Aug. 2019. DOI: 10.14778/3352063.3352140
  11. V.A.E. Farias, F.R.C. Sousa, J.G.R. Maia, J.P.P. Gomes, and J.C. Machado, "Regression based performance modeling and provisioning for NoSQL cloud databases." Future Generation Computer Systems Vol. 79, No. 1 pp. 72-81, Feb. 2018. DOI: 10.1016/j.future.2017.08.061