Advanced SearchSearch Tips
A Review of Science of Databases and Analysis of Its Case Studies
facebook(new window)  Pirnt(new window) E-mail(new window) Excel Download
  • Journal title : Journal of KIISE
  • Volume 43, Issue 2,  2016, pp.237-245
  • Publisher : Korean Institute of Information Scientists and Engineers
  • DOI : 10.5626/JOK.2016.43.2.237
 Title & Authors
A Review of Science of Databases and Analysis of Its Case Studies
Suh, Young-Kyoon; Kim, Jong Wook;
In this paper we introduce a novel database research area called science of databases (SoDB) and carry out a comprehensive analysis of its case studies. SoDB aims to better understand interesting phenomena observed across multiple database management systems (DBMSes). While mathematical and engineering work in the database field has been dominant, less attention has been given to scientific approaches through which DBMSes can be better understood. Scientific investigations can lead to better engineered designs through deeper understanding of query optimizers and transaction processing. The SoDB research has investigated several interesting phenomena observed across different DBMSes and provided several engineering implications based on our uncovered results. In this paper we introduce a novel scientific, empirical methodology and describe the research infrastructure to enable the methodology. We then review each of a selected group of phenomena studied and present an identified structural causal model associated with each phenomenon. We also conduct a comprehensive analysis on the case studies. Finally, we suggest future directions to expand the SoDB research.
science of databases (SoDB);DBMSes;phenomena;empirical studies;structural causal model;engineering implications;
 Cited by
T. Horikawa, "An Approach for Scalability-Bottleneck Solution: Identification and Elimination of Scalability Bottlenecks in a DBMS," SIGSOFT SEN 36, pp. 31-42, 2011.

S. Chaudhuri and U. Dayal, "An Overview of Data Warehousing and OLAP Technology," SIGMOD Record 26, pp. 65-74, 1997. crossref(new window)

S. A. Schuster, "Relational Data Base Management for On-Line Transaction Processing," Technical Report 81.5, Tandem Computers Incorporated, 1981.

R. Snodgrass and llsoo Ahn, "Temporal Databases," IEEE Computer, Vol. 19, No. 9, pp. 35-42, 1986.

R. Kallman, et. al., "H-Store: a High-Performance, Distributed Main Memory Transaction Processing System," PVLDB, Vol. 1, No. 2, pp. 1496-1499, 2008.

M. Stonebraker, et. al., "C-store: a Column-oriented DBMS," VLDB, pp. 553-564, Aug. 2005.

R. Snodgrass, [Online]. Available:, viewed on October 26, 2015.

R. Snodgrass, [Online]. Available:, viewed on October 26, 2015.

R. Snodgrass and P. Denning, "The Science of Computer Science: Closing Statement: The Science of Computer Science (Ubiquity Symposium)," Ubiquity, 2014(6):1-11, Jun. 2014.

C. Morrison and R. Snodgrass, "Computer Science Can Use More Science," Communications of the ACM, Vol. 54, No. 7, pp. 36-39, Jun. 2011.

P. Cohen, Empirical Methods for Artificial Intelligence, MIT Press, 1995.

L. Peterson, et. al., "Experiences Building Planet-Lab," OSDI, pp. 351-366, 2006.

G. Werner-Allen, et. al., "MoteLab: A Wireless Sensor Network Testbed," IPSN, pp. 483-488, 2005.

G. Larkou, et. al., "Managing Smartphone Testbeds with SmartLab," LISA, pp. 115-132, 2013.

E. Cuervo, et. al., "Crowdlab: An Architecture for Volunteer Mobile Testbeds," COMSNETS, pp. 1-10, 2011.

R. Naveen and J. R. Haritsa, "Analyzing Plan Diagrams of Database Query Optimizers," VLDB, pp. 1228-1239, 2005.

Y-K. Suh, et. al., "AZDBLab: A Laboratory Information System for Large-scale Empirical DBMS Studies," PVLDB, Vol. 7, No. 13, pp. 1641-1644, 2014.

S. Currim, et. al., "DBMS Metrology: Measuring Query Time," SIGMOD, pp. 421-432, Jun. 2013.

Y.-K. Suh (2015). Exploring Causal Factors of DBMS Thrashing (Doctoral dissertation). Dept. of Computer Science, Univ. of Arizona, Tucson, AZ.

TPC-H, [Online]. Available:, viewed on Dec. 2015.

TPC-C, [Online]. Available:, viewed on Dec. 2015.

A. F. Hayes, Introduction to Mediation, Moderation, and Conditional Process Analysis: A Regression-Based Approach, Guilford, 2013.