DOI QR코드

DOI QR Code

Automated Conceptual Data Modeling Using Association Rule Mining

연관규칙 마이닝을 활용한 개념적 데이터베이스 설계 자동화 기법

  • 손윤호 (국민대학교 비즈니스IT전문대학원) ;
  • 김인규 (국민대학교 비즈니스IT학부) ;
  • 김남규 (국민대학교 비즈니스IT학부)
  • Received : 2009.11.20
  • Accepted : 2009.12.05
  • Published : 2009.12.01

Abstract

Data modeling can be regarded as a series of processes to abstract real-world business concerns. The conceptual modeling phase is often regarded as the most difficult stage in the entire modeling process, because quite different conceptual models may be produced even for similar business domains based on users' varying requirements and the data modelers' diverse perceptions of the requirements. This implies that an object considered as an entity in one domain may be considered as an attribute in another, and vice versa. However, many traditional knowledge-based automated database design systems unfortunately fail to construct appropriate Entity-Relationship Diagrams(ERDs) for a given set of requirements due to the rigid assumption that an object should be classified as an entity if it has been classified as an entity in previous applications. In this paper, we propose an alternative automation system which can generate ERDs from business descriptions using association rule mining technique. Our system can be differentiated from the traditional ones in that our system can perform data modeling only based on business description written by domain workers, rather than relying on any kind of knowledge base. Since the proposed system can produce various versions of ERDs from the same business descriptions simultaneously, users can have the opportunity to choose one of the ERDs as being the most appropriate, based on their business environment and requirements. We performed a case study for personnel management in a university to evaluate the practicability of the proposed system This paper summarizes the result of it in the experiment section.

Keywords

Acknowledgement

Supported by : 국민대학교

References

  1. 김병곤, 박순창, "데이터 웨어하우징의 성공적 구현에 영향을 미치는 요인," 정보시스템연구, 제17권 제4호, 2008, pp. 51-77. https://doi.org/10.5859/KAIS.2008.17.4.051
  2. 정대율, "객체지향적 모델관리시스템을 위한 기능요구분석," 정보시스템연구, 제4권 제1호, 1995, pp. 129-153.
  3. 정윤수, 이춘열, 김남규, "토픽맵의 다중역할 토픽 보존을 위한 관계형 데이터베이스 구조," 정보시스템연구 제18권 제3호, 2009, pp. 327-349.
  4. Agrawal, R., Imielinske, T., and Swami, A., "Mining Association Rules between Sets of Items in Large Databases," In Proceedings of ACM SIGMOD International Conference on Management of Data, Washington D. C., 1993, pp. 207-216.
  5. Bouzeghoub, M., Fabret, F., Llirbat, F., Matulovic, M., and Simon, E., "ACTIVE-DESIGN: A Generic Toolkit for Deriving Specific Rule Execution Models," In Proceedings of the 3rd International Workshop on Rules in Database Systems, Skovde, Sweden, 1987, pp. 197-211.
  6. Dogac, A., Yuruten, B., and Spaccapietra, S., "A Generalized Expert System for Database Design," IEEE Transactions on Software Engineering, Vol. 15. 1989, pp. 479-491. https://doi.org/10.1109/32.16607
  7. Geng, L. and Hamilton, H. J., "Interestingness Measures for Data Mining: A Survey," ACM Computing Surveys, Vol. 38, No. 3, 2006, pp. 1-32. https://doi.org/10.1145/1132952.1132953
  8. Kilic, E., Ozhan, G., Dengi, C., Kesim, N., Koksal, P., and Dogac, A., "Experiences in Using CORBA for a Multidatabase Implementation," In Proceedings of the 6th International Workshop on Database and Expert System Applications, London, UK, 1995, pp. 223-230.
  9. Lazarov, A. and Shoval, P., "A Rule-based System for Automatic Assignment of Technicians to Service Faults," Decision Support Systems, Vol. 32, No. 4, 2002, pp. 343-360. https://doi.org/10.1016/S0167-9236(01)00122-1
  10. Lenca, P., Vaillant, B., Meyer, P., and Lallich, S., "Association Rule Interestingness Measures: Experimental and Theoretical Studies," Quality Measures in Data Mining, Springer, 2007, pp. 51-76.
  11. Lo, W. A. and Choobineh, J., "Knowledge-based Systems as Database Design Tools: A Comparative Study," Journal of Database Management, Vol. 10, No. 3, 1999, pp. 26-40. https://doi.org/10.4018/jdm.1999070103
  12. Mills, K. L. and Gomaa, H., "Knowledge-based Automation of a Design Method for Concurrent Systems," IEEE Transactions on Software Engineering, Vol. 28, No. 3, 2002, pp. 228-255. https://doi.org/10.1109/32.991319
  13. Noah, S. A. and Williams, M., "Exploring and Validating the Contributions of Real-world Knowledge to the Diagnostic Performance of Automated Database Design Tools," In Proceedings of the 5th IEEE International Conference on Automated Software Engineering, 2000, pp. 177-185.
  14. Storey, V. C., View Creation: An Expert System for Database Design, International Center for Information Technologies, Washington, D. C., 1988.
  15. Storey, V. C. and Goldstein, R. C., "Knowledge- based Approaches to Database Design," MIS Quarterly, Vol. 17, 1993, pp. 25-46. https://doi.org/10.2307/249508
  16. Storey, V. C., Goldstein, R. C., and Ullrich, H., "Naive Semantics to Support Automated Database Design," IEEE Transactions on Knowledge and Data Engineering, Vol. 14, 2002, pp. 1-12. https://doi.org/10.1109/69.979969
  17. Teorey, T. J. and Fry, J. P., Design of Database Structures, Prentice-Hall, New Jersey, 1982.
  18. Teorey, T. J., Yang, D., and Fry, J. P., "A Logical Design Methodology for Relational Databases Using the Extended Entity- Relationship Model," ACM computing Surveys, Vol. 18, 1986, pp. 197-222. https://doi.org/10.1145/7474.7475
  19. Vaillant, B., Lenca, P., and Lallich, S., "A Clustering of Interestingness Measure," In Proceedings of 7th International Conference on Discovery Science, Padova, Italy, 2004, pp. 290-297.
  20. Wand, Y., Storey, V. C., and Weber, R., "An Ontological Analysis of the Relationship Construct in Conceptual Modeling," ACM Transactions on Database Systems, Vol. 24, 1999, pp. 494-528. https://doi.org/10.1145/331983.331989
  21. Yang, H. and Chu, W. C., "Acquisition of Entity Relationship Models for Maintenance- Dealing with Data Intensive Programs in a Transformation System," Journal of Information Science and Engineering, Vol. 15, 1999, pp. 173-198.

Cited by

  1. A Study on Plagiarism Detection and Document Classification Using Association Analysis vol.23, pp.3, 2014, https://doi.org/10.5859/KAIS.2014.23.3.127
  2. Using Ontologies for Semantic Text Mining vol.21, pp.3, 2012, https://doi.org/10.5859/KAIS.2012.21.3.137
  3. Methodology for Issue-related R&D Keywords Packaging Using Text Mining vol.16, pp.2, 2015, https://doi.org/10.7472/jksii.2015.16.2.57
  4. A Methodology for Analyzing Public Opinion about Science and Technology Issues Using Text Analysis vol.14, pp.3, 2015, https://doi.org/10.9716/KITS.2015.14.3.033