DOI QR코드

DOI QR Code

Dynamic Decision Tree for Data Mining

데이터마이닝을 위한 동적 결정나무

  • Choi, Byong-Su (Department of Multimedia Engineering, Hansung University) ;
  • Cha, Woon-Ock (Department of Multimedia Engineering, Hansung University)
  • 최병수 (한성대학교 멀티미디어공학과) ;
  • 차운옥 (한성대학교 멀티미디어공학과)
  • Received : 20090900
  • Accepted : 20091000
  • Published : 2009.11.30

Abstract

Decision tree is a typical tool for data classification. This tool is implemented in DAVIS (Huh and Song, 2002). All the visualization tools and statistical clustering tools implemented in DAVIS can communicate with the decision tree. This paper presents methods to apply data visualization techniques to the decision tree using a real data set.

결정나무는 데이터마이닝에서 데이터를 분류하는 기법으로 가장 많이 사용되고 있으며, 데이터 탐색 소프트웨어 DAVIS에서는 동적 기능을 사용하여 데이터 시각화를 하는 것이 가능하다. 본 논문에서는 동적 데이터 분석의 기본 원리와 이를 결정나무에 적용하는 방법을 소개하고, 생성되는 동적 결정나무의 효율성과 유용성을 실제 데이터를 사용하여 분석한다.

Keywords

References

  1. Breiman, L., Friedman, J. H., Olshen, R. A. and Stone, C. J. (1984). Classification and Regression Trees, Monterey, CA: Wadsworth & Brooks/Cole Advanced Books & Software
  2. Cleveland, W. S. and McGill, M. E. (1988). Dynamic Graphics for Statistics, Wadsworth & Brooks/Cole, Belmont, CA
  3. Huh, M. Y. (2001). Strategy for visual clustering, The Korean Journal of Applied Statistics, 4, 177–190. (in Korean)
  4. Huh, M. Y. (1995). Exploring multidimensional data with FEDF, Journal of Computational and Graphical Statistics, 4, 335–343
  5. Huh, M. Y. (2009). http://stat.skku.ac.kr/myhuh.
  6. Huh, M. Y. and Song, K. Y. (2002). DAVIS: A Java-based data visualization system, Computational Statistics, 17, 411–423
  7. Kass, G. V. (1980) An exploratory technique for investigating large quantities of categorical data, Applied Statistics, 29, 119–127
  8. Nocholas, C. J. (1999). The emergence of data visualization and prospects for its business application, Masters of Information Systems Management Professional Seminar
  9. Quinlan, J. R. ( 1986). Induction of decision trees, Machine Learning, 1, 81–106 https://doi.org/10.1007/BF00116251
  10. Quinlan, J. R. (1993). C4.5: Programs for Machine Learning, Morgan Kaufmann Publishers
  11. Quinlan, J. R. (1996). Improved use of continuous attributes in c4.5., Journal of Artificial Intelligence Research, 4, 77–90
  12. Witten, I. H. and Frank, E. (2005). Data Mining: Practical Machine Learning Tools and Techniques, (Second Edition), Morgan Kaufmann