Association Rule Mining Algorithm and Analysis of Missing Values

  • Lee, Jae-Wan (School of Electronic and Information Engineering, Kunsan National University) ;
  • Bobby D. Gerardo (School of Electronic and Information Engineering, Kunsan National University) ;
  • Kim, Gui-Tae (School of Electronic and Information Engineering, Kunsan National University) ;
  • Jeong, Jin-Seob (School of Electronic and Information Engineering, Kunsan National University)
  • Published : 2003.09.01

Abstract

This paper explored the use of an association rule mining algorithm together with a method for handling missing data, which generated enhanced association patterns on the data illustrated here. The evaluation showed that the second analysis generated more association patterns than the first, which suggests that it yields more meaningful rules. It also showed that the model offers more precise and important association rules, which are more valuable when applied to business decision making. With the discovery of accurate association rules or business patterns, strategies can be planned and implemented efficiently to improve marketing schemes. This investigation raises a number of interesting issues that could be explored further, such as the effect of outliers and missing data on detecting fraud and devious database entries.
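The abstract does not say which missing-data treatment or which data were used, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes a level-wise, Apriori-style search and compares two common treatments: dropping every record that has a missing field, and keeping each record with only its known items. The records, item names, thresholds, and function names are all hypothetical.

```python
from itertools import combinations

# Hypothetical records: True = item bought, False = not bought, None = missing.
HYPOTHETICAL_RECORDS = [
    {"bread": True, "milk": True, "butter": True},
    {"bread": True, "milk": True, "butter": None},
    {"bread": True, "milk": None, "butter": True},
    {"bread": True, "milk": True, "butter": False},
    {"bread": False, "milk": True, "butter": None},
]


def to_transactions(records, drop_incomplete):
    """Turn records into item sets.

    drop_incomplete=True discards any record with a missing field;
    drop_incomplete=False keeps the record and uses only its known items.
    """
    transactions = []
    for rec in records:
        if drop_incomplete and any(v is None for v in rec.values()):
            continue
        transactions.append(frozenset(k for k, v in rec.items() if v is True))
    return transactions


def frequent_itemsets(transactions, min_support):
    """Return {itemset: support} for every itemset with support >= min_support."""
    n = len(transactions)
    if n == 0:
        return {}
    items = sorted({item for t in transactions for item in t})
    candidates = [frozenset([i]) for i in items]
    result = {}
    k = 1
    while candidates:
        # Count how many transactions contain each candidate itemset.
        level = {}
        for cand in candidates:
            support = sum(1 for t in transactions if cand <= t) / n
            if support >= min_support:
                level[cand] = support
        result.update(level)
        # Join frequent k-itemsets into candidate (k+1)-itemsets.
        k += 1
        candidates = list(
            {a | b for a, b in combinations(list(level), 2) if len(a | b) == k}
        )
    return result


def rules(freq, min_confidence):
    """Return (lhs, rhs, support, confidence) for rules meeting min_confidence."""
    found = []
    for itemset, support in freq.items():
        for r in range(1, len(itemset)):
            for lhs in combinations(sorted(itemset), r):
                lhs = frozenset(lhs)
                # Every subset of a frequent itemset is itself in freq.
                confidence = support / freq[lhs]
                if confidence >= min_confidence:
                    rhs = itemset - lhs
                    found.append(
                        (tuple(sorted(lhs)), tuple(sorted(rhs)), support, confidence)
                    )
    return found


for label, drop in (("drop incomplete records", True), ("keep known items", False)):
    txns = to_transactions(HYPOTHETICAL_RECORDS, drop_incomplete=drop)
    freq = frequent_itemsets(txns, min_support=0.4)
    print(label, "->", len(txns), "transactions,", len(rules(freq, 0.7)), "rules")
```

Running the loop at the end prints, for each treatment, how many transactions survive and how many rules pass the thresholds. Which treatment yields more rules depends on the data and the thresholds, so this toy comparison says nothing about the paper's reported results.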

Keywords

Association Rule Algorithm;Data Mining;Distributed Systems;Missing Data
