DOI QR코드

DOI QR Code

Subspace Projection-Based Clustering and Temporal ACRs Mining on MapReduce for Direct Marketing Service

  • Lee, Heon Gyu (IT Convergence Technology Research Laboratory, ETRI) ;
  • Choi, Yong Hoon (IT Convergence Technology Research Laboratory, ETRI) ;
  • Jung, Hoon (IT Convergence Technology Research Laboratory, ETRI) ;
  • Shin, Yong Ho (School of Business, Yeungnam University)
  • Received : 2014.08.04
  • Accepted : 2014.10.24
  • Published : 2015.04.01

Abstract

A reliable analysis of consumer preference from a large amount of purchase data acquired in real time and an accurate customer characterization technique are essential for successful direct marketing campaigns. In this study, an optimal segmentation of post office customers in Korea is performed using a subspace projection-based clustering method to generate an accurate customer characterization from a high-dimensional census dataset. Moreover, a traditional temporal mining method is extended to an algorithm using the MapReduce framework for a consumer preference analysis. The experimental results show that it is possible to use parallel mining through a MapReduce-based algorithm and that the execution time of the algorithm is faster than that of a traditional method.

Keywords

References

  1. X. Ling and C. Li, "Data Mining for Direct Marketing: Problems and Solutions," Int. Conf. Knowl. Discovery Data Mining, 1998, pp. 73-79.
  2. A. Hall, Direct Mail that Makes Cents, NPF2014 (Nat. Postal Forum), Washington D.C., 2014.
  3. B. Greshan and C. Kiani, Mail Meets Mobile Technol., NPF2014 (Nat. Postal Forum), Washington D.C., 2014.
  4. L. Sing'oei and J. Wang, "Data Mining Framework for Direct Marketing: A Case Study of Bank Marketing," J. Comput. Sci. Issues, vol. 10, no. 2, Mar. 2013, pp. 198-203.
  5. D. Zakrzewska and J. Murlewski, "Clustering Algorithms for Bank Customer Segmentation," Int. Conf. Intell. Syst. Des. Appl., Sept. 8-10, 2005, pp. 197-202.
  6. H.-J. Oh, C. Lee, and C.-H. Lee, "Analysis of the Empirical Effects of Contextual Matching Advertising for Online News," ETRI J., vol. 34, no. 2, Apr. 2012, pp. 292-295. https://doi.org/10.4218/etrij.12.0211.0171
  7. R. Agrawal, T. Imielinski, and A. Swami, "Mining Association Rules between Sets of Items in Large Databases," Int. Conf. Manag. Data, 1993, pp. 207-216.
  8. R. Agrawal and R. Srikant, "Mining Sequential Patterns," Int. Conf. Data Eng., 1995, pp. 3-14.
  9. Y. Li et al., "Discovering Calendar-Based Temporal Association Rules," Data Knowl. Eng., vol. 44, no. 2, Feb. 2003, pp. 193-218. https://doi.org/10.1016/S0169-023X(02)00135-0
  10. B. Ozden, S. Ramaswamy, and A. Silberschatz, "Cyclic Association Rules," Int. Conf. Data Eng., Orlando, FL, USA, Feb. 23-27, 1998, pp. 412-421.
  11. K. Verma and O.P. Vyas, "Efficient Calendar Based Temporal Association Rule," ACM SIGMOD Record, vol. 34, no. 3, Sept. 2005, pp. 63-70.
  12. Y.J. Lee et al., "Mining Temporal Interval Relational Rules from Temporal Data," J. Syst. Softw., vol. 82, no. 1, 2009, pp. 155-167. https://doi.org/10.1016/j.jss.2008.07.037
  13. S. Laxman and P.S. Sastry, "A Survey of Temporal Data Mining," Sadhana, vol. 31, no. 2, Apr. 2006, pp. 173-198. https://doi.org/10.1007/BF02719780
  14. E. Muller et al., OpenSubspace: Weka Subspace-Clustering Integration, 2013. Accessed May 11, 2014. http://dme.rwthaachen.de/OpenSubspace/
  15. J. Dean and S. Ghemawat, "MapReduce: Simplified Data Processing on Large Clusters," Operating Syst. Des. Implementation, vol. 6, 2004, pp. 137-150.
  16. S. Parsa and M. Hamzei, "Locality-Conscious Nested-Loops Parallelization," ETRI J., vol. 36, no. 1, Feb. 2014, pp. 124-133. https://doi.org/10.4218/etrij.14.0113.0266
  17. M.-Y. Lin, P.-Y. Lee, and S.-C. Hsueh, "A Priori-Based Frequent Itemset Mining Algorithms on MapReduce," Int. Conf. Ubiquitous Inf. Manag. Commun., 2012, no. 76.
  18. C. Aggarwal et al., "Fast Algorithms for Projected Clustering," Int. Conf. Manag. Data, Philadelphia, PA, USA, 1999, pp. 61-72.
  19. J. Han, M. Kamber, and J. Pei, Data Mining: Concepts and Techniques, 3rd ed., San Francisco, CA, USA: Morgan Kaufmann, 2006, pp. 508-511.
  20. B. Durrant et al., Weka 3: Data Mining Software in Java, Machine Learning Group at the University of Waikato, 2014. Accessed July 12, 2014. http://www.cs.waikato.ac.nz/ml/weka/
  21. J.S. Kim et al. "CTAR: Classification Based on Temporal Class-Association Rules for Intrusion Detection," Int. Workshop WISA, Jeju, Rep. of Korea, Aug. 25-27, 2003, pp. 84-96.
  22. W. Li, J. Han, and J. Pei, "CMAR: Accurate and Efficient Classification Based on Multiple Class-Association Rules," IEEE Int. Conf. Data Mining, San Jose, CA, USA, 2001, pp. 369-376.
  23. F. Thabtah, "A Review of Associative Classification Mining," Knowl. Eng. Rev., vol. 22, no. 1, Mar. 2007, pp. 37-65. https://doi.org/10.1017/S0269888907001026
  24. T.H. Nhan, J.W. Lee, and K.H. Ryu, "Spatiotemporal Pattern Mining Technique for Location-Based Service System," ETRI J., vol. 30, no. 3, June 2008, pp. 421-431. https://doi.org/10.4218/etrij.08.0107.0238
  25. R. Agrawal and R. Srikant, "Fast Algorithms for Mining Association Rules in Large Databases," Int. Conf. Very Large Databases, 1994, pp. 487-499.
  26. K. Shvachko et al., "The Hadoop Distributed File System," IEEE Symp. Mass Storage Syst. Technol., Incline Village, NV, USA, May 3-7, 2010, pp. 1-10.

Cited by

  1. A Data Mining Approach for Cardiovascular Disease Diagnosis Using Heart Rate Variability and Images of Carotid Arteries vol.8, pp.6, 2015, https://doi.org/10.3390/sym8060047
  2. Cancelled: A New Efficient Resource Management Framework for Iterative MapReduce Processing in Large-Scale Data Analysis vol.ed100, pp.4, 2015, https://doi.org/10.1587/transinf.2016dap0013
  3. Die Steigerung einzuwerbender Versicherungsprämien durch Targeting im Dialogmarketing vol.108, pp.2, 2015, https://doi.org/10.1007/s12297-019-00436-x