A Study on the WT-Algorithm for Effective Reduction of Association Rules

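  • Jin-Hee Park (Faculty of Liberal Arts, Daegu Haany University);
  • Su-Young Pi (Institute of Liberal Education, Daegu Catholic University)
  • Received: 2015.03.06
  • Reviewed: 2015.10.07
  • Published: 2015.10.31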

Abstract

Data pouring in every day from mobile devices, online services, and social network services (SNS) has pushed us past a flood of information into a state of overload. Beyond the information that already exists, new information is being created from moment to moment in uncountable quantities. Association analysis is a method for finding, within such information, frequent itemsets: sets of items whose frequency of occurrence exceeds a minimum support. Its drawback is that the number of rules grows exponentially as the number of items increases, which makes the desired information hard to find. This paper therefore represents the transaction data set as Boolean variable items and proposes the WT-algorithm, which turns the Quine-McCluskey method for simplifying logic functions into an algorithm and assigns a weight to each item. Because the proposed algorithm can simplify rules regardless of the number of items, it reduces unnecessary rules and can thereby improve the efficiency of data mining.
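The two building blocks named in the abstract can be illustrated with a minimal Python sketch. It is not the paper's WT-algorithm: the weighting scheme is not described in the abstract and is therefore omitted. The function names (`frequent_itemsets`, `encode`, `merge`, `prime_implicants`) and the sample transactions are hypothetical and chosen only for illustration. The sketch first filters itemsets by minimum support, then encodes each transaction as a Boolean string and applies the merging step of the Quine-McCluskey method, in which two terms that differ in exactly one position are combined and that position is replaced by `-`.

```python
from itertools import combinations


def frequent_itemsets(transactions, min_support, max_size=2):
    """Return {itemset: support} for itemsets whose support exceeds min_support."""
    n = len(transactions)
    items = sorted(set().union(*transactions))
    result = {}
    for size in range(1, max_size + 1):
        for candidate in combinations(items, size):
            count = sum(1 for t in transactions if set(candidate) <= t)
            support = count / n
            if support > min_support:
                result[candidate] = support
    return result


def encode(transaction, items):
    """Boolean encoding: '1' if the item occurs in the transaction, else '0'."""
    return "".join("1" if item in transaction else "0" for item in items)


def merge(a, b):
    """Combine two terms that differ in exactly one 0/1 position, else None."""
    diff = [k for k, (x, y) in enumerate(zip(a, b)) if x != y]
    if len(diff) == 1 and "-" not in (a[diff[0]], b[diff[0]]):
        k = diff[0]
        return a[:k] + "-" + a[k + 1:]
    return None


def prime_implicants(terms):
    """Repeat the merging step until no terms combine; return unmerged terms."""
    terms = set(terms)
    primes = set()
    while terms:
        merged, used = set(), set()
        for a, b in combinations(sorted(terms), 2):
            m = merge(a, b)
            if m is not None:
                merged.add(m)
                used.update((a, b))
        primes |= terms - used  # terms that combined with nothing are prime
        terms = merged
    return primes


# Hypothetical sample data, not taken from the paper.
transactions = [
    {"bread", "milk"},
    {"bread", "milk", "butter"},
    {"bread"},
    {"milk", "butter"},
]
items = sorted(set().union(*transactions))  # ['bread', 'butter', 'milk']
terms = [encode(t, items) for t in transactions]

print(frequent_itemsets(transactions, min_support=0.5))
print(sorted(prime_implicants(terms)))  # ['-11', '1-1', '10-']
```

In the output, a `-` marks an item whose presence or absence no longer matters for the combined term, which is how the simplification shortens the description of the transaction set. A full Quine-McCluskey procedure would additionally select a minimal cover from these prime implicants; that step, like the item weighting, is left out here.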
