Development of e-Mail Classifiers for e-Mail Response Management Systems

전자메일 자동관리 시스템을 위한 전자메일 분류기의 개발

  • Published : 2003.11.30

Abstract

With the increasing proliferation of World Wide Web, electronic mail systems have become very widely used communication tools. Researches on e-mail classification have been very important in that e-mail classification system is a major engine for e-mail response management systems which mine unstructured e-mail messages and automatically categorize them. in this research we develop e-mail classifiers for e-mail Response Management Systems (ERMS) using naive bayesian learning and centroid-based classification. We analyze which method performs better under which conditions, comparing classification accuracies which may depend on the structure, the size of training data set and number of classes, using the different data set of an on-line shopping mall and a credit card company. The developed e-mail classifiers have been successfully implemented in practice. The experimental results show that naive bayesian learning performs better, while centroid-based classification is more robust in terms of classification accuracy.

Keywords

References

  1. 윤종식, '배깅과 부스팅을 이용한 나이브 베이지안 이메일 분류기의 성능향상', '동국대학교 석사학위논문', 2001
  2. 황호순, '프론트 앤드 e-CRM을 위한 전자메일 분류기 개발', '동국대학교 석사학위논문', 2001
  3. Diao. Y., Lu. H. and Wu. D., A Comparative Study of Classification Based Personal E-mail Filtering, PAKDD, 2000
  4. Dietterich. T. G., 'Approximate Statistical tests for Comparing Supervised Classification Learning Algorithms,' Neural Computation, Vol.10, No.7(1998)
  5. Dumais. S. S., Heckerman. D. and Horvitz. E., A Bayesian Approach to Filtering Junk e-Mail, AAAI Technical Report WS-98-05, 1998
  6. Han(Sam). E. H. and Karypis. G., Centroid -Based Document Classification:Analysis & Experimental Results,' PAKDD, 2000
  7. Lewis. D. and Ringuette. M., Comparison of Two Learning Algorithms for Text Categorization, In Tenth European Conference on Machine Learning, 1998
  8. McCallum. A. and Nigam. K., A Comparison of Event Models for Naive Bayes Text Classification, In AAAI-98 Workshop on Learning for Text Categorization, 1998
  9. Mitchell. T. M., Machine Learning, The McGraw-Hill Company, 1997
  10. Salton. G., Automatic Text Processing:The Transformation, Analysis, and Retrieval of Information by Computer, Addison Wesley, 1989
  11. Yang. Y. and Pedersen. J., A Comparative Study On Feature Selection in Text Categorization,' ICML, 1997