Improvements of K-modes Algorithm and ROCK Algorithm

  • Kim, Bo-Hwa (Korea Development Bank)
  • Kim, Kyu-Seong (Department of Computer Science and Statistics, University of Seoul)
  • Published : 2002.09.01

Abstract

The K-modes algorithm and the ROCK (RObust Clustering using linKs) algorithm are useful clustering methods for large categorical data sets. In this paper, we examine both algorithms and propose improved versions that address their weaknesses. A simulation study applying the proposed methods to real data shows that they can improve clustering performance.
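The abstract does not describe the proposed improvements, so the following is only a minimal sketch of the two baseline ideas as they are commonly described: K-modes with a simple matching dissimilarity and per-attribute mode updates, and the ROCK notion of links as counts of common neighbours. All function names, the toy records, the Jaccard similarity over (attribute, value) pairs, and the choice not to count a record as its own neighbour are assumptions made for illustration, not the authors' method.

```python
from collections import Counter


def matching_dissimilarity(a, b):
    """Number of attributes on which two categorical records differ."""
    return sum(x != y for x, y in zip(a, b))


def column_modes(records):
    """Most frequent category per attribute (ties go to the first one seen)."""
    return tuple(Counter(col).most_common(1)[0][0] for col in zip(*records))


def k_modes(records, initial_modes, max_iter=20):
    """Baseline K-modes: assign records to the nearest mode, then update modes."""
    modes = list(initial_modes)
    labels = None
    for _ in range(max_iter):
        new_labels = [
            min(range(len(modes)),
                key=lambda j: matching_dissimilarity(r, modes[j]))
            for r in records
        ]
        if new_labels == labels:  # assignments stable: stop
            break
        labels = new_labels
        for j in range(len(modes)):
            members = [r for r, lab in zip(records, labels) if lab == j]
            if members:  # keep the old mode if a cluster becomes empty
                modes[j] = column_modes(members)
    return labels, modes


def jaccard(a, b):
    """Jaccard similarity of two records viewed as sets of (attribute, value)."""
    sa, sb = set(enumerate(a)), set(enumerate(b))
    return len(sa & sb) / len(sa | sb)


def rock_links(records, theta):
    """ROCK-style links: number of common neighbours for each pair of records.

    Assumption: two distinct records are neighbours when their Jaccard
    similarity is at least theta; a record is not its own neighbour.
    """
    n = len(records)
    neighbours = [
        {j for j in range(n)
         if j != i and jaccard(records[i], records[j]) >= theta}
        for i in range(n)
    ]
    return {
        (i, j): len(neighbours[i] & neighbours[j])
        for i in range(n) for j in range(i + 1, n)
    }


# Hypothetical toy data: six records with three categorical attributes.
records = [
    ("a", "x", "p"), ("a", "x", "q"), ("a", "y", "p"),
    ("b", "z", "r"), ("b", "z", "s"), ("b", "w", "r"),
]
labels, modes = k_modes(records, initial_modes=[records[0], records[3]])
links = rock_links(records, theta=0.5)
print(labels, modes)
print(links[(1, 2)], links[(0, 3)])
```

In this toy data, records 1 and 2 fall below the similarity threshold with each other but share record 0 as a neighbour, which is the kind of indirect evidence that link counts are meant to capture and that a purely pairwise dissimilarity does not.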
