EXTENDED ONLINE DIVISIVE AGGLOMERATIVE CLUSTERING

  • Musa, Ibrahim Musa Ishag (Database/Bioinformatics Laboratory, Chungbuk National University)
  • Lee, Dong-Gyu (Database/Bioinformatics Laboratory, Chungbuk National University)
  • Ryu, Keun-Ho (Database/Bioinformatics Laboratory, Chungbuk National University)
  • Published : 2008.10.29

Abstract

Clustering data streams is important in many applications, such as sensor networks. Existing hierarchical methods follow a semi-fuzzy clustering approach that yields duplicate clusters. To solve this problem, we propose an extended online divisive agglomerative clustering method for data streams. It builds a tree-like, top-down hierarchy of clusters that evolves with the data stream, using a geometric time frame for snapshots. It enhances Online Divisive Agglomerative Clustering (ODAC) with a pruning strategy that avoids duplicate clusters. Its main features are an update time and a memory footprint that are independent of the number of examples in the data stream. It can be used for clustering sensor data, network monitoring, and web click streams.
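
The abstract does not spell out how snapshots are scheduled, so the following is only a minimal sketch of one common formulation of a geometric time frame, not the authors' implementation. It assumes integer timestamps starting at 1, assigns a snapshot taken at time t to frame i where 2^i is the largest power of two dividing t, and keeps only the most recent snapshots in each frame. The names `GeometricTimeFrame`, `max_capacity`, `frame_number`, `store`, and `timestamps` are hypothetical and chosen for illustration.

```python
from collections import deque


class GeometricTimeFrame:
    """Illustrative snapshot store; an assumption, not the paper's code."""

    def __init__(self, max_capacity=2):
        # Hypothetical parameter: snapshots retained per frame.
        self.max_capacity = max_capacity
        # Maps frame number -> deque of (timestamp, snapshot) pairs.
        self.frames = {}

    @staticmethod
    def frame_number(t):
        """Return the largest i such that 2**i divides t (t >= 1)."""
        if t < 1:
            raise ValueError("timestamps are assumed to start at 1")
        # t & -t isolates the lowest set bit of t, i.e. 2**i.
        return (t & -t).bit_length() - 1

    def store(self, t, snapshot):
        """Store a snapshot; the oldest entry in a full frame is dropped."""
        i = self.frame_number(t)
        frame = self.frames.setdefault(i, deque(maxlen=self.max_capacity))
        frame.append((t, snapshot))

    def timestamps(self):
        """Return the timestamps of all retained snapshots, oldest first."""
        return sorted(t for frame in self.frames.values() for t, _ in frame)


# Usage: feed 16 ticks and inspect which snapshots survive.
store = GeometricTimeFrame(max_capacity=2)
for t in range(1, 17):
    store.store(t, snapshot={"tick": t})
print(store.timestamps())
```

Under these assumptions, recent history is kept at fine granularity and older history at coarser granularity, and the number of retained snapshots grows with the logarithm of the elapsed time rather than with the number of examples seen.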

Keywords