A Hierarchical Clustering Algorithm Using Extended Sequence Element-based Similarity Measure

Oh, Seung-Joon;

Journal of the Korea Society of Computer and Information (한국컴퓨터정보학회논문지)

Volume 11 Issue 5 Serial No. 43
/
Pages.321-327
/
2006
/
1598-849X(pISSN)
/
2383-9945(eISSN)

Korean Society of Computer Information (한국컴퓨터정보학회)

A Hierarchical Clustering Algorithm Using Extended Sequence Element-based Similarity Measure

확장된 시퀀스 요소 기반의 유사도를 이용한 계층적 클러스터링 알고리즘

Oh, Seung-Joon

오승준 (경기공업대학 산업경영시스템과)

Published : 2006.11.30

PDF

Download PDF

⟨ Previous Next ⟩

Abstract

Recently there has been enormous growth in the amount of commercial and scientific data. Such datasets consist of sequence data that have an inherent sequential nature. However, only a few of the existing clustering algorithms consider sequentiality. This study presents a similarity measure and a method for clustering such sequence datasets. Especially, we present an extended concept of the measure of similarity, which considers various conditions. Using a splice dataset, we show that the quality of clusters generated by our proposed clustering algorithm is better than that of clusters produced by traditional clustering algorithms.

최근 들어 상업적이거나 과학적인 데이터들의 폭발적인 증가를 볼 수 있다. 이런 데이터들은 항목들 간의 순서적인 면을 가지고 있는 시퀀스 데이터들이다. 그러나 항목들 간의 순서적인 면을 고려한 클러스터링 연구는 많지 않다. 본 논문에서는 이들 시퀀스 데이터들 간의 유사도를 계산하는 방법과 클러스터링 방법을 연구한다. 특히 다양한 조건을 고려한 확장된 유사도 계산 방법을 제안한다. splice 데이터 셋을 이용하여 본 논문에서 제안하는 클러스터링 방법이 기존 방법 보다 우수하다는 것을 보여준다.

Journal of the Korea Society of Computer and Information (한국컴퓨터정보학회논문지)

A Hierarchical Clustering Algorithm Using Extended Sequence Element-based Similarity Measure

확장된 시퀀스 요소 기반의 유사도를 이용한 계층적 클러스터링 알고리즘

Abstract

Keywords

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)