Frequent Itemset Search Using LSI Similarity

Ko, Younhee;Kim, Hyeoncheol;Lee, Wongyu;

컴퓨터교육학회논문지 (The Journal of Korean Association of Computer Education)

제6권1호
/
Pages.1-8
/
2003
/
1598-5016(pISSN)
/
2733-9785(eISSN)

한국컴퓨터교육학회 (The Korean Association of Computer Education)

LSI 유사도를 이용한 효율적인 빈발항목 탐색 알고리즘

Frequent Itemset Search Using LSI Similarity

고윤희 (고려대학교 컴퓨터교육과) ;
김현철 (고려대학교 컴퓨터교육과) ;
이원규 (고려대학교 컴퓨터교육과)

발행 : 2003.01.30

PDF

PDF 다운로드

⟨ 이전 논문 다음 논문 ⟩

초록

본 논문에서는 frequent itemset을 빠르게 발견해내기 위한 효율적인 vertical 마이닝 알고리즘을 제안한다. 본 알고리즘은 frequent itemset을 구하기 위해 아이템들을 Least Support Itemset(LSI) 과의 유사도에 의해 올림차순으로 정렬하여 탐색 트리를 구축하여 보다 빠르고 효율적으로 frequent itemset을 찾아낸다. 또한, 트리를 탐색 시, 2가지의 휴리스틱 방법을 사용하여 탐색의 초기에 많은 후보 아이템들을 탐색 트리로부터 제거함으로써 탐색 공간을 크게 줄인다. 본 논문에서 제안하는 알고리즘은 이전의 알고리즘들과 비교해, long pattern을 가지는 데이터 베이스에서 보다 빠르게 frequent itemset을 발견해 냄을 실험을 통해 발견하였다.

We introduce a efficient vertical mining algorithm that reduces searching complexity for frequent k-itemsets significantly. This method includes sorting items by their LSI(Least Support Itemsets) similarity and then searching frequent itemsets in tree-based manner. The search tree structure provides several useful heuristics and therefore, reduces search space significantly at early stages. Experimental results on various data sets shows that the proposed algorithm improves searching performance compared to other algorithms, especially for a database having long pattern.

컴퓨터교육학회논문지 (The Journal of Korean Association of Computer Education)

LSI 유사도를 이용한 효율적인 빈발항목 탐색 알고리즘

Frequent Itemset Search Using LSI Similarity

초록

키워드

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)