Keyword Weight based Paragraph Extraction Algorithm

Lee, Jongwon;Yu, Seongjong;Kim, Doan;Jung, Hoekyung;

Proceedings of the Korean Institute of Information and Commucation Sciences Conference (한국정보통신학회:학술대회논문집)

2018.05a
/
Pages.462-463
/
2018

The Korea Institute of Information and Commucation Engineering (한국정보통신학회)

Keyword Weight based Paragraph Extraction Algorithm

문단 가중치 분석 기반 본문 영역 선정 알고리즘

Lee, Jongwon (PaiChai University) ;
Yu, Seongjong (PaiChai University) ;
Kim, Doan (PaiChai University) ;
Jung, Hoekyung (PaiChai University)

이종원 (배재대학교) ;
유성종 (배재대학교) ;
김도안 (배재대학교) ;
정회경 (배재대학교)

Published : 2018.05.31

PDF

Download PDF

⟨ Previous Next ⟩

Abstract

Traditional document analysis systems used word-based analysis using a morphological analyzer or TF-IDF technique. These systems have the advantage of being able to derive key keywords by calculating the weights of the keywords. On the other hand, it is not appropriate to analyze the contents of documents due to the structural limitations. To solve this problem, the proposed algorithm calculates the weights of the documents in the document and divides the paragraphs into areas. And we calculate the importance of the divided regions and let the user know the area with the most important paragraphs in the document. So, it is expected that the user will be provided with a service suitable for analyzing documents rather than using existing document analysis systems.

기존의 문서 분석 시스템들은 형태소 분석기나 TF-IDF 기법을 통해 단어 위주의 분석을 진행하였다. 이러한 시스템들은 키워드들의 가중치를 계산하여 주요 키워드를 도출할 수 있는 장점이 있다. 이에 반해 문서의 내용을 분석하기에는 구조적인 한계로 인해 부적합한 실정이다. 이를 해결하기 위해 본 논문에서 제안하는 알고리즘은 문서 내에 있는 문단들의 가중치를 계산한 뒤 문단들을 영역별로 분할한다. 그리고 분할된 영역별로 중요도를 계산하여 해당 문서 내에 가장 중요한 문단들이 있는 영역을 사용자에게 알려준다. 이를 통해 사용자는 기존의 문서 분석 시스템들을 사용할 때보다 문서를 분석하기에 적합한 서비스를 제공받을 것으로 사료된다.

Proceedings of the Korean Institute of Information and Commucation Sciences Conference (한국정보통신학회:학술대회논문집)

Keyword Weight based Paragraph Extraction Algorithm

문단 가중치 분석 기반 본문 영역 선정 알고리즘

Abstract

Keywords

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)