A Automatic Document Summarization Method based on Principal Component Analysis Kim, Min-Soo; Lee, Chang-Beom; Baek, Jang-Sun; Lee, Guee-Sang; Park, Hyuk-Ro;
In this paper, we propose a automatic document summarization method based on Principal Component Analysis(PCA) which is one of the multivariate statistical methods. After extracting thematic words using PCA, we select the statements containing the respective extracted thematic words, and make the document summary with them. Experimental results using newspaper articles show that the proposed method is superior to the method using either word frequency or information retrieval thesaurus.
principal component analysis;document summarization;thematic word extraction;
Using Lexical chains for Text Summarization, proc., 1997.
Journal of the Association for Computing Machinery, 1969.
Proc. Association for Computational Linguistics, 1997.
Proc. 18th ACM-SIGIR, 1995.
Proceedings of ACM-SIGIR'98, 1998.
제9회 한글 및 한국어 정보처리 학술대회, 1997.
다변량 통계자료분석, 1994.
제27회 정보과학회 봄 학술발표논문집(B), 2000.