DOI QR코드

DOI QR Code

Combining Sentimental Expression-level and Sentence-level Classifiers to Improve Subjective Sentence Classification

감정 표현구 단위 분류기와 문장 단위 분류기의 결합을 통한 주관적 문장 분류의 성능 향상

  • Published : 2007.12.31

Abstract

Subjective sentences express opinions, emotions, evaluations and other subjective ideas relevant to products or events. These expressions sometimes can be seen in only part of a sentence, thus extracting features from a full-sentence can degrade the performance of subjective-sentence-classification. This paper presents a method for improving the performance of a subjectivity classifier by combining two classifiers generated from the different representations of an input sentence. One representation is a sentimental phrase that represents an automatically identified subjective expression or objective expression and the other representation is a full-sentence. Each representation is used to extract modified n-grams that are composed of a word and its contextual words' polarity information. The best performance, 79.7% accuracy, 2.5% improvement, was obtained when the phrase-level classifier and the sentence-level classifier were merged.

주관적 문장이란 주관적인 내용을 포함한 문장으로써 저자의 제품이나 사건에 대한 생각을 알 수 있다. 주관적 내용임을 나타내는 주관적인 표현은 문장 전반적으로 골고루 나타날 수도 있지만 일부 한정된 영역에서만 발견될 수도 있다. 따라서 보다 정확한 분류를 위해서는, 문장 전체를 고려하는 정보 외에 사실이나 감정을 표현하는 주관적 혹은 객관적 표현구 정보의 활용이 필요하다. 본 연구에서는 문장 전체를 이용한 분류 결과와 감정 표현구를 이용한 분류 결과를 결합하여 주/객관적 문장 분류기의 성능을 향상시키는 방법을 제안한다. 한 문장은 여러 개의 표현구를 가질 수 있어 복수개의 표현구 단위 결과를 얻게 되며 기계 학습을 응용하여 문장 단위 결과와 결합한다. 실험을 통한 결과, 표현구 단위 결과물 중 최대값을 가지는 두 가지 결과와 문장 전체를 이용한 결과를 합침으로써 2.5% 성능 향상된 79.7%의 정확률을 얻을 수 있었다.

Keywords

References

  1. Kushal Dave, Steve Lawrence and David M. Pennock, 'Mining the peanut gallery:opinion extraction and semantic classification of product reviews,' Proceedings of the 12th international conference on World Wide Web, 519-528, 2003 https://doi.org/10.1145/775152.775226
  2. Theresa Wilson, Janyce Wiebe and Paul Hoffmann, 'Recognizing Contextual Polarity in Phrase-Level Sentiment Analysis,' Proceedings of Human Language Technology Conference and North American Chapter of the Association for Computational Linguistics, 2005 https://doi.org/10.3115/1220575.1220619
  3. Bo Pang and Lillian Lee, 'A Sentimental Education: Sentiment Analysis Using Subjectivity Summarization Based on Minimum Cuts,' Annual Meeting of the Association for Computational Linguistics (ACL), 2004 https://doi.org/10.3115/1218955.1218990
  4. Hong Yu and Vasileios Hatzivassiloglou, Towards Answering Opinion Questions: Separating Facts from Opinions and Identifying the Polarity of Opinion Sentences,' Proceedings of the Conference on Empirical Methods in Natural Language Processing vol. 10, 129-136, 2003 https://doi.org/10.3115/1119355.1119372
  5. Ellen Riloff, Janyce Wiebe and William Phillips, 'Exploiting Subjectivity Classification to Improve Information Extraction,' Proceedings of the 20th National Conference on Artificial Intelligence, 2005
  6. Theresa Wilson and Janyce Wiebe, 'Annotating Opinions in the World Press,' Proceedings of the 4th SIGdial Workshop on Discourse and Dialogue, 2003
  7. Saso Dzeroski and Bernard Zenko, 'Is Combining Classifiers Better than Selecting the Best One?' Machine Learning, 54(3), 255-273, 2004 https://doi.org/10.1023/B:MACH.0000015881.36452.6e
  8. Soo-Min Kim and Eduard Hovy, 'Automatic Detection of Opinion Bearing Words and Sentences,' Proceedings of the Second International Joint Conference on Natural Language Processing, 2005
  9. Bo Pang, Lillian Lee and Shivakumar Vaithyanathan, 'Thumbs up? Sentiment Classification using Machine Learning Techniques,' Proceedings of the Conference on Empirical Methods in Natural Language Processing, 2002
  10. George A. Miller, 'WordNet: a lexical database for English,' Communications of the ACM, 38(11), 39-41, 1995 https://doi.org/10.1145/219717.219748
  11. Ellen Riloff and Janyce Wiebe, 'Learning Extraction Patterns for Subjective Expressions,' Proceedings of the Conference on Empirical Methods in Natural Language Processing, 105-112, 2003 https://doi.org/10.3115/1119355.1119369
  12. Ellen Riloff, Janyce Wiebe and Theresa Wilson, 'Learning Subjective Nouns Using Extraction Pattern Bootstrapping,' Proceedings of the Seventh Conference on Natural Language Learning, 25-32, 2003 https://doi.org/10.3115/1119176.1119180
  13. Janyce Wiebe and Ellen Riloff, 'Creating Subjective and Objective Sentence Classifiers from Unannotated Texts,' Proceedings of the Sixth International Conference on Intelligent Text Processing and Computational Linguistics, 2005
  14. Eric Brill, 'Transformation-based Error-driven Learning and Natural Language Processing: A Case Study in Part of Speech Tagging,' Computational Linguistics, 1995
  15. Thorsten Joachims, 'Making large-Scale SVM Learning Practical,' Advances in Kernel Methods - Support Vector Learning, MIT Press, 169-184, 1999