DOI QR코드

DOI QR Code

A Comparative Study between Stock Price Prediction Models Using Sentiment Analysis and Machine Learning Based on SNS and News Articles

SNS와 뉴스기사의 감성분석과 기계학습을 이용한 주가예측 모형 비교 연구

  • 김동영 (숭실대학교 SW특성화대학원) ;
  • 박제원 (숭실대학교 SW특성화대학원) ;
  • 최재현 (숭실대학교 SW특성화대학원)
  • Received : 2014.07.14
  • Accepted : 2014.09.15
  • Published : 2014.09.30

Abstract

Because people's interest of the stock market has been increased with the development of economy, a lot of studies have been going to predict fluctuation of stock prices. Latterly many studies have been made using scientific and technological method among the various forecasting method, and also data using for study are becoming diverse. So, in this paper we propose stock prices prediction models using sentiment analysis and machine learning based on news articles and SNS data to improve the accuracy of prediction of stock prices. Stock prices prediction models that we propose are generated through the four-step process that contain data collection, sentiment dictionary construction, sentiment analysis, and machine learning. The data have been collected to target newspapers related to economy in the case of news article and to target twitter in the case of SNS data. Sentiment dictionary was built using news articles among the collected data, and we utilize it to process sentiment analysis. In machine learning phase, we generate prediction models using various techniques of classification and the data that was made through sentiment analysis. After generating prediction models, we conducted 10-fold cross-validation to measure the performance of they. The experimental result showed that accuracy is over 80% in a number of ways and F1 score is closer to 0.8. The result can be seen as significantly enhanced result compared with conventional researches utilizing opinion mining or data mining techniques.

Keywords

References

  1. Ahn, S., "Stock Prediction Using News Text Mining and Time Series Analysis", M.S. thesis, The Graduate School of Engineering, Yonsei Univ., Seoul, Korea, 2010.
  2. Bollen, J., H. Mao, and X., Zeng, "Twitter mood predicts the stock market", Journal of Computational Science, Vol.2, No.1, 2011, 1-8. https://doi.org/10.1016/j.jocs.2010.12.007
  3. Ian, H., F. Eibe, and A. Mark, Data Mining, Morgan Kaufmann, Burlington, 2011.
  4. Kim, K., G. Lee, and S., Lee, "A Comparative Analysis of Artificial Intelligence System and Ohlson model for IPO firm's Stock Price Evaluation", The Journal of Digital Policy and Management, Vol.11, No.5, 2013, 145-158.
  5. Kim, S., D. Nam, H. Jo, and S. Kim, "A Study on the Relation of Web News and Stock Price", Korea Society of IT Services, Vol. 11, No.3, 2012, 191-203. https://doi.org/10.9716/KITS.2012.11.3.191
  6. Kim, T., W. Jung, and S. Lee, "The Analysis on the Relationship between Firms' Exposures to SNS and Stock Prices in Korea", Asia Pacific Journal of Information Systems, Vol. 24, No.2, 2014, 233-253. https://doi.org/10.14329/apjis.2014.24.2.233
  7. Kim, Y., "News Big Data Opinion Mining Model for Predicting KOSPI Movement", Ph.D. thesis, Graduate School of Business IT, Kookmin Univ., Seoul, Korea, 2013.
  8. Kim, Y., N. Kim, and S. Jung, "Stock-Index Invest Model Using News Big Data Opinion Mining", Journal of Intelligence and Information Systems, Vol.18, No.2, 2012, 143-156.
  9. Park, K. and H. Shin, "Stock Price Prediction Based on Time Series Network", Journal of the Korean Operations Research and Management Science Society, Vol.28, No.1, 2011, 53-60.
  10. Sagong, J., "A Study on Predicting Stock Price Based on Data Mining Techniques", M.S. thesis, Dept. Data Science, Inje Univ., Gimhae, Korea, 2012.
  11. Shim, K. and J. Yang, "High Speed Korean Morphological Analysis based on Adjacency Condition Check", Korean Institute of Information Scientists and Engineers, Vol.31, No.1, 2004, 89-99.
  12. Song, C., "News and Financial Prices", International Economic Journal, Vol.8, No.3, 2002, 1-34.
  13. Song, J. and S. Lee, "Automatic Construction of Positive/Negative Feature-Predicate Dictionary for Polarity Classification of Product Reviews", Korean Institute of Information Scientists and Engineers, Vol.38, No.3, 2011, 157-168.
  14. Yu, E., Y. Kim, N. Kim, and S., Jeong, "Predicting the Direction of the Stock Index by Using a Domain-Specific Sentiment Dictionary", Journal of Intelligence and Information Systems, Vol.19, No.1, 2013, 95-110. https://doi.org/10.13088/jiis.2013.19.1.095
  15. Chun, S., "뉴스 콘텐츠의 오피니언 마이닝을 통한 매체별 주가상승 예측정확도 비교 연구", M. S. thesis, Graduate School of Business IT, Kookmin Univ., Seoul, Korea, 2013.