DOI QR코드

DOI QR Code

Text mining on internet-news regarding climate change and food

기후변화 및 식품 관련 뉴스기사의 텍스트 마이닝

  • 현윤진 (국민대학교 전문대학원 비즈니스 IT학과) ;
  • 김정선 (한국보건사회연구원) ;
  • 정진욱 (한국보건사회연구원) ;
  • 윤시몬 (한국보건사회연구원) ;
  • 이문수 (국민대학교 전문대학원 비즈니스 IT학과)
  • Received : 2015.01.08
  • Accepted : 2015.02.09
  • Published : 2015.03.31

Abstract

Despite of correlation between climate changes and food-related information, it is still not easy for many users to get access to the information with interest. This study investigated how much climate change and food-related information are correlated with each other and how often they are exposed through frequency and correlation analysis on news articles on the internet portals. Through analysis on the frequency of climate change and food-related news articles, this study was able to figure out how often they are exposed at the same time by the internet news portals. In addition, a total of 59 correlation rules regarding the climate change and food-related vocabularies were derived from these news articles using the climate change and food-related glossaries. Then, a correlation between certain climate change-related and food-related words was analyzed in order to package the related words.

기후변화와 식품 관련 정보가 유기적인 관련이 있음에도 불구하고, 사실상 현실에서는 사용자들이 직접 그 관련성에 대한 관심을 가지고, 해당 정보에 대한 접근이 용이하다고 말하기는 어렵다.본 연구는 실제 사용자들이 직접적으로 노출되는 인터넷 포털 사이트의 뉴스 기사에 대한 빈도분석 및 연관관계 분석을 통해 기후변화 및 식품 관련 정보가 어느 정도의 연관성을 가지고 얼마나 자주 나타나고 있는지에 대해 파악하였다. 또한 추출된 기후변화 및 식품 관련 뉴스를 대상으로 기후변화 용어 사전과 식품 관련 용어 사전을 활용하여 기후변화 관련 용어와 식품 관련 용어의 총 59개의 연관관계 규칙을 도출함으로써, 특정 기후변화 관련 용어가 어떠한 식품 관련 용어와 연관관계를 갖는지 파악하여, 추후 두 용어를 패키징해 제공할 수 있는 발판을 마련하였다.

Keywords

References

  1. Agrawal, R. and Srikant, R. (1994). Fast algorithms for mining association rules. In Proceedings of the 20th International Conference on Very Large Data Bases, VLDB, 1215, 487-499.
  2. Ahn, H. C., Han, I. K. and Kim, K. J. (2006). The product recommender system combining association rules and classification models: The case of G Internet shopping mall. Information System Review, 8, 181-201.
  3. Albright, R. (2006). Taming text with the SVD, SAS Institute Inc., Cary, NC.
  4. Cho, I. D. and Kim, N. K. (2011). Recommending core and connecting keywords of research area using social network and data mining techniques. Journal of Intelligence and Information Systems, 17, 127-138.
  5. Choi, K. S. (2012). Hybrid big data analysis techniques and case studies in the SNS-era. In 2012 Big Data Search and Analysis Techniques Insight, Korea Database Agency, Seoul.
  6. Fan, W., Wallace, W., Rich, S. and Zhang, Z. (2006). Tapping the ower of text mining. Communications of the ACM, 49, 76-82.
  7. Han, J. and Kamber, M. (2011). Data mining: Concepts and techniques, 3rd Ed., Morgan Kaufmann Publishers, San Francisco, US.
  8. Hearst, M. A. (1999). Untangling text data mining. In Proceedings of the 37th Annual Meeting of the Association for Computational Linguistics on Computational Linguistics, Association for Computation Linguistics, 3-10.
  9. Hyun, Y., Han, H., Choi, H., Park, J., Lee, K., Kwahk, K. and Kim, N. (2013). Methodology using text analysis for packaging R&D information services on pending national issues. Journal of Information Applications & Management, 20, 231-257.
  10. Kim, I. H. (2012). Values and application strategy of big data. In 2012 Big Data Search and Analysis Techniques Insight, Korea Database Agency, Seoul.
  11. Kim, N. (2008). Effect of market basket size on the accuracy of association aule measures. Asia Pacific Journal of Information Systems, 18, 95-114.
  12. Lee, Y. J. and Kim, K. J. (2013). Product recommender systems using multi-model ensemble techniques. Journal of Intelligence and Information Systems, 19, 39-54.
  13. Metzler, D., Bernstein, Y., Crofit, W. B., Moffat, A. and Zobel, J. (2005). Similarity measures for tracking information flow. In Proceedings of the 14th ACM International Conference on Information and Knowledge Management, 517-524.
  14. Mooney, R. J. and Bunescu, R. (2006). Mining knowledge from text using information extraction. ACM SIGKDD Explorations Newsletter, 7, 3-10.
  15. Park, W. C., Seung, H. W. and Yong, H. S. (2003). Data mining: Concepts and techniques, Free Academy, Seoul.
  16. Rijsbergen, C. J. V. (1979). Information retrieval, 2nd Ed., Butterworth, London,
  17. Salton, G., Wong, A. and Yang, C. S. (1975). A vector space model for automatic indexing. Communications of the ACM, 18, 613-620. https://doi.org/10.1145/361219.361220
  18. Sebastiani, F. (2002). Machine learning in automated text categorization. ACM Computing Surveys, 34, 1-47. https://doi.org/10.1145/505282.505283
  19. Sebastiani, F. (2006). Classification of text, automatic. In The encyclopedia of language and linguistics 14, 2nd Ed., Elsevier Science Pub., Amsterdam, 457-462.
  20. Stanvrianou, A., Andritsos, P. and Nicoloyannis, N. (2007). Overview and semantic issues of text mining. ACM Sigmod Record, 36, 23-34. https://doi.org/10.1145/1324185.1324190
  21. Wang, W. F., Chung, Y. L., Hus, M. H. and Keh, H. C. (2004). A personalized recommender system for the cosmetic business. Expert Systems with Applications, 26, 427-434. https://doi.org/10.1016/j.eswa.2003.10.001
  22. Witten, I. H. (2004). Text mining. In Practical Handbook of Internet Computing, edited by M. P. Singh, CRC Press, New York.
  23. Yoon, S. J. (2005). A study of churn prediction model for department store customers using data mining technique. Asia Marketing Journal, 6, 45-72.

Cited by

  1. A study on fractal dimensions of art works vol.27, pp.2, 2016, https://doi.org/10.7465/jkdi.2016.27.2.305
  2. 텍스트 마이닝 기법을 이용한 환경 분야의 ICT 활용 연구 동향 분석 vol.33, pp.2, 2015, https://doi.org/10.7780/kjrs.2017.33.2.7
  3. 비정형 농업기상자료를 활용한 배추 도매가격 예측모형 연구 vol.28, pp.3, 2017, https://doi.org/10.7465/jkdi.2017.28.3.617
  4. 민원 분석을 위한 텍스트 마이닝 기법 연구: 계층적 연관성 분석 vol.23, pp.3, 2015, https://doi.org/10.9723/jksiis.2018.23.3.013
  5. Trend Analysis of Thyroid Cancer Research in Korea with Text Mining Techniques vol.23, pp.12, 2018, https://doi.org/10.9708/jksci.2018.23.12.153