DOI QR코드

DOI QR Code

A Study on the Preemptive Measure for Fake News Eradication Using Data Mining Algorithms : Focused on the M Online Community Postings

데이터 마이닝을 활용한 가짜뉴스의 선제적 대응을 위한 연구 : M 온라인 커뮤니티 게시물을 중심으로

  • 임문영 (호서대학교 기술경영전문대학원) ;
  • 박승범 (호서대학교 기술경영전문대학원)
  • Received : 2018.10.31
  • Accepted : 2019.03.18
  • Published : 2019.03.31

Abstract

Fake news threaten democratic elections and causes social conflicts, resulting in major damage. However, the concept of fake news is hard to define, as there is a saying, "News is not fake, fake is not news." Fake news, however, has irreversible characteristics that can not be recovered or reversed completely through post-punishment of economic and political benefits. It is also rapidly spreading in the early days. Therefore, it is very important to preemptively detect these types of articles and prevent their blind proliferation. The existing countermeasures are focused on reporting fake news, raising the level of punishment, and the media & academia to determine the authenticity of the news. Researchers are also trying to determine the authenticity by analyzing its contents. Apart from the contents of fake news, determining the behavioral characteristics of the promoters and its qualities can help identify the possibility of having fake news in advance. The online community has a fake news interception and response tradition through its long-standing community-based activities. As a result, I attempted to model the fake news by analyzing the affirmation-denial analysis and posting behavior by securing the web board crawl of the 'M community' bulletin board during the 2017 Korean presidential election period. Random forest algorithm deemed significant. The results of this research will help counteract fake news and focus on preemptive blocking through behavioral analysis rather than post-judgment after semantic analysis.

Keywords

OTSBB9_2019_v18n1_219_f0001.png 이미지

Divison of Fake News Progress

OTSBB9_2019_v18n1_219_f0002.png 이미지

Conceptual Diagram of Expanded SEMMA

OTSBB9_2019_v18n1_219_f0003.png 이미지

Feature Importance of Prediction Fake News

OTSBB9_2019_v18n1_219_f0004.png 이미지

Decision Tree

Fake News Compare Korea vs U. S. A

OTSBB9_2019_v18n1_219_t0001.png 이미지

Top10 Link Reported from Minjudang Web

OTSBB9_2019_v18n1_219_t0002.png 이미지

Top10 Link Reported from Moonjaein Web

OTSBB9_2019_v18n1_219_t0003.png 이미지

Predictability of Evaluation Model and t-value

OTSBB9_2019_v18n1_219_t0004.png 이미지

Compare of AUR by Mathods of Analysis

OTSBB9_2019_v18n1_219_t0005.png 이미지

The Result of Modeling

OTSBB9_2019_v18n1_219_t0006.png 이미지

References

  1. Dieudonne, D., CrossCheck : Partnering with First Draft and newsrooms in the leadup to French elections, 2017, Available from : https://www.blog.google/around-the-globe/google-europe/crosscheck-first-draft-newsrooms-french-elections/.
  2. Facebook, https://www.facebook.com/facebookmedia/blog/introducing-the-facebook-journalism-project.
  3. Gahirwal, M. et al., "Fake News Detection, International Journal of Advance Research", Ideas and Innovations in Technology, Vol. 4, No.1, 2018, 817-819.
  4. Gibson, J., The theory of affordances The Ecological Approach to Visual Perception (pp. 127-143), Boston : Houghton Miffin, 1979.
  5. Greiner, M., D. Pfeiffer, and R. Smith, "Principles and practical application of the receiveroperating characteristic analysis for diagnostic tests", Preventive Veterinary Medicine, Vol.45, No.1-2, 2000, 23-41. https://doi.org/10.1016/S0167-5877(00)00115-X
  6. Hong, S.K., "A study of reasonable regulation methods and Understanding for fake newsfocused on interview of specialists-", Korean Speech & Communication Association seminar articles, 2017, 21-35.
  7. Hwang, Y.S. and O.S. Kwon, "A Study on the Conceptualization and Regulation Measures on Fake News-Focused on selfregulation of internet service providers-", Journal of Media Law, Ethics and Policy Research, Vol.16, No.1, 2017, 53-101. https://doi.org/10.26542/JML.2017.4.16.1.53
  8. Jung, M. and D.M. Baek, "Economic cost calculation and implication", Weekly Economic Review, No.736, 2017, 1-15.
  9. Kim, K.M., "Online news diffusion and public opinion formation : A case study of 'controversy over Park Kyung Sin blog posting' ", Journal of Communication Science, Vol.12, No.4, 2012, 35-72.
  10. Korea Press Foundation, Media Education and Measures for fake news through F8, 2018.
  11. Korean Cultural Center in France, "France Public media reform profosal", 2018.
  12. Lee, D.-H., J.-H. Lee, Y.-R. Kim, H.-J. Kim, S.-M. Park, Y.-J. Yang, and W.-B. Shin, "Fake news detection using deep learning", Korea Information Processing Society, Spring Conference, Vol.25, No.1, 2018, 384-387.
  13. Lee, H.-J., C. Rhee, and S.-B. Yang, "Analyzing the Comparative Importance of Heuristic-Systematic Attributes of Online News Content : Focusing on the PWYW Payment Mode", The Journal of Internet Electronic Commerce Research, Vol.18, No.3, 2018, 165-185. https://doi.org/10.37272/JIECR.2018.06.18.3.165
  14. Lee, U.-K., A.R. Lee, and K.K. Kim, "The effect of Anonymity on Virtual Team Performance in Online Communities", The Journal of Society for e-Business Studies, Vol.20, No.1, 2015, 217-241. https://doi.org/10.7838/jsebs.2015.20.1.217
  15. Murdock Jr, B.B., "The serial position effect of free recall", Journal of Experimental Psychology, Vol.64, No.5, 1962, 482. https://doi.org/10.1037/h0045106
  16. Murphy, M., Government bans phrase 'fake news' The Telegraph, 2018, Retrieved from https://www.telegraph.co.uk/technology/2018/10/22/government-bans-phrase-fake-news/.
  17. Murray, A., How to report fake news to social media, in BBC NEWS, 2016.
  18. National ecleciton commission, e-election information, 2017. 6.
  19. Oversampling and undersampling in data analysis, in Wikipedia.
  20. Park, A.R., "Legal issues and measures for fake news", Press and Broadcasing, No.4, 2017, 14.
  21. Park, J.-H. and Y.-I. Kim, "Development of a Fake News Discrimination System using SVM Classifier", Proceedings of KIIT Summer Conference, 2017, 354-355.
  22. Park, Y.J. and J.H. Kim, "The Effect of Social Support in On-line Community on Community Involvement and Self-esteem", The Korean Journal of Health Psychology : Social and Personality Psychology, Vol.19, No.1, 2005, 13-25.
  23. Rubin, V. et al., "Fake news or truth? using satirical cues to detect potentially misleading news", in Proceedings of the Second Workshop on Computational Approaches to Deception Detection, 2016.
  24. Seoul National University Institute of communication Research, "In the Summit scene Global fact 4, Growing Global factcheck", http://factcheck.snu.ac.kr/documents/84, 2017. 7. 7.
  25. Shin, S.J., "Reading sentiment in the articles", International Data Group Tech report, 2014.
  26. Shmueli, G., N.R. Patel, and P.C. Bruce, Data mining for business intelligence: Concepts, techniques, and applications in Microsoft Office Excel with XLMiner : John Wiley and Sons, 2011.
  27. Silverman, C., How To Spot A Deepfake Like The Barack Obama-Jordan Peele Video, 2018, Available from : https://www.buzzfeed.com/craigsilverman/obama-jordan-peeledeepfake-video-debunk-buzzfeed.
  28. Silverman, C., This Analysis Shows How Viral Fake Election News Stories Outperformed Real News On Facebook, BuzzFeedNews, 2016.
  29. Song, I.Y., T.S. Yi, K.J. Shin, and K.C. Kim, "An Integrated Data Mining Model for Customer Relationship Management", Journal of Intelligence and Information Systems, Vol.33, No. 2, 2006, 154-159.
  30. Swets, J.A., "Measuring the accuracy of diagnostic systems", Science, Vol.240, No.4857, 1988, 1285-1293. https://doi.org/10.1126/science.3287615
  31. Tacchini, E. et al., Some like it hoax : Automated fake news detection in social networks, arXiv preprint arXiv : 1704.07506, 2017.
  32. The Council for fake news, The Headquartes of national sovereignty election measures, Activity white paper, 2017.
  33. The Joongangilbo, https://news.joins.com/article/22250147, 2018. 1. 1.
  34. The Kyunghyang Shinmun, http://news.khan.co.kr/kh_news/khan_art_view.html?art_id=201703251717011, 2017. 3. 25.
  35. The Slownews, http://slownews.kr/71699, 2018. 11. 21.
  36. Vosoughi, S., D. Roy, and S. Aral, "The spread of true and false news online", Science, Vol.359, No.6380, 2018, 1146-1151. https://doi.org/10.1126/science.aap9559
  37. Wang, Y. et al., "EANN : Event Adversarial Neural Networks for Multi-Modal Fake News Detection", in Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, ACM, 2018.
  38. Watson, J.B., "Psychology as the behaviorist views it," Psychological Review, Vol.20, No.2, 1913, 158. https://doi.org/10.1037/h0074428
  39. wikipedia, Random forest from https://en.wikipedia.org/wiki/Randomforest.
  40. wikipedia, SEMMA, from https://en.wikipedia.org/wiki/SEMMA.
  41. Wikipedia, User behavior analytics, from https://en.wikipedia.org/wiki/User_behavior_analytics.
  42. Witten, I.H. et al., Data Mining : Practical machine learning tools and techniques, Morgan Kaufmann, 2016.
  43. Yonhap News Agency, https://www.yna.co.kr/view/AKR20170317076700004, 2017. 3. 17.
  44. Yonhap News Agency, https://m.yna.co.kr/view/AKR20181010075400001, 2018. 10. 10.
  45. Yonhap News Agency, https://www.yna.co.kr/view/AKR20170316143600001, 2017. 3. 16.
  46. Yoon, I., "The Research about the User Attribution Method of Digital Files", Journal of Digital Forensics, Vol.11, No.1, 2017, 73-93. https://doi.org/10.22798/KDFS.2017.11.1.73
  47. Yoon, T.W. and H.C. Ahn, "Prediction of domestic fake news by artificial intelligence method", The Korea Society of Management Information Systems Conference, 2017, 365-368.