Investigation of Research Topic and Trends of National ICT Research-Development Using the LDA Model

Exploring Major Research Topics and Trends of National R&D Projects in the ICT Field through LDA Topic Modeling

  • Woo, Chang Woo (Department of Computer Science, Chungbuk National University) ;
  • Lee, Jong Yun (Department of Computer Science, Chungbuk National University)
  • 우창우 (Department of Computer Science, Chungbuk National University) ;
  • 이종연 (Department of Software, Chungbuk National University)
  • Received : 2020.05.07
  • Accepted : 2020.07.20
  • Published : 2020.07.28

Abstract

This study investigates the main research topics and trends of research projects in the information and communication technology (ICT) field carried out under Korea's national research and development (R&D) programs, using latent Dirichlet allocation (LDA), one of the topic modeling techniques. Information on all national R&D projects from the most recent five years was downloaded from the National Science and Technology Information Service (NTIS) and matched against the EZone system of the Institute of Information & Communications Technology Planning & Evaluation (IITP), yielding an experimental dataset of 5,200 ICT R&D projects, to which the LDA model was applied. The results show that the major research topics are intelligent information technologies such as artificial intelligence (AI), big data, and the Internet of Things (IoT), and that hyper-realistic media is an actively pursued research trend. The topic modeling results for national R&D projects are expected to serve as important information for future R&D planning, strategy, policy, and project planning in the ICT field.
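The abstract does not say how the LDA model was fitted, so the following is only a minimal sketch of the general technique: a collapsed Gibbs sampler for LDA written in plain Python. The toy corpus, the number of topics, the hyperparameters `alpha` and `beta`, and the function names `lda_gibbs` and `top_words` are all illustrative assumptions, not the authors' data or settings.

```python
import random

def lda_gibbs(docs, num_topics, alpha=0.1, beta=0.01, iters=200, seed=0):
    """Fit LDA by collapsed Gibbs sampling; docs is a list of token lists."""
    rng = random.Random(seed)
    vocab = sorted({w for doc in docs for w in doc})
    word_id = {w: i for i, w in enumerate(vocab)}
    V = len(vocab)

    doc_topic = [[0] * num_topics for _ in docs]        # tokens in doc d assigned to topic k
    topic_word = [[0] * V for _ in range(num_topics)]   # times word v is assigned to topic k
    topic_total = [0] * num_topics                      # total tokens assigned to topic k
    assign = []                                         # topic of every token

    # Start from a random topic assignment for each token.
    for d, doc in enumerate(docs):
        zs = []
        for w in doc:
            k = rng.randrange(num_topics)
            zs.append(k)
            doc_topic[d][k] += 1
            topic_word[k][word_id[w]] += 1
            topic_total[k] += 1
        assign.append(zs)

    for _ in range(iters):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                v = word_id[w]
                k = assign[d][i]
                # Remove the token's current assignment from the counts.
                doc_topic[d][k] -= 1
                topic_word[k][v] -= 1
                topic_total[k] -= 1
                # Conditional distribution over topics, up to a constant.
                weights = [
                    (doc_topic[d][t] + alpha)
                    * (topic_word[t][v] + beta)
                    / (topic_total[t] + V * beta)
                    for t in range(num_topics)
                ]
                k = rng.choices(range(num_topics), weights=weights)[0]
                assign[d][i] = k
                doc_topic[d][k] += 1
                topic_word[k][v] += 1
                topic_total[k] += 1

    # Smoothed estimates of document-topic (theta) and topic-word (phi) distributions.
    theta = [
        [(c + alpha) / (len(doc) + num_topics * alpha) for c in row]
        for row, doc in zip(doc_topic, docs)
    ]
    phi = [
        [(c + beta) / (topic_total[k] + V * beta) for c in topic_word[k]]
        for k in range(num_topics)
    ]
    return vocab, theta, phi

def top_words(vocab, phi, n=3):
    """Return the n most probable words of each topic."""
    return [
        [vocab[i] for i in sorted(range(len(vocab)), key=lambda i: -row[i])[:n]]
        for row in phi
    ]

# Hypothetical keyword lists standing in for project descriptions.
docs = [
    ["ai", "deep", "learning", "ai", "data"],
    ["big", "data", "analytics", "data", "ai"],
    ["vr", "ar", "media", "hologram", "media"],
    ["media", "vr", "streaming", "hologram", "ar"],
]
vocab, theta, phi = lda_gibbs(docs, num_topics=2)
for k, words in enumerate(top_words(vocab, phi)):
    print(f"topic {k}: {words}")
```

Each row of `theta` gives one document's mixture over topics, and each row of `phi` gives one topic's distribution over the vocabulary. In practice, a study of this scale would more likely rely on an established library implementation and on morphological preprocessing of the project text.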
