DOI QR코드

DOI QR Code

오픈소스 소프트웨어를 활용한 자연어 처리 패키지 제작에 관한 연구

Research on Natural Language Processing Package using Open Source Software

  • 투고 : 2016.11.17
  • 심사 : 2016.12.27
  • 발행 : 2016.12.31

초록

Purpose In this study, we propose the special purposed R package named ""new_Noun()" to process nonstandard texts appeared in various social networks. As the Big data is getting interested, R - analysis tool and open source software is also getting more attention in many fields. Design/methodology/approach With more than 9,000 R packages, R provides a user-friendly functions of a variety of data mining, social network analysis and simulation functions such as statistical analysis, classification, prediction, clustering and association analysis. Especially, "KoNLP" - natural language processing package for Korean language - has reduced the time and effort of many researchers. However, as the social data increases, the informal expressions of Hangeul (Korean character) such as emoticons, informal terms and symbols make the difficulties increase in natural language processing. Findings In this study, to solve the these difficulties, special algorithms that upgrade existing open source natural language processing package have been researched. By utilizing the "KoNLP" package and analyzing the main functions in noun extracting command, we developed a new integrated noun processing package "new_Noun()" function to extract nouns which improves more than 29.1% compared with existing package.

키워드

참고문헌

  1. 권순창, "A Study on the Use of Open Source Software in Vocational Education," 한국전산회계학회 정기학술발표회, 2007, pp. 165-169.
  2. 김상현, 송영미, "오픈소스 소프트웨어의 지속적인 사용의도에 영향을 미치는 요인에 관한 연구," 인터넷전자상거래연구, 제9권, 제1호, 2009, pp. 257-280.
  3. 김성용, 이상민. "무선 인터넷 망에서 임베디드 리눅스 기반 PDA 를 이용한 영상보드 원격제어 시스템 구현," 정보시스템연구 제17권, 제1호 2008, pp. 155-171.
  4. 김용현, 허의남, "Log Analysis Supporting System based on Log Data for Efficient Big Data Analysis," 한국정보과학회 학술발표논문집, 2014, pp. 936.
  5. 문상식, 김기홍, "IT 환경 변화에 따른 한국의 오픈소스 소프트웨어의 정책방향 연구," 인터넷전자상거래연구, 제14권, 제1호, 2014, pp. 203-221.
  6. 사공원, 하성호, 박경배. "온라인 후기에 내재된 고객의 감성분석과 LQI 차원별 호텔서비스 품질 평가," 정보시스템연구 제25권, 제3호, 2016, pp. 217-245.
  7. 손수아, 박석천, "IoT 기반 실시간 시각화 알고리즘을 이용한 스마트가드닝 시스템 설계 및 구현," 정보교육학회논문지, 제16권, 제6호, 2015, pp. 31-37.
  8. 박정웅, 최영민, 박희동, "오픈 소스 하드웨어 기반의 스마트 온실관리 시스템 설계 및 구현," 디지털융복합연구, 제14권, 제2호, 2016, pp. 259-264. https://doi.org/10.14400/JDC.2016.14.2.259
  9. 신수범, "Teaching and Learning Strategies of Computer Algorithms using Robot," 한국엔터테인먼트산업학회 학술대회 논문집, 2015, pp. 39-42.
  10. 안정국, 김희웅, "Building a Korean Sentiment Lexicon Using Collective Intelligence," 지능정보연구, 제21권, 제2호, 2015, pp. 49-64.
  11. 장영재, "튜토리얼: 빅데이터, 비즈니스 애널리틱스, IoT: 경영의 새로운 도전과 기회," 정보시스템연구, 제24권, 제4호, 2015, pp. 139-152.
  12. 한만휘, 박성찬, 이한빛, 연종흠, 이상구, "한국어 언어자원에서의 자연어 처리 기술현황 조사," 한국정보과학회 학술발표논문집, 2015, pp. 681-683.
  13. Black, E. W., "Wikipedia and academic peer review: Wikipedia as a recognised medium for scholarly publication?," Online Information Revie, Vol. 32, No. 1, 2008, pp. 73-88. https://doi.org/10.1108/14684520810865994
  14. Cachia, R., R. Compano, and O. D. Costa, "Grasping the potential of online social networks for foresight," Technological Forecasting and Social Change, Vol. 74, No. 8, 2007, pp. 1179-1203. https://doi.org/10.1016/j.techfore.2007.05.006
  15. Lee J, Le HS, Lee H. "Research on Methods for Processing Nonstandard Korean Words on Social Network Services," Journal of the Korea Industrial Information Systems Research, Vol. 21, No. 3, 2016, pp. 35-46. https://doi.org/10.9723/JKSIIS.2016.21.3.035
  16. Lee, J. H. and Lee, H. K., "A Study on Unstructured Text Mining Algorithm through R Programming based on Data Dictionary," Journal of the Korea Society Industrial Information System, Vol. 20, No. 2, 2015, pp. 113-124. https://doi.org/10.9723/jksiis.2015.20.2.113
  17. Lee, J. H, Le. H. S. and Lee, H. K., "A Study on Customer Reviews about Domestic and Imported Clothes Products through Opinion Mining," The Journal of Internet Electronic Commerce Research, Vol. 15, No. 3, 2015, pp. 233-234.
  18. LE, H., LEE, J. H. and LEE, H. K., "Purchase Process Aspect-based Opinion Mining : An Application for Online Shopping Mall," The Journal of Internet Electronic Commerce Research, Vol. 15, No. 2, 2015, pp. 15-28.
  19. http://www.nipa.kr
  20. https://www.r-project.org/