A Study on Scientific Article Recommendation System with User Profile Applying TPIPF

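  • 장령령 (Department of Library and Information Science, Graduate School, Chonnam National University);
  • 장우권 (Department of Library and Information Science, Chonnam National University)
  • Received: 2016.03.05
  • Reviewed: 2016.03.22
  • Published: 2016.03.30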

Abstract

Because of today's explosive growth of information, users must spend a great deal of time and effort to find what they want. To address this problem, scientific article recommendation systems have emerged that analyze users' information needs and recommend suitable articles. However, most of these systems overlook the user profile, which is the core component of a scientific article recommendation system and largely determines its performance. This study therefore proposes computing the user profile with a new weighting, TPIPF (Topic Proportion-Inverse Paper Frequency), instead of the mean used in previous studies. Both the proposed method and the existing mean-based method were applied to a scientific article recommendation system, and their accuracy was compared in experiments on a public dataset provided by CiteULike, an online reference manager. In these experiments, the recommendation system using the proposed TPIPF method performed better than the mean-based one.
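The abstract names TPIPF but does not define it. The sketch below shows one plausible reading, by analogy with TF-IDF. It is an assumption and not the authors' formula. Each paper is represented by a vector of topic proportions, such as a topic model like LDA might produce. The baseline profile is the mean of the vectors in the user's library. The TPIPF-style profile multiplies that mean by an assumed inverse paper frequency, log(N / n_k), where n_k is the number of papers whose proportion of topic k reaches a threshold. The function names, the threshold of 0.1, the cosine ranking, and the toy data are all hypothetical.

```python
import math


def mean_profile(user_papers):
    """Baseline profile: per-topic mean of the user's paper vectors."""
    n_topics = len(user_papers[0])
    return [sum(p[k] for p in user_papers) / len(user_papers)
            for k in range(n_topics)]


def inverse_paper_frequency(corpus, threshold=0.1):
    """Assumed IPF: log(N / n_k), where n_k counts the papers whose
    proportion of topic k is at least `threshold` (hypothetical cutoff)."""
    n_papers = len(corpus)
    ipf = []
    for k in range(len(corpus[0])):
        n_k = sum(1 for p in corpus if p[k] >= threshold)
        # A topic that no paper reaches gets weight 0 in this sketch.
        ipf.append(math.log(n_papers / n_k) if n_k else 0.0)
    return ipf


def tpipf_profile(user_papers, corpus, threshold=0.1):
    """Assumed TPIPF-style profile: mean topic proportion times IPF."""
    tp = mean_profile(user_papers)
    ipf = inverse_paper_frequency(corpus, threshold)
    return [t * w for t, w in zip(tp, ipf)]


def cosine(a, b):
    """Cosine similarity; returns 0.0 if either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def recommend(profile, candidates, top_n=1):
    """Rank candidate paper ids by cosine similarity to the profile."""
    ranked = sorted(candidates,
                    key=lambda pid: cosine(profile, candidates[pid]),
                    reverse=True)
    return ranked[:top_n]


# Toy topic proportions, invented for illustration (each row sums to 1).
corpus = {
    "p1": [0.75, 0.20, 0.05],
    "p2": [0.65, 0.30, 0.05],
    "p3": [0.50, 0.05, 0.45],
    "p4": [0.60, 0.05, 0.35],
    "c1": [0.25, 0.60, 0.15],
}
library = ["p1", "p2"]  # papers the hypothetical user has saved
user_vectors = [corpus[pid] for pid in library]
candidates = {pid: vec for pid, vec in corpus.items() if pid not in library}

baseline = mean_profile(user_vectors)
weighted = tpipf_profile(user_vectors, list(corpus.values()))

print("mean profile top pick:", recommend(baseline, candidates))
print("TPIPF-style top pick:", recommend(weighted, candidates))
```

In this toy corpus, topic 0 appears in every paper, so its assumed IPF weight is zero and it no longer dominates the profile. This is the same intuition that motivates IDF in text retrieval. The toy ranking only illustrates the mechanism and says nothing about the experimental results reported in the paper.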
