DOI QR코드

DOI QR Code

계량서지적 분석에서 지적구조 매핑을 위한 링크 삭감 알고리즘의 적합도 측정

Measuring the Goodness of Fit of Link Reduction Algorithms for Mapping Intellectual Structures in Bibliometric Analysis

  • 이재윤 (명지대학교 인문대학 문헌정보학과)
  • 투고 : 2022.05.15
  • 심사 : 2022.06.07
  • 발행 : 2022.06.30

초록

지적구조 분석을 위해 가중 네트워크를 시각화해야 하는 경우에 패스파인더 네트워크와 같은 링크 삭감 알고리즘이 널리 사용되고 있다. 이 연구에서는 네트워크 시각화를 위한 링크 삭감 알고리즘의 적합도를 측정하기 위한 지표로 NetRSQ를 제안하였다. NetRSQ는 개체간 연관성 데이터와 생성된 네트워크에서의 경로 길이 사이의 순위 상관도에 기반하여 네트워크의 적합도를 측정한다. NetRSQ의 타당성을 확인하기 위해서 몇 가지 네트워크 생성 방식에 대해 정성적으로 평가를 했었던 선행 연구의 데이터를 대상으로 시험적으로 NetRSQ를 측정해보았다. 그 결과 품질이 좋게 평가된 네트워크일수록 NetRSQ가 높게 측정됨을 확인하였다. 40가지 계량서지적 데이터에 대해서 4가지 링크 삭감 알고리즘을 적용한 결과에 대해서 NetRSQ로 품질을 측정하는 실험을 수행한 결과, 특정 알고리즘의 네트워크 표현 결과가 항상 좋은 품질을 보이는 것은 아니며, 반대로 항상 나쁜 품질을 보이는 것도 아님을 알 수 있었다. 따라서 이 연구에서 제안한 NetRSQ는 생성된 계량서지적 네트워크의 품질을 측정하여 최적의 기법을 선택하는 근거로 활용될 수 있을 것이다.

Link reduction algorithms such as pathfinder network are the widely used methods to overcome problems with the visualization of weighted networks for knowledge domain analysis. This study proposed NetRSQ, an indicator to measure the goodness of fit of a link reduction algorithm for the network visualization. NetRSQ is developed to calculate the fitness of a network based on the rank correlation between the path length and the degree of association between entities. The validity of NetRSQ was investigated with data from previous research which qualitatively evaluated several network generation algorithms. As the primary test result, the higher degree of NetRSQ appeared in the network with better intellectual structures in the quality evaluation of networks built by various methods. The performance of 4 link reduction algorithms was tested in 40 datasets from various domains and compared with NetRSQ. The test shows that there is no specific link reduction algorithm that performs better over others in all cases. Therefore, the NetRSQ can be a useful tool as a basis of reliability to select the most fitting algorithm for the network visualization of intellectual structures.

키워드

과제정보

이 논문은 2021년도 명지대학교 교내연구비를 지원받아 수행되었음.

참고문헌

  1. Choi, Sanghee & Lee, Jae Yun (2019). Analyzing use of the words 'Industrial Revolution' and 'the 4th Industrial Revolution' and related keywords in academic disciplines. The Journal of Humanities, 40(4), 157-188. http://doi.org/10.22947/ihmju.2019.40.4.006
  2. Choi, Sanghee (2017). Analysis of author image based on book recommendation from readers. Journal of the Korean Society for Information Management, 34(4), 153-171. http://doi.org/10.3743/KOSIM.2017.34.4.153
  3. Jeong, Dong-Youl & Choi, Yoon Mee (1999). An experimental study of cocitation analysis on web information. Journal of the Korean Society for Information Management, 16(2), 7-26.
  4. Joe, Seon-Rye & Lee, Jae Yun (2012). Journal co-citation analysis for library services in pharmaceutics. Journal of Information Management, 43(1), 159-185. http://dx.doi.org/10.1633/JIM.2012.43.1.159
  5. Kim, Do-Mi (1993). A study on intellectual structure using author co-citation analysis and indexing term analysis of citing documents: application to economics. Journal of Information Management, 24(1), 32-57.
  6. Kim, Heejeon & Cho, Hyun Yang (2010). A study on intellectual structure using author co-citation analysis and author bibliographic coupling analysis in the field of social welfare science. Journal of the Korean Society for Information Management, 27(3), 283-306. http://doi.org/10.3743/KOSIM.2010.27.3.283
  7. Kim, Jo-Ah & Lee, Jae Yun (2016). Analyzing the research fronts of women's studies in Korea using citation image makers profiling. Journal of the Korean Society for Information Management, 33(2), 201-225. http://doi.org/10.3743/KOSIM.2016.33.2.201
  8. Kim, Pan Jun & Lee, Jae Yun (2007). Descriptor profiling for research domain analysis. Journal of the Korean Society for Information Management, 24(4), 285-303. http://doi.org/10.3743/KOSIM.2007.24.4.285
  9. Kim, Youngjoon & Kim, ChaeHwan (2007). Mapping the intellectual structure of communication research field in Korea: an author co-citation analysis, 1989~2006. Koran Journal of Communication Studies, 15(3), 156-185.
  10. Lee, Eun Suk (2003). An Author Co-citation Analysis of Multiple Authorship in the Fields of Information Science and Computer Science. Master's thesis, Graduate School of Yonsei University.
  11. Lee, Jae Yun & Choi, Sanghee (2013). Collaboration networks and document Networks in informetrics research from 2001 to 2011: Finding influential nations, institutions, documents. Journal of the Korean Society for Information Management, 30(1), 179-191. http://doi.org/10.3743/KOSIM.2013.30.1.179
  12. Lee, Jae Yun & Chung, EunKyung (2022). Introducing Keyword Bibliographic Coupling Analysis (KBCA) for identifying the intellectual structure. Journal of the Korean Society for Information Management, 39(1), 309-330, https://doi.org/10.3743/KOSIM.2022.39.1.309
  13. Lee, Jae Yun & Jung, Ju Hee (2006). Examining the interdisciplinary structure of Korean cognitive science through analyzing author affiliations and title words. Proceedings of the 13th Conference of the Korean Society for Information Management, 127-134.
  14. Lee, Jae Yun & Kim, Heejung (2011). Bibliometric analysis for the research support service at International Vaccine Institute. Proceedings of the 18th Conference of the Korean Society for Information Management, 11-16.
  15. Lee, Jae Yun & Kim, Soojung (2016). A bibliometric analysis of research trends on disaster in Korea. Journal of the Korean Society for Information Management, 33(4), 103-124. http://doi.org/10.3743/KOSIM.2016.33.4.103
  16. Lee, Jae Yun (2006). A study on the network generation methods for examining the intellectual structure of knowledge domains. Journal of the Korean Society for Library and Information Science, 40(2), 333-355. http://doi.org/10.4275/KSLIS.2006.40.2.333
  17. Lee, Jae Yun (2007). Clustering-based network generation algorithm. Proceedings of the 14th Conference of Korean Society for Information Management, 147-154.
  18. Lee, Jae Yun (2008). Analyzing the network of academic disciplines with journal contributions of Korean researchers. Journal of the Korean Society for Information Management, 25(4), 327-345. http://doi.org/10.3743/KOSIM.2008.25.4.327
  19. Lee, Jae Yun (2012). A measure of goodness of fit for network representations. Proceedings of the 19th Conference of Korean Society for Information Management, 133-136.
  20. Lee, Jae Yun (2013). A comparison study on the weighted network centrality measures of tnet and WNET. Journal of the Korean Society for Information Management, 30(4), 241-264. http://doi.org/10.3743/KOSIM.2013.30.4.241
  21. Lee, Jae Yun (2017). Deep learning research trends analysis with ego centered topic citation analysis. Journal of the Korean Society for Information Management, 34(4), 7-32. http://doi.org/10.3743/KOSIM.2017.34.4.007
  22. Lee, Jae Yun (2019). Analyzing research trends of the journal Language Facts and Perspectives. with ego-centered network analysis. Presentations at the 117th Workshop of the Institute of Language & Information Studies, Yonsei University.
  23. Lee, Jae Yun (2020). Analyzing the intellectual structure of school library researches with citation-weighted author profiling. Journal of the Korean Society for Library and Information Science, 54(2), 197-223. http://doi.org/10.4275/KSLIS.2020.54.2.197
  24. Lee, Sung-Sook (2005). A study on changes of the intellectual structure in web information using the co-links analysis. Journal of the Korean Society for Library and Information Science, 22(2), 205-228. http://doi.org/10.3743/KOSIM.2005.22.2.205
  25. Riew, Hee Kyun & Eum, Key Hyoung (2016). An inquiry on the 'intellectual structure' in Korea teacher policy researches: Focusing on author bibliographic coupling analysis. Korean Journal of Educational Administration, 34(5), 297-327.
  26. Ryoo, Jong-duk & Choi, Eun-Ju (2011). A comparison test on the potential utility between author profiling analysis (APA) and author co-citation analysis (ACA). Journal of the Korean Society for Information Management, 28(1), 123-144. https://doi.org/10.3743/KOSIM.2011.28.1.123
  27. Yoo, Yeong Jun & Lee, Jae Yun (2016). A bibliographic study on the Calvin Theological Journal. Journal of the Korean Biblia Society for Library and Information Science, 27(4), 125-145. http://doi.org/10.14699/kbiblia.2016.27.4.125
  28. Yoo, Yeong Jun & Lee, Jae Yun (2019). A bibliometric study on foreign reformed theological journals. Journal of the Korean Biblia Society for Library and Information Science, 30(3), 149-170. http://doi.org/10.14699/kbiblia.2019.30.3.149
  29. Yoo, Yeong Jun & Lee, Jae Yun (2020). A bibliometric study on the KCI listed theological journals. Journal of the Korean Biblia Society for Library and Information Science, 31(3), 5-27. http://doi.org/10.14699/kbiblia.2020.31.3.005
  30. Yoon, JungWon, Chung, Eunkyung, & Lee, Jae Yun (2019). An investigation on scholarly publications using HINTS data by bibliometric analysis. Ewha Journal of Social Sciences, 35(1), 121-152. http://doi.org/10.16935/ejss.2019.35.1.004
  31. Arleo, A., Kwon, O. H., & Ma, K. L. (2017). GraphRay: distributed pathfinder network scaling. Proceedings of the IEEE 7th Symposium on Large Data Analysis and Visualization (LDAV), 74-83.
  32. Barabasi, A. L. (2016). Network Science. UK: Cambridge University Press.
  33. Hauguel, S., Zhai, C., & Han, J. (2009). Parallel pathfinder algorithms for mining structures from graphs. 2009 Ninth IEEE International Conference on Data Mining, 812-817. https://doi.org/10.1109/ICDM.2009.142
  34. Kim, Pan Jun, Lee, Jae Yun, & Park, Ji-Hong (2009). Developing a new collection-evaluation method: mapping and the user-side h-index. Journal of the American Society for Information Science and Technology, 60(11), 2366-2377. https://doi.org/10.1002/asi.21159
  35. Lee, Jae Yun, Kim, Heejung, & Kim, Pan Jun (2010). Domain analysis with text mining: analysis of digital library research trends using profiling methods. Journal of Information Science, 36(2), 144-161. https://doi.org/10.1177/0165551509353251
  36. McCain, K. W. (1986). Cocited author mapping as a valid representation of intellectual structure. Journal of the American Society for Information Science, 37(3), 111-122. https://doi.org/10.1002/(SICI)1097-4571(198605)37:3<111::AID-ASI2>3.0.CO;2-D
  37. Quirin, A., Cordon, O., Guerrero-Bote, V. P., Vargas-Quesada, B., & Moya-Anegon, F. (2008). A quick MST-based algorithm to obtain Pathfinder networks (∞, n - 1). Journal of the American Society for Information Science and Technology, 59(12), 1912-1924. https://doi.org/10.1002/asi.20904
  38. Schvaneveldt, R. W. (1990). Pathfinder Associative Networks: Studies in Knowledge Organization. Norwood, NJ: Ablex Publishing Corporation.
  39. Torgerson, W. S. (1952). Multidimensional scaling: I. Theory and method. Psychometrika, 17, 401-419. https://doi.org/10.1007/BF02288916
  40. White, H. D. & Griffith, B. C. (1981). Author cocitation: a literature measure of intellectual structure. Journal of the American Society for Information Science, 32(3), 163-171. https://doi.org/10.1002/asi.4630320302
  41. White, H. D. (2003). Pathfinder networks and author cocitation analysis: a remapping of paradigmatic information scientists. Journal of the American Society for Information Science and Technology, 54(5), 423-434. https://doi.org/10.1002/asi.10228
  42. Young, F. W. (1987). Multidimensional Scaling: History, Theory, and Applications. Hillsdale, N.J.: L. Erlbaum Associates.
  43. Zweig, K. A. (2016). Network Analysis Literacy: A Practical Approach to the Analysis of Networks. Austria: Springer-Verlag. https://doi.org/10.1007/978-3-7091-0741-6