DOI QR코드

DOI QR Code

Comparison of journal clustering methods based on citation structure

논문 인용에 따른 학술지 군집화 방법의 비교

  • Kim, Jinkwang (Department of Statistics, Yeungnam University) ;
  • Kim, Sohyung (Academic Infrastructure Promotion Team, National Research Foundation of Korea) ;
  • Oh, Changhyuck (Department of Statistics, Yeungnam University)
  • 김진광 (영남대학교 통계학과) ;
  • 김소형 (한국연구재단 학술기반진흥팀) ;
  • 오창혁 (영남대학교 통계학과)
  • Received : 2015.04.03
  • Accepted : 2015.06.20
  • Published : 2015.07.31

Abstract

Extraction of communities from a journal citation database by the citation structure is a useful tool to see closely related groups of the journals. SCI of Thomson Reuters or SCOPUS of Elsevier have had tried to grasp community structure of the journals in their indices according to citation relationships, but such a trial has not been made yet with the Korean Citation Index, KCI. Therefore, in this study, we extracted communities of the journals of the natural science area in KCI, using various clustering algorithms for a social network based on citations among the journals and compared the groups obtained with the classfication of KCI. The infomap algorithm, one of the clustering methods applied in this article, showed the best grouping result in the sense that groups obtained by it are closer to the KCI classification than by other algorithms considered and reflect well the citation structure of the journals. The classification results obtained in this study might be taken consideration when reclassification of the KCI journals will be made in the future.

학술지 인용 데이터베이스에서 네트워크 구조분석을 통해 학술지의 공동체를 추출하는 것은 인용관계에 따른 학술지의 집단을 파악하는 유용한 수단이다. 전 세계적으로 널리 활용되는 학술지 인용데이터베이스인 Thomson Reuters의 SCI나 Elsevier의 SCOPUS가 제공하는 자료를 활용하여 인용관계에 따른 공동체 구조를 파악하는 시도가 이루어진 바 있으나, 국내 학술지 인용 데이터베이스인 KCI에서는 이러한 연구가 현재까지는 이루어지지 않은 것으로 알려져 있다. 따라서 본 연구에서는 기존의 여러 가지 네트워크 군집화 알고리즘을 이용하여 KCI에 등재되어 있는 자연과학 분야 학술지를 대상으로 인용관계에 따른 공동체를 파악하고 KCI에 등록된 학술지 분류와 비교하여 보았다. 적용된 군집화 방법 중 인포맵 알고리즘에 의한 분류가 KCI 등재 자연과학 분야 학술지의 인용관계 구조를 잘 반영하며, 기존의 KCI 분류와 가장 유사한 것으로 나타났다. 본 연구를 통해 얻은 KCI의 기존 분류와 차이점들은 장차 KCI 학술지의 재분류가 이루어질 시 고려의 대상이 될 수도 있을 것이다.

Keywords

References

  1. Agarwal, G. and Kempe, D. (2008). Modularity-maximizing graph communities via mathematical programming. The European Physical Journal B-Condensed Matter and Complex Systems, 66, 409-418. https://doi.org/10.1140/epjb/e2008-00425-1
  2. Arenas, A., Duch, J., Fernandez, A. and Gomez, S. (2007). Size reduction of complex networks preserving modularity. New Journal of Physics, 9, 176. https://doi.org/10.1088/1367-2630/9/6/176
  3. Blondel, V., Guillaume, J. L., Lambiotte, R. and Lefebvre, E. (2008). Fast unfolding of communities in large networks. Journal of Statistical Mechanics: Theory and Experiment, 2008, P10008.
  4. Brandes, U., Delling, D., Gaertler, M., Gorke, R., Hoefer, M., Nikoloski, Z. and Wagner, D. (2008). On modularity clustering. IEEE Transactions on Knowledge and Data Engineering, 20, 172-188. https://doi.org/10.1109/TKDE.2007.190689
  5. Carpenter, M. P. and Narin, F. (1973). Clustering of scientific journals. Journal of the American Society for Information Science, 24, 425-436. https://doi.org/10.1002/asi.4630240604
  6. Clauset, A., Newman, M. and Moore, C. (2004). Finding community structure in very large networks. Physical Review E, 70, 66111. https://doi.org/10.1103/PhysRevE.70.066111
  7. Dinh, T. N. and Thai, M. T. (2013). Towards optimal community detection: From trees to general weighted networks. Internet Mathematics (accepted pending revision).
  8. Fawcett, T. (2006). An introduction to ROC analysis. Pattern recognition letters, 27, 861-874. https://doi.org/10.1016/j.patrec.2005.10.010
  9. Fortunato, S. (2010). Community detection in graphs. Physics Reports, 486, 74-174.
  10. Girvan, M. and Newman, M. E. (2002). Community structure in social and biological networks. Proceedings of the National Academy of Sciences, 99, 7821-7826. https://doi.org/10.1073/pnas.122653799
  11. Jeong, E. S., Cho, D. Y., Suh, I. W. and Yeo, W. D. (2008). Emerging research field selection of construction & transportation sectors using scientometrics. The Journal of the Korea Contents Association, 8, 231-238. https://doi.org/10.5392/JKCA.2008.8.2.231
  12. Kim, H. (2008). Citation flow of the ASIST proceedings using pathfinder network analysis. Journal of the Korean Society for Information Management, 25, 157-166. https://doi.org/10.3743/KOSIM.2008.25.2.157
  13. Kim, J. A. and Lee, H. S. (2008). A study on network analysis for science and technology activity. Proceedings of the Autumn Conference of the Korean Operations Research and Management Science Society, 498-503.
  14. Lancichinetti, A. and Fortunato, S. (2009a). Benchmarks for testing community detection algorithms on directed and weighted graphs with overlapping communities. Physical Review E, 80, 016118. https://doi.org/10.1103/PhysRevE.80.016118
  15. Lancichinetti, A. and Fortunato, S. (2009b). Community detection algorithms: A comparative analysis. Physical review E, 80, 056117. https://doi.org/10.1103/PhysRevE.80.056117
  16. Levorato, V. and Petermann, C. (2011). Detection of communities in directed networks based on strongly p-connected components. International Conference on Computational Aspects of Social Networks, CASoN, IEEE, 211-216.
  17. Leydesdorff, L. (2004). Clusters and Maps of Science Journals Based on Bi-connected Graphs in the Journal Citation Reports. Journal of Documentation, 9, 715-723.
  18. Lin, W., Kong, X., Yu, P. S., Wu, Q., Jia, Y. and Li, C. (2012). Community detection in incomplete information networks. In Proceedings of the 21st international conference on World Wide Web, ACM, 341-350.
  19. Malliaros, F. D. and Vazirgiannis, M. (2013). Clustering and community detection in directed networks:A survey. Physics Reports Journal, 533, 95-142. https://doi.org/10.1016/j.physrep.2013.08.002
  20. Narin, F., Carpenter, M. and Berlt, N. (1972). Interrelationships of scientific journals. Journal of the American Society for Information Science, 23, 323-331. https://doi.org/10.1002/asi.4630230508
  21. Newman, M. and Girvan, M. (2003). Mixing patterns and community structure in networks. in Statistical Mechanics of Complex Networks, 625, 66-87. https://doi.org/10.1007/978-3-540-44943-0_5
  22. Newman, M. and Girvan, M. (2004). Finding and evaluating community structure in networks. Physical Review E, 69, 26113. https://doi.org/10.1103/PhysRevE.69.026113
  23. Newman, M. E. (2004). Fast algorithm for detecting community structure in networks. Physical review E, 69, 066133. https://doi.org/10.1103/PhysRevE.69.066133
  24. Newman, M. E. and Leicht, E. A. (2007). Mixture models and exploratory analysis in networks. Proceedings of the National Academy of Sciences, 104, 9564-9569. https://doi.org/10.1073/pnas.0610537104
  25. Park, C. (2013). Simple principle component analysis using Lasso. Journal of the Korean Data & Information Science Society, 24, 533-541. https://doi.org/10.7465/jkdi.2013.24.3.533
  26. Pons, P. and Latapy, M. (2006). Computing communities in large networks using random walks. Journal of Graph Algorithms and Applications,, 10, 191-218. https://doi.org/10.7155/jgaa.00124
  27. Radicchi, F., Castellano, C., Cecconi, F., Loreto, V. and Parisi, D. (2004). Defining and identifying communities in networks. Proceedings of the National Academy of Sciences of the United States of America, 101, 2658-2663. https://doi.org/10.1073/pnas.0400054101
  28. Raghavan, U. N., Albert, R. and Kumara, S. (2007). Near linear time algorithm to detect community structures in large-scale networks. Physical Review E, 76, 036106. https://doi.org/10.1103/PhysRevE.76.036106
  29. Rosvall, M. and Bergstrom, C. T. (2008). Maps of random walks on complex networks reveal community structure. Proceedings of the National Academy of Sciences, 105, 1118-1123. https://doi.org/10.1073/pnas.0706851105
  30. Schaeffer, S. E. (2007). Graph clustering. Computer Science Review, 1, 27-64. https://doi.org/10.1016/j.cosrev.2007.05.001
  31. Ward Jr, J. H. (1963). Hierarchical grouping to optimize an objective function. Journal of the American statistical association, 58, 236-244. https://doi.org/10.1080/01621459.1963.10500845
  32. Zhang, A., Ren, G., Cao, H., zhu Jia, B. and bin Zhang, S. (2013). Generalization of label propagation algorithm in complex networks. In Control and Decision Conference (CCDC), 2013 25th Chinese, IEEE, 1306-1309.
  33. Zhang, L., Liu, X., Janssens, F., Liang, L. and Glanzel, W. (2010). Subject clustering analysis based on ISI category classification. Journal of Informetrics, 4, 185-193. https://doi.org/10.1016/j.joi.2009.11.005

Cited by

  1. A classification of the journals in KCI using network clustering methods vol.27, pp.4, 2016, https://doi.org/10.7465/jkdi.2016.27.4.947