DOI QR코드

DOI QR Code

Comparison of graph clustering methods for analyzing the mathematical subject classification codes

  • Choi, Kwangju (Institute of Mathematical Sciences, Ewha Womans University) ;
  • Lee, June-Yub (Department of Mathematics, Ewha Womans University) ;
  • Kim, Younjin (Institute of Mathematical Sciences, Ewha Womans University) ;
  • Lee, Donghwan (Department of Statistics, Ewha Womans University)
  • Received : 2020.05.22
  • Accepted : 2020.07.08
  • Published : 2020.09.30

Abstract

Various graph clustering methods have been introduced to identify communities in social or biological networks. This paper studies the entropy-based and the Markov chain-based methods in clustering the undirected graph. We examine the performance of two clustering methods with conventional methods based on quality measures of clustering. For the real applications, we collect the mathematical subject classification (MSC) codes of research papers from published mathematical databases and construct the weighted code-to-document matrix for applying graph clustering methods. We pursue to group MSC codes into the same cluster if the corresponding MSC codes appear in many papers simultaneously. We compare the MSC clustering results based on the several assessment measures and conclude that the Markov chain-based method is suitable for clustering the MSC codes.

Keywords

References

  1. Chen K and Liu L (2005). The best K for entropy-based categorical data clustering. In Proceedings of the 17th International Conference on Scientific and Statistical Database Management, 253-262.
  2. De Robbio A, Maguolo D, and Marini A (2002). Mathematics Subject Classification and related schemes in the OAI framework. In: Find and Post Mathematics in the Web A workshop on Electronic Information and Communication in Mathematics, Lange Springer-Lecture Notes in Computer Sciences
  3. Kenley EC and Cho YR (2011). Entropy-Based Graph Clustering: Application to Biological and Social Networks. In Proceedings of the 11th IEEE International Conference on Data Mining, 1116-1121.
  4. Lange C, Ion P, Dimou A, Bratsas C, Sperber W, Kohlhase M, and Antoniou I (2012). Bringing mathematics to the web of data: the case of the mathematics subject classification, Extended Semantic Web Conference, 763-777.
  5. Li T, Ma S, and Ogihara M (2004). Entropy-based criterion in categorical clustering. In Proceedings of the 21st International Conference on Machine Learning, 68-75.
  6. Van Dongen S (2008). Graph clustering via a discrete uncoupling process, SIAM Journal on Matrix Analysis and Applications, 30, 121-141. https://doi.org/10.1137/040608635