DOI QR코드

DOI QR Code

A Knowledge Graph on Japanese "Comfort Women": Interlinking Fragmented Digital Archival Resources

일본군 '위안부' 지식그래프: 파편화된 디지털 기록의 연결

  • 박하람 (중앙대학교 일반대학원 문헌정보학과 문헌정보학전공) ;
  • 김학래 (중앙대학교 사회과학대학 문헌정보학과)
  • Received : 2021.07.20
  • Accepted : 2021.08.03
  • Published : 2021.08.31

Abstract

Records on Japanese "Comfort Women" have been individually managed by private sectors or institutions, and some are provided as digital archives on the Internet. However, records of digital archives differ in the composition and representation of metadata by individual institutions. Meanwhile, there is a lack of a consistent structure to describe the relationships between and among these records, leading to their fragmentation and disconnectedness. This paper proposes a knowledge model for interlinking the digital archival resources and builds a knowledge graph by integrating the records from distributed digital archives. It derives common elements by analyzing metadata from the diverse digital archives and expresses them in standard vocabularies to semantically describe multiple entities and relationships of the digital archival resources. In particular, the study includes the refinement of collected data to search and thread dispersed records and the enrichment of external data to provide significant contextual information of records. An evaluation of the knowledge graph is performed via a query measuring the (dis)connectivity between the distributed records. As a result, the knowledge graph is capable of interlinking and retrieving fragmented records, providing substantial contextual information on the records with external data enrichment, and searching accurately to match the user's intentions through semantic-based queries.

일본군 '위안부'에 대한 기록은 민간 기관에서 개별적으로 관리하고 있다. 일부 기록은 디지털 아카이브로 구축되어 온라인으로 접근할 수 있다. 그러나, 디지털 아카이브의 기록은 기관에 따라 메타데이터의 구성과 표현 방식이 다르다. 한편, 기록 사이의 관계를 정의할 수 있는 체계가 미흡하기 때문에, 현재 구축된 일본군 '위안부' 기록은 서로 연결되지 않고 파편적인 형식으로 남아있다. 본 연구는 일본군 '위안부' 디지털 기록을 연계하기 위한 지식 모델을 제안하고, 분산화된 디지털 아카이브의 기록을 통합하여 일본군 '위안부' 지식그래프를 구축한다. 일본군 '위안부' 디지털 아카이브의 메타데이터를 분석하여 공통 요소를 도출하고, 표준 어휘를 적용하여 디지털 기록의 다양한 개체와 개체 사이의 관계를 의미적으로 표현한다. 특히, 흩어져 있는 기록을 연계하고 검색하기 위해 수집한 데이터의 정제가 이루어지고, 외부데이터를 활용하여 기록의 맥락 정보를 강화하고 있다. 구축된 지식그래프의 검증은 분산된 기록의 탐색 여부를 측정하는 질의를 통해 수행된다. 검증 결과, 지식그래프는 흩어져 있는 기록을 연계하여 검색할 수 있고, 외부데이터로부터의 강화로 기록의 맥락 정보를 풍부하게 제공하며, 의미 기반의 검색을 통해 사용자의 의도에 맞춘 정확한 검색이 가능하다.

Keywords

Acknowledgement

이 논문은 2021년도 중앙대학교 CAU GRS 지원에 의하여 작성되었음.

References

  1. Bong, Ji Hyeon & Nam, Young Joon (2019). A Study on the Design of Metadata Elements for Management of Oral History Archives about Sexual Slavery by Japan's Military. Journal of Korean Society of Archives and Records Management, 19(1), 225-250. http://doi.org/10.14404/JKSARM.2019.19.1.225
  2. Chung, Chin-Sung Team at SNU Human Rights Center (2018). To be taken, to be abandoned, to stand before us 1. Seoul: Purunoksa.
  3. Chung, Chin-Sung Team at SNU Human Rights Center (2018). To be taken, to be abandoned, to stand before us 2. Seoul: Purunoksa.
  4. Gender Archive (2019). Japanese 'Comfort Women' Collection. Available: http://www.genderarchive.or.kr/multi-collections/multi-collections/show/id/20
  5. Jeon, Ye Ji & Lee, Hyewon (2020). A Study on the Ontology Modeling by Analyzing RiC-CM v0.2. Journal of Korean Society of Archives and Records Management, 20(1), 139-158. https://doi.org/10.14404/JKSARM.2020.20.1.139
  6. Jeong, Hoemyeong & Lee, Sungsook (2021). A Study on the Application of Records in Contexts-Ontology(RiC-O) for the Description of Archives Contexts in a Digital Environment. Journal of Korean Society of Archives and Records Management, 21(2), 23-48, https://doi.org/10.14404/JKSARM.2021.21.2.023
  7. Kim, Haklae (2021). FAIR Principles: Considerations for Implementing Digital Archives from a Data Perspective, Journal of Korean Society of Archives and Records Management, 21(2), 155-172. https://doi.org/10.14404/JKSARM.2021.21.2.155
  8. Kim, Jeonghyun (2020). Achievements and Tasks of the excavation of 'Japanese military sexual slavery' records in JapanChina-Korea. The Korea-Japan Historical Society, 69, 185-224. https://doi.org/10.18496/kjhr.2020.08.69.185
  9. Kim, Soohyun & Lee, Sungsook (2020). A Study on Archive Description Using RiC-CM. Journal of Korean Society of Archives and Records Management, 20(1), 115-137. https://doi.org/10.14404/JKSARM.2020.20.1.115
  10. Korea Association of Archivists (2020). [Statement] Archivists' group position for neglection and annihiliation of House of Sharing's Japanese 'Comfort Women' records. Available: https://www.archivists.or.kr/1636
  11. Korean Council for Justice and Remembrance for the Issues of Military Sexual Slavery by Japan(Korean Council). (2018), What is Japanese Military Sexual Slavery System? Retrieved May 8th, 2021, from https://womenandwar.net/kr/what-is-japanese-military-sexual-slavery-system/
  12. Kwon, Mi-Hyun (2007). Management and Use of Oral History Archives on Forced Mobilization -Centering on oral history archives collected by the Truth Commission on Forced Mobilization under the Japanese Imperialism Republic of Korea-. Journal of Korean Society of Archives and Records Management, 16, 305-341.
  13. Lee, Na-Young (2010). Womens Movement for/on Comfort Women: Historical Present in the Context of Postcolonial Nation-State. The Journal of Asiatic Studies, 53(3), 41-78.
  14. Lee, Yu-kyeong & Kim, Haklae (2020). A Knowledge Graph of the Korean Financial Crisis of 1997: A Relationship-Oriented Approach to Digital Archives. Journal of Korean Society of Archives and Records Management, 20(4), 1-17, https://doi.org/10.14404/JKSARM.2020.20.4.001
  15. Nam, Young Joo (2017). Memory Replay and Memory Expansion of the Japanese Military Sexual Slavery Records Management Institutions: On the Cases of National Women's Historical Hall. The Journal of Humanities and Social science (HSS21), 8(3), 129-148. https://doi.org/10.22143/HSS21.8.3.8
  16. National Archives of Korea (2013). Japanese 'Comfort Women' records. Available: https://theme.archives.go.kr/next/nationalArchives/topicArchivesList.do?page=1&groupName=comport
  17. National Archives of Korea (2014). National Archives Notice No. 2014-8(Implementation 2014. 12. 26.). Available: https://law.go.kr/LSW/admRulInfoP.do?admRulSeq=2200000025361
  18. National Archives of Korea (2014). What is Nation-designated Archives? Available: https://theme.archives.go.kr/next/nationalArchives/archiveIntro.do
  19. Park, Sun-hee (2019). A Study on Improving Record Contextual Information and Developing Integrated System: Focusing on RiC-CM and RiC-O. The Korean Journal of Archival, Information and Cultural Studies, 9, 55-96.
  20. Park, Zi-young (2017). Transition of Archival Description from ISAD(G) to Record in Context Conceptual Model. Journal of Korean Society of Archives and Records Management, 17(1), 93-115. https://doi.org/10.14404/JKSARM.2017.17.1.093
  21. Research Institute on Japanese Military Sexual Slavery (2020). Archive814. Available: https://www.archive814.or.kr/
  22. Seo, Hyunju (2016). 2006-2016 Research Progress on the Japanese Military Comfort Women Issue and a Future Outlook: Focusing on Historical Studies in South Korea. Dongbuga Yeoksa Nonchong, (53), 197-222. https://doi.org/10.23037/DYN.2016..53.008
  23. Seo, Yeon-Su, Nam, Yeon-Hwa, Park, Ji-Won, Um, So-Young, & Kim, Yong (2016). A Study on the Development of a Metadata Schema for the Records and Archives on the Military Sexual Slavery by Japan. Journal of Korean Society of Archives and Records Management, 16(3), 99-129. https://doi.org/10.14404/JKSARM.2016.16.3.099
  24. Seoul Metropolitan Archives (2019). Japanese 'Comfort Women' records collected by Chung Jin-sung, a research team of Seoul National University, from 2016 to 2018 with the support of the Seoul Metropolitan Government. Available: https://archives.seoul.go.kr/contents/comfort-women
  25. Shin, Heisoo (2021). To register for World Heritages of Japanese 'Comfort Women' archives ➋ 2,744 pieces of history containing the pain of the victim and the efforts of the citizens. Retrieved July 12, 2021, Available: https://www.unesco.or.kr/data/unesco_news/view/780/1273/page/0?
  26. Shin, Mira & Kim, Ikhan (2019). A Study in the Data Modeling for Archive System Applying RiC. Journal of Korean Society of Archives and Records Management, 19(1), 23-67, https://doi.org/10.14404/JKSARM.2019.19.1.023
  27. Women and War Museum (2020). Wednesday Demo Archive Collection. Available: https://www.archivecenter.net/wednesdaydemo/archive/Collection.do
  28. Women and War Museum (2020). Wednesday Demo Archive. Available: https://www.archivecenter.net/wednesdaydemo
  29. Youn, Jihyun (2020). War and Women's Human Rights Museum: Archives are Key. Journal of Korean Society of Archives and Records Management, 20(4), 237-243. https://doi.org/10.14404/JKSARM.2016.16.3.09910.14404/JKSARM.2020.20.4.237
  30. Archives Nationales France (2021). Archives Nationales. Available: https://www.archives-nationales.culture.gouv.fr/en/web/guest/home
  31. Borst, W. N. (1997). Construction of Engineering Ontologies for Knowledge Sharing and Reuse. Enschede: Centre for Telematics and Information Technology (CTIT).
  32. Fensel, D., Simsek, U., Angele, K., Huaman, E., Karle, E., Panasiuk, O., Toma, L., Umbrich, J., & Wahler, A. (2020). Introduction: what is a knowledge graph?. In Knowledge Graphs, New York City: Springer, 1-10.
  33. Freire, N., Charles, V., & Isaac, A. (2018). Evaluation of Schema.org for Aggregation of Cultural Heritage. Preceedings of 15th International Conference on Extended Semantic Web Conference, Heraklion, Greece.
  34. Freire, N., Robson, G., Howard, J. B., Manguinhas, H., & Isaac, A. (2020). Cultural heritage metadata aggregation using web technologies: IIIF, Sitemaps and Schema.org. International Journal on Digital Libraries, 21(1), 19-30. https://doi.org/10.1007/s00799-018-0259-5
  35. Gruber, T. R. (1994). A translation approach to portable ontology specifications. Knowledge Acquisition, 5(2), 199-220. https://doi.org/10.1006/knac.1993.1008
  36. Guha, R.V., Brickly, D., & Macbeth, S. (2016). Schema.org: Evolution of Structured Data on the Web. Communications of the acm, 59(2), 44-51. https://doi.org/10.1145/2844544
  37. Han, M. K., Cole, T. W., Lampron, P., & Sarol, M. J. (2015). Exposing Library Holdings Metadata in RDF Using Schema.org Semantics. Proceedings of International Conference on Dublin Core and Metadata Applications, Sao Paulo, Brazil.
  38. ICA EGAD (2016). Records in Contexts A Conceptual Model for Archival Description draft v0.1. Available: https://www.ica.org/sites/default/files/RiC-CM-0.1.pdf
  39. ICA EGAD (2019). Records in Contexts - Ontology. Available: https://www.ica.org/en/records-in-contexts-ontology
  40. ICA EGAD (2021). RiC-O projects and tools. Available: https://ica-egad.github.io/RiC-O/projects-and-tools.html
  41. Jett, J., Cole, T., W., Han, M. K., & Szylowicz, C. (2017). Linked Open Data (LOD) for Library Special Collections. Proceedings of JCDL '17 The 17th ACM/IEEE-CS Joint Conference on Digital Libraries, Toronto, Canada.
  42. Lampron, P., Mixter, J., & Han M. K. (2016). Challenges of Mapping Digital Collections Metadata to Schema.org: Working with CONTENTdm. In Metadata and Semantics Research. New York City: Springer, 181-186.
  43. Matienzo, M. A., Roke, E. R., & Carlson, S. (2017). Creating a Linked Data-Friendly Metadata Application Profile for Archival Description. Proc. Int'l Conf. on Dublin Core and Metadata Applications 2017, 112-116.
  44. Memobase (2001). Memoriav Memobase. Available: https://memobase.ch/fr/start
  45. Nickel, M., Murphy, K., Tresp, V., & Gabrilovich, E. (2015). A review of relational machine learning for knowledge graphs. Proceedings of the IEEE, 104(1), 11-33. https://doi.org/10.1109/JPROC.2015.2483592
  46. Open Knowledge Foundation (2021). Open Data Handbook. What is Open Data? Available: https://opendatahandbook.org/guide/en/what-is-open-data/
  47. Schema.org (2021). Organization of Schemas. Available: https://schema.org/docs/schemas.html
  48. Shin, H. (2021). Voices of the "Comfort Women": The Power Politics Surrounding the UNESCO Documentary Heritage. The Asia-Pacific Journal, 19(5), 1-19.
  49. Social Networks and Archival Context Cooperative (2021). Social Networks and Archival Context. Available: https://snaccooperative.org/
  50. Studer, R., Benjamins, V. R., & Fensel, D. (1998). Knowledge Engineering: Principles and methods. Data & Knowledge Engineering 25, 161-197. https://doi.org/10.1016/S0169-023X(97)00056-6
  51. UNESCO (2021). Memory of the World. Available: https://en.unesco.org/programme/mow
  52. Vrandecic, D. (2012). Wikidata: A new platform for collaborative data collection. In Proceedings of the 21st international conference on world wide web, 1063-1064, https://doi.org/10.1145/2187980.2188242
  53. W3C Schema Architypes Community Group (2015). W3C Schema Architypes Community Group. Available: https://www.w3.org/community/architypes/
  54. W3C Schema Bib Extend Community Group (2015). W3C Schema Bib Extend Community Group. Available: https://www.w3.org/community/schemabibex/
  55. Zou, X. (2020). A survey on application of knowledge graph. In Journal of Physics. Paper presented at 4th International Conference on Control Engineering and Artificial Intelligence (CCEAI 2020), Singapore.