• Title/Abstract/Keywords: Data Repository

Search results: 424 items (processing time: 0.029 seconds)

Global Data Repository Status and Analysis: Based on Korea, China and Japan Data in re3data.org

  • Kim, Suntae
    • International Journal of Knowledge Content Development & Technology
    • /
    • Vol. 8, No. 1
    • /
    • pp.79-89
    • /
    • 2018
  • We collected and analyzed data from re3data.org, a global registry of data repository services. We analyzed the data profiles of three leading Asian economies (Korea, China, and Japan) against the reference data for other participating countries. In particular, we examined how individual countries contribute to the registry, along with organizational type, versioning and product quality management, and subject tagging. We conclude that all three Asian countries still fall short in terms of involvement. As for participating institutions, there are 7 from Korea, 64 from China, and 120 from Japan. Among the Chinese organizations, 3 are for-profit and 61 are non-profit, and 37 (1.8%) are involved in repository building. Of the Japanese organizations, 1 is commercial and 119 are non-profit, and 57 (3.0%) are involved in repository building. All 7 organizations from Korea are non-profit, and 6 of them (0.3%) are involved in repository building. With regard to versioning and product quality management, Korea, China, and Japan are on par with other countries. Subject analysis reveals that Korea contributes more to geosciences and Japan to physics and geosciences, while China, unlike Korea and Japan, is more active in life sciences. It is hoped that this study will help in planning domestic infrastructure for research data repositories with proper consideration for specific research domains and national characteristics.

Functional Requirements of Data Repository for DMP Support and CoreTrustSeal Authentication

  • Kim, Sun-Tae
    • International Journal of Knowledge Content Development & Technology
    • /
    • Vol. 10, No. 1
    • /
    • pp.7-20
    • /
    • 2020
  • For research data to be shared without legal, financial, and technical barriers in the Open Science era, data repositories must meet the functional requirements set by DMP and CoreTrustSeal. To derive functional requirements for a data repository, this study analyzed the Data Management Plan (DMP) and CoreTrustSeal, the certification criteria for research data repositories. Deposit, Ethics, License, Discovery, Identification, Reuse, Security, Preservation, Accessibility, Availability, and (Meta) Data Quality, which are required by both DMP and CoreTrustSeal, were derived as the functional requirements that should be implemented first when building a data repository. Confidentiality, Integrity, Reliability, Archiving, Technical Infrastructure, Documented Storage Procedure, Organizational Infrastructure, (Meta) Data Evaluation, and Policy functions were further derived from CoreTrustSeal. The functional requirements derived in this study may serve as key functions when developing a repository. They could also be used as key items for introducing repository functions to researchers who deposit data.

Comparative Analysis of Centralized Vs. Distributed Locality-based Repository over IoT-Enabled Big Data in Smart Grid Environment

  • Siddiqui, Isma Farah;Abbas, Asad;Lee, Scott Uk-Jin
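    • Proceedings of the Korean Society of Computer Information Conference
    • /
    • Proceedings of the 55th Winter Conference of the Korean Society of Computer Information (2017), Vol. 25, No. 1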
    • /
    • pp.75-78
    • /
    • 2017
  • This paper compares centralized and distributed repositories for big data solutions in an IoT-enabled Smart Grid environment through operational and network analysis. The comparative analysis shows that a centralized repository consumes less memory, while a distributed locality-based repository reduces network complexity issues relative to a centralized repository in a state-of-the-art Big Data solution.

Analysis of the Current Status of Data Repositories in the Field of Ecological Research

  • Kim, Suntae
    • Proceedings of the National Institute of Ecology of the Republic of Korea
    • /
    • Vol. 2, No. 2
    • /
    • pp.139-143
    • /
    • 2021
  • In this study, data repository information registered in re3data (re3data.org), a research data registry, was collected. Based on the collected data, the current status was analyzed for 354 repositories in the ecological field (approximately 14% of all repositories), identified using keywords suggested by two experts. Major metadata formats used to describe data in ecological research data repositories include Federal Geographic Data Committee Content Standard for Digital Geospatial Metadata (FGDC/CSDGM), Dublin Core, ISO 19115, Ecological Metadata Language (EML), Directory Interchange Format (DIF), Darwin Core, Data Documentation Initiative (DDI), and DataCite Metadata Schema. By country, the number of ecological repositories is 102 in the US, 34 in Germany, 31 in Canada, and one in Korea. A total of 771 non-profit organizations and 12 for-profit organizations are involved in building research data repositories in the ecological field. The data version control ratio of the ecological research data repositories registered in re3data was found to be somewhat higher (86.6%) than the overall ratio (83.9%). The results of this study can be used to establish policies for building and operating a research data repository in the ecological field.

An Enterprise Repository System: Architecture and ERP Repository Case

  • 이희석;서우종;김태훈;이충석;손명호;백종명;손주찬;박성진
    • The Journal of Information Technology and Database
    • /
    • Vol. 7, No. 1
    • /
    • pp.1-15
    • /
    • 2000
  • A repository has been conceived as a critical weapon for managing organizational information resources. The system can help control the heterogeneous data in a variety of CASE (Computer-Aided Software Engineering) tools. However, current repository systems are limited in their ability to create a synergetic effect by integrating information resources. Therefore, it is important to develop an integrative repository system, called the Enterprise Repository System (ERS). This paper (i) defines ERS on the basis of a framework for repository systems, and (ii) suggests an ERS architecture and its detailed components. Finally, a real-life case of developing an ERP repository system is illustrated according to the proposed architecture and components. This illustration may demonstrate the usefulness of this research for developing an advanced repository system.

Functional Requirements for Research Data Repositories

  • Kim, Suntae
    • International Journal of Knowledge Content Development & Technology
    • /
    • Vol. 8, No. 1
    • /
    • pp.25-36
    • /
    • 2018
  • Research data must be testable. Science is all about verification and testing. To make data testable, the tools used to produce, collect, and examine data during the research must be available. Quite often, however, these data become inaccessible once the work is over and the results are published. Hence, information and the related context must be provided on how research data are preserved and how they can be reproduced. Open Science is the international movement for making scientific research data properly accessible to the research community. One of its major goals is building data repositories to foster wide dissemination of open data. The objectives of this research are to examine the features of research data, common repository platforms, and community requests for the purpose of designing functional requirements for research data repositories. To analyze the features of research data, we use data curation profiles available from the Data Curation Center of Purdue University, USA. For common repository platforms, we examine Fedora Commons, iRODS, DataONE, Dataverse, Open Science Data Cloud (OSDC), and Figshare. We also analyze requests from the research community. To design a technical solution that would meet public needs for data accessibility and sharing, we take the requirements of the RDA Repository Interest Group and the requests for the DataNest Community Platform developed by the Korea Institute of Science and Technology Information (KISTI). As a result, we specify 75 requirement items grouped into 13 categories (metadata; identifiers; authentication and permission management; data access, policy support; publication; submission/ingest/management, data configuration, location; integration, preservation and sustainability, user interface; data and product quality). We hope that the functional requirements set down in this study will be of help to organizations that are considering deploying or designing data repositories.

A Study on Strategies to Promote the Activation of Institutional Research Data Repositories in the Field of Science and Technology

  • 김예현;김지현
    • Journal of the Korean BIBLIA Society for Library and Information Science
    • /
    • Vol. 34, No. 3
    • /
    • pp.109-134
    • /
    • 2023
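  • The purpose of this study is to identify the current state of institutional research data repositories operated by research institutes in the field of science and technology and to propose strategies to promote their use. To this end, we conducted a literature review, case analysis, and interviews with managers of domestic and international institutional repositories, and proposed promotion strategies centered on establishing repository regulations and policies, improving awareness of research data sharing, and strengthening research data quality management. First, with regard to establishing repository regulations and policies, we found it necessary to elevate the status of the current regulation concerning research data, the National R&D Information Processing Standards (국가연구개발정보 처리기준), and to state the regulatory basis for repositories explicitly. Second, with regard to improving awareness of research data sharing, we proposed the need for general research data education and for identifying best practices. Third, with regard to strengthening research data quality management, we proposed the need for interaction among researchers, repository managers, and committees, for standardization work, and for preparation for long-term preservation.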

An Integrated Repository System with the Change Detection Functionality for XML Documents

  • 박성진
    • Journal of the Korea Academia-Industrial Cooperation Society
    • /
    • Vol. 10, No. 10
    • /
    • pp.2696-2707
    • /
    • 2009
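  • Although many DBMS vendors are extending their existing products to support XML, there is a separate need for a lightweight XML repository system that is independent of DBMS type and platform. This paper describes the design and implementation of an integrated XML repository system that supports the following functions. The implemented system generates the schema structure needed to store XML documents from an XML DTD, stores the data in database tables, allows XML documents to be generated freely through XMLQL (XML Query Language), and synchronizes duplicated XML documents. Because the same data can be duplicated across various XML documents in an XML repository, an efficient change detection technique is required to keep the duplicated documents consistent. This paper proposes a message-digest-based change detection technique to maintain consistency between client XML documents and the XML data in the repository.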

A Study on the Operation of a Collaborative Repository of the Regional Central Library: Focused on the Busan Metropolitan Library

  • 강은영
    • Journal of the Korean BIBLIA Society for Library and Information Science
    • /
    • Vol. 33, No. 3
    • /
    • pp.55-76
    • /
    • 2022
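  • The 3rd Comprehensive Library Development Plan (제3차 도서관발전종합계획) raises the need to secure space by establishing regional collaborative repositories, as collection storage has emerged as a common problem among public libraries. The Libraries Act and its Enforcement Decree likewise assign responsibility for the integrated management of regional library materials to regional central libraries. This study therefore took the Busan Metropolitan Library, a regional central library that operates a collaborative repository in earnest, as its subject and investigated the operational status of the collaborative repository and the perceptions of public librarians in the region regarding it. The data needed for the study were obtained through surveys, interviews, field investigation, and analysis of internal documents. The aim of the study was to provide basic data that will help operate the Busan Metropolitan Library's collaborative repository efficiently in the future, and to present reference material for operating collaborative repositories at regional central libraries in other regions.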

Database Modeling and Environmental Information for a Radioactive Waste Repository Site

  • Park S. M.;Rhee C. G.;Park J. B.;Lee H. J.;Kim Chang Lak
    • Nuclear Engineering and Technology
    • /
    • Vol. 36, No. 3
    • /
    • pp.263-275
    • /
    • 2004
  • For the safe management of nuclear facilities, including a radioactive waste repository, data about the facility site and the surrounding environment must be collected and managed systematically. This is particularly true for a radwaste repository, which has to be institutionally controlled for a long period after closure. The objectives of this study are (1) to establish a systematic management plan for information about a radwaste repository site and its environment, and (2) to design a database management program for this information, based on the Relational Database Management System (RDBMS). The spatial data are modeled with the geodatabase, a new RDBMS-based object for managing spatial information related to the database. To meet this requirement, a new program called 'Site Information and Total Environmental data management System (SITES)' is being developed. The scope produced in the first step of the present study for the development of SITES is introduced. The database is designed to combine spatial and attribute data, and to support the establishment of a Geographic Information System (GIS). The hardware and software systems are designed with consideration given to the total data management of the items within the radioactive environment.