• Title/Summary/Keyword: Data Repository

Analysis of the Current Status of Data Repositories in the Field of Ecological Research

  • Kim, Suntae
    • Proceedings of NIE
    • /
    • v.2 no.2
    • /
    • pp.139-143
    • /
    • 2021
  • In this study, data repository information registered in re3data (re3data.org), a research data registry, was collected. Based on the collected data, the current status was analyzed for 354 repositories (approximately 14% of all repositories) identified using keywords in the ecological field suggested by two experts. Major metadata formats used to describe data in ecological research data repositories include Federal Geographic Data Committee Content Standard for Digital Geospatial Metadata (FGDC/CSDGM), Dublin Core, ISO 19115, Ecological Metadata Language (EML), Directory Interchange Format (DIF), Darwin Core, Data Documentation Initiative (DDI), and DataCite Metadata Schema. By country, the number of ecological repositories is 102 in the US, 34 in Germany, 31 in Canada, and one in Korea. A total of 771 non-profit organizations and 12 for-profit organizations are involved in building ecological research data repositories. The data version control ratio of ecological research data repositories registered in re3data was somewhat higher (86.6%) than the overall ratio (83.9%). The results of this study can be used to establish policies for building and operating research data repositories in the ecological field.

Development Procedure of Data Organization of Data Repositories for Construction Engineering Research Cyberinfrastructure (건설공학 연구의 사이버 인프라를 위한 데이터 저장소의 데이터 구성의 단계적 개발방법)

  • Lee, Chang-Ho
    • Journal of the Architectural Institute of Korea
    • /
    • v.36 no.10
    • /
    • pp.177-188
    • /
    • 2020
  • The cyberinfrastructure for construction engineering research provides construction engineering researchers and engineers with a research environment that includes a data repository, tools, and other computing services through the internet. As a main component of the cyberinfrastructure, the data repository stores research project data and supports data curation through data uploads and downloads. Since data curation naturally depends on how the data is organized in the repository, data organization is important for practically useful data repositories. This paper uses the notation of classes and attributes of a data model to discuss the procedural steps for developing an efficient data organization for data repositories such as the data depot of DesignSafe for natural hazards engineering. The development steps begin with defining the uses and the size of the data repository. The basic organization of the repository's main data is explored, and the data is then elaborated. After the usage of the data is evaluated against a number of evaluation criteria, the data organization is improved based on the evaluation results. These development steps are repeated in various possible sequences until an efficient data organization is finally developed for data repositories for construction engineering research.

Global Data Repository Status and Analysis: Based on Korea, China and Japan Data in re3data.org

  • Kim, Suntae
    • International Journal of Knowledge Content Development & Technology
    • /
    • v.8 no.1
    • /
    • pp.79-89
    • /
    • 2018
  • We collected and analyzed data from re3data.org, which is a global registry of data repository services. We analyzed the data profiles of three leading Asian economies (Korea, China, and Japan) against the reference data for other participating countries. In particular, we examined how individual countries contribute to the repository, organizational type, versioning and product quality management, and subject tagging. We conclude that all three Asian countries still fall short in terms of involvement. As for participating institutions, there are 7 from Korea, 64 from China, and 120 from Japan. Among Chinese organizations, 3 are for-profit, 61 are non-profit, and 37 organizations (which yields 1.8%) are involved in repository building. In Japan, 1 organization is commercial and 119 are non-profit, of which 57 (3.0%) are involved in repository building. All 7 organizations from Korea are non-profit, and 6 of them (0.3%) are involved in repository building. As regards versioning and product quality management, Korea, China, and Japan are on par with other countries. Subject analysis reveals that Korea contributes more to geosciences, Japan to physics and geosciences, while China, unlike Korea and Japan, is more active in life sciences. It is hoped that this study will help in planning domestic infrastructure for research data repositories with proper consideration for specific research domains and national characteristics.

Functional Requirements of Data Repository for DMP Support and CoreTrustSeal Authentication

  • Kim, Sun-Tae
    • International Journal of Knowledge Content Development & Technology
    • /
    • v.10 no.1
    • /
    • pp.7-20
    • /
    • 2020
  • For research data to be shared without legal, financial, and technical barriers in the Open Science era, data repositories must meet the functional requirements set by DMP and CoreTrustSeal. To derive functional requirements for the data repository, this study analyzed the Data Management Plan (DMP) and CoreTrustSeal, the criteria for certification of research data repositories. Deposit, Ethics, License, Discovery, Identification, Reuse, Security, Preservation, Accessibility, Availability, and (Meta) Data Quality, commonly required by DMP and CoreTrustSeal, were derived as functional requirements that should be implemented first when building data repositories. Confidentiality, Integrity, Reliability, Archiving, Technical Infrastructure, Documented Storage Procedure, Organizational Infrastructure, (Meta) Data Evaluation, and Policy functions were further derived from CoreTrustSeal. The functional requirements derived in this study may serve as key functions when developing a repository. They may also be used as key items when introducing repository functions to researchers who deposit data.

A Design of Data Repository for Software Maintenance Tools (소프트웨어 유지보수 도구를 위한 자료 저장소의 설계)

  • Choe, Eun-Man
    • The Transactions of the Korea Information Processing Society
    • /
    • v.2 no.2
    • /
    • pp.168-175
    • /
    • 1995
  • It has been commonly accepted for a while that a successful tool environment must provide smooth interfaces between its tools. Recent integrated CASE environments must be based on a well-integrated data repository that supports a tightly coupled, consistent environment. A maintenance tool requires a careful design of the maintenance chest's database. Information about a program exists in many different forms after analyses have been performed. It must be possible to associate and select objects for the data repository from this information as necessary. This paper suggests a new integrated scheme for the data repository in building a software maintenance environment. The scheme provides many basic services, including storage and management of objects/entities and links/relations, version and configuration control, a query service, and a data interchange service.

The SAN for Web Warehousing: An Alternative Data Repository (웹 웨어하우징을 위한 신개념의 저장장치 전용네트워크)

  • Soongoo Hong
    • The Journal of Society for e-Business Studies
    • /
    • v.7 no.3
    • /
    • pp.93-103
    • /
    • 2002
  • The combination of data warehousing and Internet technology produces a new concept: web warehousing. Due to the availability of web technologies and the need to make prompt decisions with timely information, web warehousing is emerging as a key strategic business weapon. Yet despite the many promising benefits of web warehousing, researchers have also identified several challenges, including scalability and availability. With the rise of the Internet and data-centric computing applications, new Storage Area Network (SAN) technology has been spotlighted as a possible new data repository for web warehousing. In this article, the two new concepts of web warehousing and storage area networks are introduced. In particular, a SAN is discussed in detail as an alternative data repository to overcome the current limitations of web warehousing.

A study of the STEP-based Data Repository and P&ID-3D CAD Model Connected Pilot System at Nuclear Power Plant (원전 대상의 STEP 기반 데이터 저장소 및 P&ID와 3차원 CAD 모델 연계에 관한 연구)

  • 안호준;조광종;박찬국;한순홍;안경익;최영준
    • Proceedings of the Korea Society for Energy Engineering kosee Conference
    • /
    • /
    • pp.395-400
    • /
    • 2004
  • This study develops a STEP-based data repository for the Reactor Coolant System of the APR1400 nuclear power plant. The STEP-based data repository is accessed through the Web and provides attribute data for Reactor Coolant System equipment. A P&ID drawing file and a 3D CAD model of the Reactor Coolant System are also loaded. The P&ID drawing file of the Reactor Coolant System equipment model is connected with the 3D CAD model file. This prototype system connecting 2D and 3D CAD models confirms the real layout of the Reactor Coolant System.

A Clinical Data Repository as an Autonomous Agent (자동 에이전트 기법을 활용한 임상문서저장시스템)

  • Kim, Hwa-Sun;Houshiaryan, Kambiz;Cho, Hune
    • The Transactions of the Korean Institute of Electrical Engineers D
    • /
    • v.54 no.12
    • /
    • pp.732-736
    • /
    • 2005
  • In general, a hospital information system (HIS) should provide interoperability and operate independently of other HIS. This study proposes a new HIS paradigm that can be implemented with a standard HL7 interface engine and a clinical data repository (CDR). We have developed an alternative architecture relying on agent solutions with distributed queries to heterogeneous databases. This architecture creates a very fine and flexible repository that can handle queries on the basis of the standard HL7 messaging structure. Deploying agent solutions to manage autonomy of storage management and sociality for communication with the open world is another feature that keeps this system from reinventing existing wheels in medical informatics. This study is the first attempt to construct a CDR for a private clinic. We used the information stored in the clinical patient record system of a private internal medicine hospital, which uses a relational database. We ran searches while increasing the number of data entries from 1,000 to 10,000 in steps of 1,000. According to the results, the experimental CDR showed highly efficient performance above 6,000 entries. In the future, the CDR can be further extended to share clinical information among private hospitals estranged from EHR (Electronic Health Records).

Construction of Construction Drawing Data Repository using Ontology (온톨로지를 이용한 건축 도면데이터 레포지터리 구성)

  • Lee, Hui-Jae;Yoo, Sang-Bong;Kim, In-Han
    • The Journal of Society for e-Business Studies
    • /
    • v.9 no.3
    • /
    • pp.79-94
    • /
    • 2004
  • W3C has developed the RDF standard for utilizing ontology in Web applications. This paper presents the extraction, storage, and application of ontology for product data. The management and document information included in DWG files is used as an example. By analyzing the relationships among the drawing data, the RDF schema is designed first. Based on the schema, ontology is extracted and stored in XML files. As an application of the stored ontology, we can reconfigure the sitemap of drawing data repositories. In this example, users can select the view they are interested in (e.g., designer, document, project). With such varied views of a drawing data repository, users can access specific data more effectively.

Registry Metadata Quality Assessment by the Example of re3data.org Schema

  • Kim, Suntae;Choi, Myung-Seok
    • International Journal of Knowledge Content Development & Technology
    • /
    • v.7 no.2
    • /
    • pp.41-51
    • /
    • 2017
  • Nowadays, research data repositories (RDR) have become progressively widespread all over the world. To expand repository services and build up an inbound linking strategy, organizations list their repositories with so-called Global Registries. Accordingly, such registries should be carefully described by the related data. In this study, I explore the metadata schema of re3data.org. I collect and analyze descriptions from the listed repositories, and come up with some suggestions concerning possible improvements to the metadata schema. To accomplish this, I develop a crawler program, which collects the necessary data from re3data.org. Based on the analysis results, I have identified two issues where required elements are missing, one issue where a required element value is missing when the corresponding property is applied, five inconsistency issues with the re3data controlled vocabulary, six issues with undescribed optional elements, and two inconsistency issues between elements and their attributes, which do not pair with each other. I believe this discussion can facilitate improvements to the existing re3data.org schema and further help researchers who analyze data repository trends.