Functional Requirements for Research Data Repositories

  • Kim, Suntae (Korea Institute of Science and Technology Information)
  • Received : 2018.02.12
  • Accepted : 2018.02.27
  • Published : 2018.03.31


Research data must be testable. Science is all about verification and testing. To make data testable, tools used to produce, collect, and examine data during the research must be available. Quite often, however, these data become inaccessible once the work is over and the results being published. Hence, information and the related context must be provided on how research data are preserved and how they can be reproduced. Open Science is the international movement for making scientific research data properly accessible for research community. One of its major goals is building data repositories to foster wide dissemination of open data. The objectives of this research are to examine the features of research data, common repository platforms, and community requests for the purpose of designing functional requirements for research data repositories. To analyze the features of the research data, we use data curation profiles available from the Data Curation Center of the Purdue University, USA. For common repository platforms we examine Fedora Commons, iRODS, DataONE, Dataverse, Open Science Data Cloud (OSDC), and Figshare. We also analyze the requests from research community. To design a technical solution that would meet public needs for data accessibility and sharing, we take the requirements of RDA Repository Interest Group and the requests for the DataNest Community Platform developed by the Korea Institute of Science and Technology Information (KISTI). As a result, we particularize 75 requirement items grouped into 13 categories (metadata; identifiers; authentication and permission management; data access, policy support; publication; submission/ingest/management, data configuration, location; integration, preservation and sustainability, user interface; data and product quality). We hope that functional requirements set down in this study will be of help to organizations that consider deploying or designing data repositories.



Supported by : Korea Institute of Science and Technology Information (KISTI)


  1. Boukhari, I., Jean, S., Ait-Sadoune, I., & Bellatreche, L. (2018). The role of user requirements in data repository design. International journal on software tools for technology transfer, 20(1), 19-34.
  2. Brown, G. W., Ouimet, P. P., Robinson, D. T., & Zoller, T. (2017). Understanding Entrepreneurship: Facilitating Academic Research with a Shared Data Repository.
  3. DataONE. (n.d). Retrieved from
  4. Dataverese. (n.d). Retrieved from
  5. Fedora Commons. (n.d). Retrieved from
  6. Figshare. (n.d). Retrieved from
  7. Gibbons, S. (2009). Benefits of an institutional repository. Library Technology Reports, 40(4), 11-16.
  8. Harvey, M. J., McLean, A., & Rzepa, H. S. (2017). A metadata-driven approach to data repository design. Journal of Cheminformatics, 9(1), 4.
  9. iRODS. (n.d). Retrieved from
  10. Johnson, A. E., Stone, D. J., Celi, L. A., & Pollard, T. J. (2017). The MIMIC Code Repository: enabling reproducibility in critical care research. Journal of the American Medical Informatics Association, 25(1), 32-39.
  11. Keyes, D., Palanisamy, M., DiFranco, D. E., & Chin, D. M. (2017). U.S. Patent No. 9,779,261. Washington, DC: U.S. Patent and Trademark Office.
  12. OpenScienceDataCloud (OSDC). (n.d). Retrieved from
  13. RDA. (n.d). Retrieved from
  14. RDA: Functional Requirements for Research Data Repository Platforms. (n.d). Retrieved from
  15. Science. (n.d). Retrieved from
  16. Witt, M., Carlson, J., Brandt, D. S., & Cragin, M. H. (2009). Constructing data curation profiles. International Journal of Digital Curation, 4(3), 93-103.