Skip to content

sara-nl/dmp_feature_matrix

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

41 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Data Management platform features and evaluation criterion

Author: Sharif Islam
System Expert
Data Services Unit,
SURFsara (provides services in the field of supercomputers, networks, data storage and high-quality visualization)

*Work in Progress

The goal of this project is to evaluate various tools and software that can play a role in data management platform solution(s). The primary community we support is the Dutch research and educational institutes thus some of these ideas came are based on specific use cases we encountered. We hope to make this document suitable for a wider audience. Our goal is to think about both general and specific use cases and then highlight the components and strengths of each tool. Each use case and research data management need are unique and requires various customizations. Therefore, a comprehensive data management platform will include a variety of solutions and customizations.

Background and motivation

Data Management or Research Data Management (RDM) in particular is a broad term that encapsulates activities and tools that are responsible for planning and executing actions on digital data before, during, and after a research project (1). We refer to these different stages as research data life cycle and in each stage there are variety of different challenges for data management. There are also different terms and concepts employed often times for same sets of activities. Keeping these in mind we came up with a set of abstraction layers that support the various stages. Combining these layers and tools thus often times results in different data management solutions.

In the commercial/non-academic business world, "Data Warehouse" and "Data Lake" are two prominent terms. Some of the tools we evaluated came from that world. We are interested in the functionalities that fit our requirements and use cases regardless of the term.

Data Management Abstraction and Layers

  • Storage Abstration Layer
  • Data Virtualisation / Data Source Discovery Layer
  • Data Movement Layer
  • Fedreation Layer
  • Consuming/Publishing Layer
  • Policy
  • Metadata

Data Management Layers

This project contains:

  1. Feature list
  2. software/product list
  3. Detail test results/product review (if exists and possible)

Reference

  1. https://www.dtls.nl/fair-data/research-data-management/research-data-management/

Acknowledgement

  • Christine Staiger, SURFsara
  • Arthur Newton, SURFsara
  • Terrell Russell, iRODS Consortium / RENCI