This is a list of well-documented training data sets covering different data types and different aspects of research data management.
What are training data sets: https://zenodo.org/records/13805722
Here, we are referring to data sets used in tutorials on research data management, as demo data sets for tools or methods, or as examples of challenges in data handling. This definition does not cover data sets used to train AI applications.
Training data sets help to illustrate all stages of the data life cycle (DLC), e.g.
Figure 1: Data life cycle. Source: RDMkit: The ELIXIR Research Data Management toolkit for Life Sciences. URL: https://rdmkit.elixir-europe.org
Stat2Data R package (https://cran.r-project.org/web/packages/Stat2Data/index.html)