|
NFDI4Biodiversity already provides a wide variety of useful tools for early project data mobilization, many of which go beyond the initial stages of the data life cycle. Data provision can be handled by BEXIS2 and RightField opens the door for researchers, who have not yet dismissed the spreadsheet but want to implement the FAIR principles into their work. The Biodiversity Workbench and BioMe on the other hand provide powerful frameworks to manage data, from their collection until the end of the project, providing necessary functionality at the early stages that will be relevant when data will be published or shared eventually. However, these tools are not yet incorporated into the aforementioned RDC NFDI4Biodiversity aims to establish.
| Service | Short Description | Personas/Target Groups | Requirements | Kommunikationskanäle | Lizens | Link |
|---|---|---|---|---|---|---|
| ISA Wizard | The ISA Wizard is a tool developed within use cases of NFDI4Biodiversity and FAIRagro, and partly in collaboration with DataPlant. Its primary purpose is to facilitate the creation of datasets structured according to the ISA (Investigation, Study, Assay) data model, which is widely recognized for organizing life science experiments. The ISA Wizard achieves this through an intuitive questionnaire interface, complemented by file upload functionality, allowing users to systematically collect and annotate metadata. One of the key strengths of the ISA Wizard is its configurable questionnaire system, which ensures that the tool remains domain agnostic. This flexibility allows it to be used across a variety of scientific disciplines without modification to the core software. Domain-specific requirements, such as the Minimum Information About a Plant Phenotyping Experiment (MIAPPE) standard for plant phenotyping, can be seamlessly integrated by adjusting the questionnaire configuration. This approach enables tailored data mobilization workflows while maintaining interoperability and standardization. Upon completion, users can export their curated datasets either in the ARC format, also including direct integration into the PLANTdataHUB platform, or as ISA JSON, supporting further data sharing and reuse. | Post-doc Paul | internet access | GitHub | MIT License | Repository: https://github.com/IPK-BIT/isa-wizard Life Deployment: https://ipk-bit.github.io/isa-wizard/ |
| PLANTDataHub and ARCitect | The NFDI consortium DataPlant focuses on establishing a sustainable research data management for the German plant research community by providing digital and in-person services. Based on the Annotated Research Context (ARC) framework, DataPlant provides a central reference implementation called PLANTDataHub. PLANTDataHub can be used collaboratively for data storage, management and sharing. It can handle large datasets and is developed with research data management in mind, i.e. increasing FAIRness of the data by providing structured and transparent data and metadata storage, incorporating data analysis within the platform and conforming to international Research Data Management (RDM) standards concerning the digital objects (e.g. RO-Crate FAIR digital objects). Additionally ARC is an open-source and community-driven initiative keeping the framework adaptable and relevant within the community. In combination with the ARCitect to create and manage ARCs and then synchronize them with the database, DataPlant provides a powerful tool for early data mobilization and beyond. PLANTDataHub can be used as a data provider and as FAIR data storage. | Post-doc Paul | https://nfdi4plants.github.io/nfdi4plants.knowledgebase/arcitect/installation/ | GitHub | https://www.nfdi4plants.org/arc-data-hub/ | |
| NOMAD Lab | The NFDI consortium FAIRmat specializes in RDM for material sciences. One of their services is the NOMAD Lab which combines the functionalities of a data repository and archive, an encyclopedia and an analytics toolkit divided within their service subcategories NOMAD Oasis and the public NOMAD services. As a tool for early data mobilization the repository can be used as a data provider, the encyclopedia as a tool for data availability and exploration and the analytics toolkit, which has evolved into a artificial intelligence toolkit, can be useful in accessing the available data in NOMAD. | Post-doc Paul | internet access | Discord | Not applicable | https://nomad-lab.eu/nomad-lab/ |
| ClOWM | The cloud based workflow manager is a tool hosted by NFDI4Microbiota to integrate scientific workflows (curated, written in Nextflow DSL) with data storage, hpc components and a user friendly interface as an open-source software. The provision of standardized workflows is beneficial along the whole data life cycle and can be helpful with early data mobilization, by creating awareness for the available tools and standard operating procedures. | Post-doc Paul | internet access | support@clowm.bi.denbi.de | Not applicable | https://clowm.bi.denbi.de/ |
| OMERO | OMERO is a cross-platform client-server software platform for visualizin, managing and annotating scientific image data. This includes the archival of images and the export to a number of formats. OMERO provides rights and role management. As a tool for early data mobilization OMERO takes on the role of an ELN specificially for image data and accompanies the scientists from the data collection to the data publication, enhancing FAIR data requirements and reproducibility. | Post-doc Paul | https://omero.readthedocs.io/en/stable/sysadmins/index.html | GitHub | GNU GPL | https://www.openmicroscopy.org/omero/scientists/ |
The overall goal of the NFDI is to increase awareness for the importance of research data and research data management and to establish infrastructure, services and tools to ultimately create workflows and automated processes for researchers within and beyond different scientific disciplines. Nomad and the PLANTDataHub work adjacent to the idea of NFDI4Biodiversity’s Research Data Commons, by providing infrastructure that combines several aspects from different phases of the data life cycle. NOMAD and the PLANTDataHub are already usable for scientists from the respective disciplines. Experiences from the development, initialisation and subsequent community acceptance of these services can be useful in establishing and further developing the Research Data Commons platform within the discipline of biodiversity science. ELNs are a tool to overhaul the way data is traditionally collected and processed. This entails the development of suitable software, the allocation of storage space on (locally) hosted servers for the collected data that are available long term, secure and maintainable, but also the availability of input devices which, especially in biodiversity sciences, need to be able to withstand fieldwork and laboratory conditions. Currency ELNs remain individual solutions for local institutions. These are just a few highlights of services other consortia provide, mainly for their own community. Not mentioned here are knowledge bases many consortia provide to explain and guide users regarding research data management. The project base4NFDI tries to consolidate a lot of services from the different consortia into single access points but does not yet focus on tools concerning early data mobilization.
Of the 16 states of Germany, only two do not have a dedicated state initiative working on research data management, support and infrastructure. Only 6 of those offer an ELN, often in direct cooperation with a local university where the instance is hosted. Additionally a variety of services, ranging from certification courses, Research Data Management (RDM) consultations, events, Repositories and many more are provided. The visibility of these state initiatives also varies.
forschungsdaten.info is a portal for RDM related knowledge. It bundles information about the state initiatives, the NFDI consortia, international infrastructures, their tools and services, as well as some basic introductory information regarding RDM. The website is maintained by a national team of RDM specialists.
The UAG Schulung und Fortbildung of the DINI/nestor AG Forschungsdaten provides a comprehensive workshop concept (meant to teach RDM trainers), which is modular, expandable and can easily be adapted to teach researchers, whose main focus will not be on teaching RDM, but on using RDM within their research.
In support of students and early-career scientists who want to learn the basics of RDM we provide an open educational resource on the topic research data management (Selbstlerneinheit), containing valuable information on all stages of the data life cycle. This is the easiest entry point into RDM, as it will provide a basic understanding and be a guide for further reading. Additionally, NFDI4Biodiversity supports teachers and students of RDM with video series published on Youtube, with topics ranging from Basics of RDM, services provided and tutorials on how to use them, as well as the handling of data in general. The consortium provides GitHub Repositories usable as working environments for Jupyter, R and data validation in PANGAEA, and a Zenodo-Community where slides and other material is published for reuse.
NFDI4Biodiversity provides custom trainings for biodiversity related institutions, working groups and projects on the following topics:
We also provide yearly Seasonal Schools for PhD students, researchers, data collectors or data center staff members, offering a broad range of basic and advanced knowledge in the management of biodiversity, ecology and environmental data. The intensive courses include input presentations from experts and hands-on exercises and are tailored to the attendees prior expertise.
The seasonal schools are designed to create an open, collaborative environment that invites networking and knowledge sharing among participants and can therefore facilitate early data mobilization by furthering the participants knowledge on available tools and services, workflows and best practices.
All of these teaching and training materials and the information about the events can be found on the NFDI4Biodiversity website and its Knowledgebase.
| Service | Description | Persona | Link |
|---|---|---|---|
| NFDI4Microbiota Website | Website | Post-doc Paul | https://nfdi4microbiota.de/ |
| NFDI4Bioimage Website | Website | Post-doc Paul | https://nfdi4bioimage.de/home/ |
| NFDI4Earth OneStop4All | central web-based access point to all NFDI4Earth resources and services | Post-doc Paul | https://www.nfdi4earth.de/2facilitate/onestop4all |
| Materialsammlung | Database providing teaching material for RDM | Data Manager Doro | https://rs.cms.hu-berlin.de/uag_fdm/pages/home.php?login=true |
DALIA | Database providing links to teaching material for RDM | Data Manager Doro | https://dalia.education/en |
FAIRagro training content | Publication of teaching materials for agrosystems research | Data Manager Doro | https://zenodo.org/records/11148701 |
GFBio DMP Service | Online Tool to guide through the process of creating a Data Management Plan | Post-doc Paul Data Manager Doro | https://dmp.gfbio.org/ |
|