GFBio aims to support individual researchers, research groups, and large projects in developing and implementing effective data management strategies for FAIR (Findable, Accessible, Interoperable, and Reusable) data, as part of good scientific practice (learn more about the FAIR principles and applicability here).
A well-structured Data Management Plan (DMP) clarifies how and what data will be created, processed, and documented. Its name means data archiving and publication regarding costs as well as access conditions for the scientific community and the public. In short, a DMP helps you to ask all the right questions concerning data management.
DMPs are increasingly required as a mandatory proposal part by research institutions and funding agencies. By writing a DMP, you save a lot of time in the long run, you are aware of potential data management challenges and obstacles, and you increase the transparency and the integrity of your work.
In this How-to, you learn about preparing a DMP with the GFBio Data Management Plan Tool (DMPT) and about the services GFBio offers to support you.
You have just heard about DMPs and now want to write your own? If you have never considered data management planning before, it might be good to get some general information.
Data management planning is the starting point in the data life cycle (DLC). Data management plans (DMPs) are ideally established at this point. But a DMP is not only created at the beginning of a project, but it is also a living document, and you should return to it at each step of the data life cycle. Ideally, the DMP should be revisited often throughout the lifetime of a project to ensure proper data documentation and management. You can think of a DMP as an addition to the data policy of a larger umbrella project.
Have a look at our training materials, in which you can find fact sheets and education modules: GFBio_Emodule_Propose_151123 (as .ppt file) on data management planning and other aspects of the data life cycle. Also, have a look at the best practices linked in our training materials.
In the following steps, you will learn about the most important aspects of data management planning and get in touch with the GFBio Data Management Plan Tool (DMPT). You can continue reading the How-To with or without using the DMPT in parallel.
The GFBio DMPT contains the most important questions that you should ask when starting with data management planning and is based on the DFG Guidelines on the Handling of Research Data in Biodiversity Research. It does not provide a ready-to-use DMP that you can just attach to your project proposal. Please use our personal DMP support (you will find a button at the end of the tool) to contact our experts.
It is all about the planning!
Collect general information about your project right at the beginning. Besides the project name and the people involved, you already give a first impression of the research data you will produce.
What is your main subject? - In the GFBio DMPT, you can choose between several categories from Algae & Protists to Zoology.
What type of project is it? - Perhaps you are doing Laboratory and/or Field Work.
The funding you are applying for may also influence the content or structure of your DMP. Some funders have special requirements for a DMP as a mandatory part of the proposal (e.g., Horizon 2020; more about Horizon here).
Funders, as well as institutions or publishers, may also have their own policies and guidelines for data management. You do not need to know all the details of these at the beginning of your data management planning, but it is important to know the requirements and to document them. If you are unsure which policies or guidelines you have to stick to, just let us know, and we can support you in identifying them.

When it is either text files, code, GIS, numeric and molecular data, or multimedia files – depending on your research questions and methods, you create different types of data. In environmental and ecological research, it is common to have more than one data type. Studies and projects are a complex composition of various methods. If you are an experienced researcher in your field, you might already have an idea of the data formats (e.g., .xlsx, .csv, .docx, .txt, .shp) you will create. In this case, you can specify your data types, even to the level of data formats. Please document data types and formats as detailed as possible when sending us your DMP support request. The more information we get, the better we can support you in your data management planning.
Data volume and the number of files you will create are important pieces of information regarding data management. They affect questions concerning backup strategies and techniques, as well as the publication of your research data. Standards, methodologies, and tools for data collection, data assurance, and data management may also provide important information for planning your data management. Are you part of a larger umbrella project and have to stick to conventions on standards, methodologies, and software tools? That’s fine, this information is very important and should be documented in your DMP.
Working in environmental sciences often implies the collection of physical objects, which in some cases have to be archived as well. With GFBio, you can submit and archive several kinds of objects related to your research data. By means of some basic questions on them (integrated in the DMPT), we can estimate if your physical objects can be archived in GFBio if you would like it so in the future. Your DMP should also contain a notice if you are working with sequence data, as it requires special expertise and has to follow certain standards.

Have you already thought about metadata?
Metadata is the description of your research data. They contain important information on the data content, collection, and processing (who, when, what, where, how, how often, etc.). Only by means of metadata, data portals like GFBio can browse, filter, and navigate through huge research data collections. They also make your data findable. There are certain metadata standards and defined terminologies that enable the integration of research data from different partners. You can, for example, use our VAT tool and intersect your own data sets with layers from GFBio data archives.
Are you already familiar with metadata standards and know how to handle them?
That's great, list all the standards you will comply with in your DMP.
You have no idea of what we are talking about, or you don't know yet what your metadata will look like?
Don't worry, just let us know, and we will find out together with you.
Also, have a look at our data life cycle fact sheet Describe.

In some cases, data cannot or should not be published immediately when submitting to an archive. One possible reason might be that you collect research data within the scope of your PhD thesis, and you have to publish your studies before making data available for others. In this case, you can define access restrictions for your research data. This means you can already submit your data to a public archive and get an identifier for them (e.g., DOI) – which makes them citable and reusable - even though you don't want your data to be published immediately.
Unlike your data, your metadata does not have any access restrictions. Thus, people will find the information about your research data in a search engine, but they cannot access the data themselves. Since metadata should always contain information about contact persons – which will be YOU in the given case – people will be able to contact you if they are interested in your research. When the access restriction expires, the data will also become available.
You might also collect sensitive data, which is subject to ethical issues or legal restrictions and requirements. Personally identifiable information, Red List Species, and genetic resources have to be handled according to specific legal requirements and may not easily be published. If you feel uncertain about legal requirements, we support you in identifying them.
Conscientious data management implies licensing your data. GFBio supports the idea of open access to research data. But open access does not mean everyone can use your data at will. Data can (or must) be cited in the same manner as publications. Licenses, such as the Creative Commons licenses, define citation demands as well as further terms of use, for example, whether the data may be transformed.

Design a plan for data backup. Which technologies in which locations will be used to store your data? Are there regular backup routines, or will you back up manually? Who is responsible for data backup, and which service providers (e.g., local IT support) are involved?
Find a suitable long-term archive for your data. An advantage of submitting your research data to GFBio is that we can spread your data over our data centers. Each of our data centers has certain fields of specialization on which you can build.
Think of a submission plan. When will your data be submitted to its long-term archive(s)? Remember, at this point, you already have the final and quality-assured version of your data sets. You can submit data linked to an article at the same time as the article’s publication. Alternatively, you can continuously submit data during a project runtime (e.g., in defined time intervals) or submit all datasets at the end of the project. And you can also define any other reasonable submission plan for your project data." There is no one-size-fits-all plan for data submission; it always depends on your research conditions.
Find more information on data preservation in our data life cycle fact sheets Preserve and Submit.

You can contact us anytime and ask your questions. In case of DMP support requests, we recommend filling in the DMPT as far as possible and then sending a support request (you will find a button at the end of the tool). We will then contact you to talk about open questions. In your support request, you can also inform us of any questions you have or the specific aspect of the DMP for which you require further information and support.

Verwandte Artikel erscheinen hier basierend auf den Stichwörtern, die Sie auswählen. Klicken Sie, um das Makro zu bearbeiten und Stichwörter hinzuzufügen oder zu ändern.
|