DataPLANT NFDI – recommendation for funding

On 18th May 2020 we received the recommendation for funding of the DFGs NFDI panel of experts on the DataPLANT proposal. The BioDATEN Science Data Center is both involved in the role of fundamental plant researchers and in the role of service providers. In this decision, which has yet to be confirmed by the GWK (Gemeinsame Wissenschaftskonferenz) on 26th June 2020, DataPLANT was recommended for funding along with eight other consortia. For BioDATEN we are pleased that we have been able to successfully evolve our vision together with the other applicant institutions in the DataPLANT proposal. Further feedback on the proposal and the final decision is announced in August 2020 at the latest. As nine consortia are to be funded instead of the seven originally announced, it cannot yet be said whether this will have an impact on the planned budget. The planning will be accordingly cautious.

In order to advance the information of the community about the DataPLANT activities, there is now an initial website available. Further, the DataPLANT speakers have set up a Twitter account @nfdi4plants. At the same time, the DataPLANT speakers have joined cross-cutting activities and are participating, for example, in the exchange on metadata in the NFDI at the beginning of July 2020.
The advice of the NFDI expert panel was based on the results of the evaluation and the comments of the applicants on the evaluation results. In order to be able to adequately evaluate the applications reviewed in seven expert colloquia, the NFDI expert panel also paid special attention to the framework for setting up the NFDI in a scientifically guided procedure.

(DvS, JK, HG)

BioDATEN participating in the state wide RDM WG meeting (12.05.)

Representatives of the Science Data Center BioDATEN took part in the RDM Working Group (AK FDM) online meeting on 12th May 2020 and contributed to the discussion both on quality assurance and governance of research data.
The long-term character of research data and the various parties involved in the process create new challenges in research organisations with a high level of fluctuation in personnel. The researchers who created, collected and processed data may have left the research institution and projects were officially completed long ago. Many projects are also confronted with questions regarding copyright, handling of personal data and appropriate licenses for research data. Responsibility for the data must therefore be exercised in a regulated manner at any point in the data lifecycle by appropriate entities. After using service providers for storing and processing data during the project period, other entities can get involved after the project ends like subject-specific repositories or university publication services such as dark archives. The potentially perpetual storage of data after project completion is accompanied by responsibilities whose assignment should be clarified. Part of the handling can be derived from data management plans, other parts could be defined in research data policies of the scientific institution.

Research data management also affects aspects of good scientific practice. With the objective of achieving the greatest possible transparency and openness while at the same time considering the prudent handling of sensitive data, this requires clear regulations and agreements. For example, questions of copyright and the handling of personal data should already be clarified at the start of a research project or project in general. The same goes for the granting of suitable licenses for research data taken over by storage facilities, like repositories or dark archives, or data generated by researchers.
BioDATEN as a science data center should therefore help it's community by suggesting guidelines and promote their implementation. These include the question of data release and regulations comparable to publication contracts with the University Library. Increasingly, an (institutionalised) RDM is becoming a prerequisite for the approval of grants for new research projects. In order to ensure that this does not remain a one-sided process, both individual researchers and the research institutions and its entities, such as research groups, institutes or CRCs, must be included in the governance considerations for the RDM.

In the following, a document with guidelines for practical aspects of the RDM will be compiled for individual professorships and individual working groups as well as for larger collaborative projects. These should contain clear regulations on costs and responsibility including the assignment of rights to the storage facility in cases such as data migration and deletion. Early planning is facilitated by filling out Data Management Plans. This makes it possible to plan costs and expenses for the infrastructure providers involved.


BioDATEN joined three working groups for cross-cutting topics

The Science Data Center BioDATEN is actively involved in the SDC's cross-cutting activities in Baden-Württemberg coordinated by the bw2FDM project. On Wednesday, 2nd April three online meetings were held which were dedicated to the topics of legal support, business models, and meta data. BioDATEN was present both in the role as contributor and consumer through various participants from Tübingen and Freiburg. In the discussion of the group, possible objectives of the legal working group got outlined, ranging from the creation of a discussion platform and a network for recommendations of action to the preparation of workshops, trainings, FAQs, checklists, handouts, knowledge bases, and model contracts. Further on, the activities in other states with regard to legal issues in RDM will be monitored. Of course, general advice with regard to legal issues in specific individual cases is problematic. The individual research institutions are primarily responsible to provide this. The working group reiterated the importance of cooperation of the technical and the legal sides. The next meeting will primarily deal with the legal implications of data sharing and reusing.

Business and operation models are definitely a relevant cross-cutting topic as many state sponsored projects and state-wide services share the same set of challenges including sustainable financing and the assignment of required personnel on non-existing permanent positions.
The BioDATEN project is aware of this as in the precursor ViCE projects which have also addresses aspects on operation and business models.
Theses aspects have already been elaborated but not solved. A certain base might be provided by the already existing cooperation framework existing in today's regulations (check for suitable legal entities and cooperation frameworks). Another option is the tight cooperation and coordination with ongoing NFDI activities like the DataPLANT NFDI for fundamental plant research for parts of the BioDATEN communities. For sustainable operation qualified staff plays a key role as well as the provisioning of storage systems for the data.

The third online meeting addressed the cross-cutting topic of metadata. Metadata are very important throughout the complete lifecycle of research data. While there are basic, descriptive metadata that are very likely to be shared across communities like author, DOI, affiliation etc. there are also very domain and research specific metadata like species, treatment, experimental design etc. Descriptive metadata are best handled by using established standards like DataCite. Research specific metadata are probably best coped with by using existing ontologies. Researches can than pick the label that describes their data best. Additionally, there are technical and process related metadata. To a large extend, both can be collected automatically during the data processing on HPC resources or by tools like FITS. The well established PREMIS framework is a good solution to capture technical metadata.
These four kinds of metadata must be containerised into one file. The METS framework offers such a container and is also well established.
A special role plays the usage of ORCID as a method to reference authors, contributors etc. Unlike institution-based identity management, ORCID is person-based and every researcher can update their affiliation and contact information. This comes in handy as soon as researchers leave their home institution and are no longer reachable under the institution’s email address.


Sponsored by: