Industry needs and how to address them in a collaborative data ecosystem
In recent years, industry needs have evolved from the simple delivery of materials and parts to a growing demand for reliable and easily accessible metadata. Drawing on concrete examples from the aerospace supply chain, the presentation will underline the importance of...
Advanced catalysts are key to sustainable energy, reducing emissions, and improving resource efficiency. However, the synthesis of novel catalysts usually involves a unique blend of scientific methods, precise catalyst formulations, and the empirical knowledge of scientists. Additionally, the wide variety of techniques performed at different beamlines in synchrotron radiation facilities, along...
Using RDF is a natural choice for modelling semantically linked metadata for FAIR research data. However, the learning curve for RDF is steep, and even for data stewards, becoming familiar with all the relevant technicalities can be a major barrier. Therefore, ULB Darmstadt is heavily involved in developing and providing services that facilitate the creation and use of semantic metadata,...
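As a minimal sketch of what such semantic metadata can look like in practice, the snippet below describes a dataset with the Python rdflib library using DCAT and Dublin Core terms. All URIs and values are hypothetical placeholders, not an actual ULB Darmstadt service or record.

```python
# Minimal RDF metadata sketch with rdflib; every URI and value below is a
# hypothetical placeholder used only for illustration.
from rdflib import Graph, Literal, URIRef
from rdflib.namespace import DCAT, DCTERMS, RDF

g = Graph()
dataset = URIRef("https://example.org/datasets/xrd-run-42")  # hypothetical PID

g.add((dataset, RDF.type, DCAT.Dataset))
g.add((dataset, DCTERMS.title, Literal("XRD measurement, sample 42")))
g.add((dataset, DCTERMS.creator, URIRef("https://orcid.org/0000-0000-0000-0000")))
g.add((dataset, DCTERMS.license, URIRef("https://creativecommons.org/licenses/by/4.0/")))

print(g.serialize(format="turtle"))  # emit the graph as human-readable Turtle
```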
Enriching data with descriptive metadata is the key enabler for the reusability and interoperability of experimental results, and thus for further research in a scientific domain. However, in order to use data from former scientific work (both initial data and result data from experiments), a common understanding of the semantics of this data is essential. This understanding is typically...
The focus of this project is the development of a standardized metadata vocabulary, essential for creating interoperable and easily discoverable data products across various research groups. By examining space weather-specific data products and formats, the project addresses the need for consistent metadata standards that will enhance collaboration and data sharing on an international scale....
For data to be fully exploitable and re-usable in different contexts it needs to be annotated with rich metadata that uses commonly understood vocabularies and semantics [1]. Using terminology that is standardized and agreed upon within a community ensures unambiguous understanding of metadata.
In the field of EM, a number of application-level initiatives independently started developing...
Metadata is a key element of data management with regard to the FAIR (findable, accessible, interoperable, and reusable) principles, answering the need for better data integration and enrichment. In the field of high-intensity laser-plasma physics, numerical simulations and experiments go hand in hand, complementing each other. While simulation codes are well documented and output...
As the volume of omics single-cell data continues to grow, so too must our data management and processing capabilities to ensure its effective secondary use, particularly in research and diagnostics. While single-cell data holds immense potential for AI applications, current documentation standards fall short of being AI-ready. To address these challenges, we organized a Writathon, resulting...
The study of climate change and its impact on marine environments requires large-scale, multidisciplinary data that are often collected by various national and marine institutes, fishery associations, as well as by research groups. With the proliferation of underwater observatories, profilers, and autonomous underwater vehicles (AUVs), significant progress has been made in collecting...
Currently, social scientists use different and sometimes proprietary software to analyse data, which processes metadata in diverse ways. Data formats of statistical software packages are only partially compatible and pose an obstacle to replication studies. Proprietary data formats jeopardise the requirement for interoperability enshrined in the FAIR principles. As part of KonsortSWD, we...
The electronic structure determines many of the macroscopic physical properties of a material. Photoelectron momentum microscopy (MM) has matured into a powerful tool for the detailed characterization of the exciting electronic properties of novel quantum materials. By applying the principles of high-resolution imaging, modern instruments simultaneously capture hundreds of tomographic slices of...
The German Human Genome Phenome Archive (GHGA) is a national infrastructure that promotes the secure storage, exchange, and management of access-controlled human omics data. To facilitate user-friendly and comprehensive data submissions, we developed the GHGA metadata model. The standardized model aims at maximizing the amount of collected metadata on the submitter side, enabling reusable...
At the Helmholtz-Institute Freiberg for Resource Technology (HIF), researchers develop new technologies to improve the circular economy. In this context, different types of samples (e.g. rock samples, recycling material) play an important role. A sample passes through different states and labs, starting with sample preparation, through the analysis of the particular sample, to the final...
Biomolecules, such as DNA and RNA, provide a wealth of information about the distribution and function of marine organisms, and molecular sequencing data from the marine realm is generated across several Helmholtz Centers. Biomolecular (meta)data, i.e. DNA and RNA sequences and all steps involved in their creation, exhibit great internal diversity and complexity. However, high-quality...
The Nuclear, Astro, and Particle Metadata Integration for eXperiments (NAPMIX) project was recently awarded funding within the scope of the OSCARS call on open science and will start in December 2024. The project aims to facilitate data management and data publication under the FAIR principles on the European level by developing a cross-domain metadata schema and...
Introduction: The environment plays an important role in human health, and efficient linkage of epidemiological cohorts with environmental data is crucial to quantify human exposures. However, there are no harmonized standards for the automatic mapping of metadata across our three domains: Health (HMGU), Earth & Environment (UFZ), and Aeronautics, Space & Transport (DLR).
Objective: We aimed to...
There has been a substantial increase in the number of scientific publications across diverse disciplines. These publications often generate metadata, scholarly content, scientific models, source code, etc. Though such information is made available to research communities under open science initiatives, numerous scholarly repositories have emerged over the years to harvest metadata in various...
Here we present the latest updates of our data-driven approach to monitoring and assessing the state of open and FAIR data in the Helmholtz Association. The approach consists of two parts: a modular pipeline for data harvesting, validation, and assessment, and a dashboard with interactive statistics about the identified Helmholtz data publications. The dashboard provides insight into which data...
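As a rough sketch of what a single harvesting-and-validation step can look like (the actual pipeline's sources, filters, and checks are not shown here), the snippet below queries the public DataCite REST API and flags harvested records that lack a license or creator. The query string and page size are illustrative assumptions.

```python
# Sketch of one harvesting step against the public DataCite REST API.
# The query and page size are illustrative, not the pipeline's real settings.
import requests

resp = requests.get(
    "https://api.datacite.org/dois",
    params={"query": "publisher:Helmholtz*", "page[size]": 5},
    timeout=30,
)
resp.raise_for_status()

for item in resp.json()["data"]:
    attrs = item["attributes"]
    # Basic completeness check: does the record carry a license and a creator?
    has_license = bool(attrs.get("rightsList"))
    has_creator = bool(attrs.get("creators"))
    print(attrs["doi"], "license:", has_license, "creator:", has_creator)
```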
Knowledge Graphs help to connect and organize information from different sources and entities. They can be used to apply advanced search and filtering techniques on very large datasets and reveal connections and dependencies across the data. To be useful, however, they require highly uniform and harmonized data sets. So far, most knowledge graphs on scientific data have used bibliographic data...
The vast amount of observations needed to train new generation AI models (Foundation Models) necessitates a strategy of combining data from multiple repositories in a semi-automatic way to minimize human involvement. However, many public data sources present challenges such as inhomogeneity, lack of machine-actionable data, and manual access barriers. These issues can be mitigated through the...
The aim of a cooperation between the DDI Alliance and QualidataNet - a network for qualitative data that is being created as part of the NFDI - is to describe qualitative data in a standardized way so that researchers can find it and use it for their own research, regardless of discipline and thematic location.
Since last year, QualidataNet has been involved in the metadata...
Scientists frequently need to get an overview of their experiments by summarizing information spread over multiple files and storage locations. This metadata may include items such as experimental conditions, subject details, and characteristics of the experimental data. It is common for researchers to spend time developing their own solutions tailored to their specific use case. However,...
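A generic sketch of such an ad-hoc solution follows below, assuming a directory tree of experiments with JSON sidecar files named metadata.json; both the layout and the field names are assumptions for illustration.

```python
# Walk a (hypothetical) "experiments" directory, collect JSON sidecar
# metadata, and print a plain-text overview table of all experiments found.
import json
from pathlib import Path

rows = []
for meta_file in Path("experiments").rglob("metadata.json"):
    with open(meta_file) as fh:
        meta = json.load(fh)
    rows.append({
        "path": str(meta_file.parent),
        "subject": meta.get("subject", "n/a"),      # assumed field name
        "condition": meta.get("condition", "n/a"),  # assumed field name
    })

for row in rows:
    print(f"{row['path']:40s} {row['subject']:15s} {row['condition']}")
```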
The CREATIVE project aims to make the generic repository RADAR4KIT easily accessible and attractive for the domain-specific communities organized in the Climate and Environment Centre (CEC) at the Karlsruhe Institute of Technology (KIT). This aim will be achieved with the help of customized templates and input masks for subject-specific metadata, which enhance the RADAR4KIT usability for the...
Embedding semantics within research metadata serves to standardize, refine and contextualize it, thereby improving interoperability between data sources and promoting the FAIR principles. Within the Helmholtz Association, we are committed to evaluating existing semantic resources and established practices and to developing guidelines for their handling and use in the field of earth and...
In agrosystem science, the transition to a FAIR (Findable, Accessible, Interoperable, Reusable) data future is essential for fostering innovation and collaboration. While technical developments provide the necessary infrastructure, the true challenge lies in changing ingrained habits and cultural practices. To address this, the FAIRagro initiative has developed a participation concept aimed at...
The rapid evolution of research software necessitates efficient and accurate metadata management to ensure software discoverability, reproducibility, and overall project quality. However, manually curating metadata can be time-consuming and prone to errors. This poster presents two innovative tools designed to streamline and improve metadata management: fair-python-cookiecutter and...
Enriching data with metadata is a key concept for the data output of scientific research to be FAIR. Data processing software and custom code often do not support metadata annotation out of the box, or the usage process does not mandate it. This confronts data creators and maintainers with the challenge of annotating their data. From a Human Machine Interface (HMI)...
Research data management (RDM) is an important aspect of modern scientific research, which relies heavily on interconnected data sets and corresponding metadata. For modeling and integrating these interconnections and metadata, the Resource Description Framework (RDF) has often been proposed as a standard, since it has been in use by search engines and knowledge management systems for...
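As a brief illustration of how RDF-modelled metadata can then be queried, the sketch below runs a SPARQL query with rdflib; the input file and the use of Dublin Core titles are hypothetical assumptions.

```python
# Query interconnected RDF metadata with SPARQL via rdflib.
# "metadata.ttl" is a hypothetical file of previously modelled metadata.
from rdflib import Graph

g = Graph()
g.parse("metadata.ttl", format="turtle")

query = """
PREFIX dcterms: <http://purl.org/dc/terms/>
SELECT ?dataset ?title WHERE {
    ?dataset dcterms:title ?title .
}
"""
for dataset, title in g.query(query):
    print(dataset, "->", title)
```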
In our increasingly digital and interconnected world, the integration of Persistent Identifiers (PIDs) in metadata is essential for machine-readable and machine-understandable metadata, as also described in the FAIR Guiding Principles for research data management. PIDs provide unique, permanent and machine-readable references to various types of digital objects, including publications, datasets,...
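As a small illustration of what machine-actionable PIDs enable, the sketch below resolves a DOI to machine-readable metadata via standard content negotiation at doi.org; the DOI itself is a hypothetical placeholder.

```python
# Resolve a DOI to machine-readable metadata via content negotiation.
# The DOI below is a hypothetical placeholder, not a real record.
import requests

doi = "10.1234/example-dataset"
resp = requests.get(
    f"https://doi.org/{doi}",
    headers={"Accept": "application/vnd.citationstyles.csl+json"},
    timeout=30,
)
resp.raise_for_status()
record = resp.json()
print(record.get("title"), "-", record.get("publisher"))
```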
Software is an important research output. Therefore, funding agencies are interested in the value that software contributes to the overall results of a funded project. The Helmholtz Association is working towards a system to evaluate data and software publications. The "Task Group Helmholtz Quality Indicators for Data and Software Publications" has already published a vision paper about how...
The Sample Environment Communication Protocol (SECoP) provides a generalized way for controlling measurement equipment – with a special focus on sample environment (SE) equipment [1,2]. In addition, SECoP holds the possibility to transport SE metadata in a well-defined way.
SECoP is designed to be
- simple to use,
- inclusive concerning different control systems and control philosophies...
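As an illustration of the protocol's line-based, human-readable message style, here is a minimal client sketch in Python. The host, port, and the module/parameter name "t1:value" are assumptions for demonstration; the exact message and reply formats are defined by the SECoP specification and vary per node.

```python
# Minimal SECoP client sketch: SECoP exchanges plain-text, line-based
# messages over TCP. Host, port, and module "t1" are assumptions.
import socket

with socket.create_connection(("localhost", 10767)) as sock:
    fh = sock.makefile("rw", newline="\n")

    fh.write("*IDN?\n")              # ask the node to identify itself
    fh.flush()
    print(fh.readline().strip())     # identification string of the node

    fh.write("read t1:value\n")      # request the current value of module t1
    fh.flush()
    print(fh.readline().strip())     # e.g. a reply carrying value + qualifiers
```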
The complexity and diverse data requirements of energy system research demand a robust and adaptable metadata standard. The OEMetadata Standard, with its recent update to version 2.0, is designed to meet the needs of this transdisciplinary field. Illustrated through practical examples, the key features and enhancements of the standard are presented, followed by the introduction of an...
RSpace is an open-source platform that supports researchers in the active research phase to plan, conduct, and document their work, and thereby make their research more robust and FAIR (Findable, Accessible, Interoperable, Reusable). Interoperability with tools and services used by researchers throughout the research lifecycle is a fundamental element of RSpace's development philosophy....
Different roles interact with research data in very different ways: technicians, experimental scientists, data analysts, modellers, supervisors, infrastructure providers, data stewards, toolchain providers, project managers, administrative personnel, librarians, publishers, NFDI contact persons, indexing service providers, external data users, programmers,... None of them can establish an...
When gathering your analog research data and metadata, including experimental parameters that are challenging to digitize, with the aim of creating a knowledge graph, we suggest the following pipeline for achieving high data quality: agree on a shared vocabulary, expand it into an ontology, and eventually semantically annotate the recorded data.
To facilitate this pipeline we developed and use the...
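As a minimal sketch of the first pipeline step, the snippet below records an agreed vocabulary term as a SKOS concept with the Python rdflib library; the namespace and the term are hypothetical examples, not part of the actual tooling.

```python
# Capture an agreed vocabulary term as a SKOS concept with rdflib.
# The namespace and the term "annealingTemperature" are hypothetical.
from rdflib import Graph, Literal, Namespace
from rdflib.namespace import RDF, SKOS

VOCAB = Namespace("https://example.org/vocab/")  # assumed shared namespace

g = Graph()
term = VOCAB["annealingTemperature"]
g.add((term, RDF.type, SKOS.Concept))
g.add((term, SKOS.prefLabel, Literal("annealing temperature", lang="en")))
g.add((term, SKOS.definition,
       Literal("Temperature at which a sample is annealed.", lang="en")))

# Later pipeline steps would promote such concepts into ontology classes
# and properties and use them to semantically annotate the recorded data.
print(g.serialize(format="turtle"))
```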
The PATOF project builds on work at the MAMI particle physics experiment A4. A4 produced a stream of valuable data for many years, which has already yielded scientific output of high quality and still provides a solid basis for future publications. The A4 data set consists of 100 TB and 300 million files of different types (hierarchical folder structure and file format with minimal metadata provided...
At GEOMAR, a multidisciplinary research centre, a large number of heterogeneous biological and geological samples need to be managed: among other requirements, their metadata and data need to be stored in a FAIR way, their provenance information as well as their physical location in the sample storage need to be available, and scientists need to be supported in organizing their sample...
The SEPIA project aims to improve the management and annotation of research data by providing a comprehensive sample database integrated with an open API. This initiative facilitates the capture and exchange of sample metadata, thereby enriching the research data collected at the Helmholtz-Zentrum Berlin (HZB). This presentation will explore the architecture and functionalities of the SEPIA...
A library is a super repository of digital and physical data archives, organized by metadata. This metadata, however, may be distributed across various databases due to, for example, grouping by topic or type. To provide a unified view or overview of all resources, the metadata needs to be aggregated, normalized, and potentially interconnected. DatAasee is such a metadata...
Interoperability is an ongoing challenge given the diverse nature of research and the tools and services researchers use. Addressing interoperability challenges and FAIRification of research at scale is therefore only possible with solid knowledge about the tools and services used in each stage of the research cycle and a forward-facing vision of how they might work together.
For the...
The DataPLANT consortium, part of the German National Research Data Infrastructure (NFDI), aims to provide plant researchers with a robust and sustainable infrastructure for managing research data. Since the complexity of research data continues to grow, effective methods for managing, annotating, and sharing this data become increasingly important. DataPLANT integrates different established concepts for...
The collection and use of sensor data is crucial for scientists monitoring and observing the Earth's environment. In particular, it enables the evaluation of real natural phenomena over time and is essential for the validation of experiments and numerical simulations. Assessment of data quality beyond statistics requires knowledge and consideration of the sensor state, including operation and...
Heritage science is an interdisciplinary field that involves the scientific study of cultural and natural heritage. It entails collecting and producing a wide variety of data, including descriptions of objects and sites, samples, sampling locations, scientific instrumentation, analytical methods, conservation and restoration records, environmental monitoring data, documentation, and digital...
Predicting the performance of aerospace and automotive structures requires detailed reflection of the actual manufacturing process of each produced part. This is especially the case for composite structures produced with additive manufacturing processes, in view of their process complexity and its influence on product reliability. For high-fidelity numerical models to reflect the actual...
The microstructure of materials is characterized by crystallographic defects, which ultimately determine the material properties. In computational materials science, methods and tools are used to predict and analyze defect structures. The increase of computational power has led to the generation of large amounts of complex and heterogeneous data, increasing the need for the implementation of...
FAIR Research Data Management in interdisciplinary large-scale projects is very challenging. Data formats, acquisition processes, and infrastructure are highly heterogeneous. Furthermore, many tasks in FAIR RDM are tedious and complex for the researchers. In this keynote, we will discuss the potentials of generative AI to support FAIR RDM on examples from a large-scale project.
We will discuss the world of interoperable semantics at both domain-specific and application-wide levels, focusing on how DMPonline has pioneered enhancements and integrations that promote seamless data exchange and usage across diverse research contexts.
Join us in understanding how DMPonline's developments in interoperable semantics improve data management and use across various domains. We...
After a brief introduction to the FAIR principles and the significance of automated assessments, participants will engage in hands-on sessions where they will compare the outputs of these tools on a curated list of datasets. The list represents datasets from various repositories that are typical within the biomedical context. Both a generalized overview of FAIR screening results at the...
In environmental sciences, time-series data is crucial for monitoring environmental processes, validating earth system models and remote sensing products, training data-driven methods, and better understanding climate processes. However, even today, there is no uniform standard and interface for making such data consistently available according to the FAIR principles. Therefore, within...
In this interactive session, we will consolidate our talk on creating FAIR, rich, and shared experimental (meta)data with a knowledge graph in mind. We will present the individual tools of the software workflow live and interactively, starting from vocabulary terms via ontologies to entering research (meta)data and sending it to another Electronic Lab Notebook (ELN).
A prerequisite for FAIR...