With new specialisations such as Data Science driven by digitisation, the efficiency potential of digital transformation is becoming apparent in both empirical research and data governance processes. Here, one challenge is to establish open and interoperable datasets, recognising the FAIR criteria (cf. Wilkinson et al., 2016) as the standard for that process. Data – as well as metadata – should comply with...
In an ever-changing world, field surveys, inventories and monitoring data are essential for predicting biodiversity responses to global drivers such as land use and climate change. This knowledge provides the basis for appropriate management. However, field biodiversity data collected across terrestrial, freshwater and marine realms are highly complex and heterogeneous. The successful...
The Collaborative Research Centre AquaDiva is a large collaborative project spanning a variety of domains, such as biology, geology, chemistry and computer science, with the common goal of better understanding the Earth’s critical zone, in particular, how environmental conditions and surface properties shape the structure, properties, and functions of the subsurface. Within AquaDiva, large volumes...
As an important method and output of research, software should follow the RDA "FAIR for Research Software Principles" (FAIR4RS). In practice, this means that research software, whether open, inner or closed source, should be published with rich metadata to enable FAIR4RS. For research software practitioners, this currently often means following an arduous and mostly manual process of software...
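As a concrete illustration of such rich software metadata, the sketch below writes a minimal codemeta.json record. CodeMeta is an existing community schema for software metadata; the software name, version and field selection here are purely hypothetical.

```python
# Minimal sketch: a codemeta.json-style software metadata record.
# CodeMeta is a real community schema; all field values below are
# hypothetical placeholders, not an actual software publication.
import json

codemeta = {
    "@context": "https://doi.org/10.5063/schema/codemeta-2.0",
    "@type": "SoftwareSourceCode",
    "name": "example-analysis-tool",          # hypothetical name
    "version": "0.1.0",
    "license": "https://spdx.org/licenses/MIT",
    "author": [{"@type": "Person", "givenName": "Jane", "familyName": "Doe"}],
    "codeRepository": "https://example.org/repo",  # placeholder URL
}

with open("codemeta.json", "w") as f:
    json.dump(codemeta, f, indent=2)
```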
In autumn 2021, the Helmholtz Metadata Collaboration (HMC) concluded its first HMC Community Survey to get in touch with Helmholtz's research communities. The survey aimed to characterize community-specific research data management and data publication practices, as well as the related gaps and needs expressed by these communities. For this purpose, we developed a question...
How can a computer understand relations between data or objects from the real world? Ontologies are semantic artifacts that capture knowledge about their domain of interest in a machine-understandable form. The main goal of developing ontologies is to formalize the concepts and relations through which humans express meaning, and to use them as a communication interface to machines. Thus,...
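To make this concrete, here is a minimal sketch of machine-understandable statements expressed as RDF triples with the Python library rdflib; the namespace and all terms are invented for illustration.

```python
# Minimal sketch: expressing domain knowledge as RDF triples with rdflib.
# The namespace and all terms are invented for illustration only.
from rdflib import Graph, Literal, Namespace
from rdflib.namespace import RDF, RDFS

EX = Namespace("http://example.org/aquifer#")  # hypothetical namespace

g = Graph()
g.add((EX.GroundwaterSample, RDF.type, RDFS.Class))        # a concept
g.add((EX.sample42, RDF.type, EX.GroundwaterSample))       # an instance
g.add((EX.sample42, EX.temperatureCelsius, Literal(9.5)))  # a relation

print(g.serialize(format="turtle"))  # machine- and human-readable output
```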
Semantic interoperability is one of the major challenges in implementing the FAIR principles [1] for research data. This is especially relevant for interdisciplinary projects, where people from different but related disciplines may use technical terms with differing meanings. Established vocabularies and semantic standards can harmonize domain-specific language and facilitate common...
In geodisciplines such as the cryosphere sciences, a large variety of data is available in repositories hosted on platforms such as PANGAEA. In addition, many computational process models exist that capture various physical, geochemical, or biological processes at a wide range of spatial and temporal scales and provide corresponding simulation data. A natural thought is to...
This application case for implementing and using the FAIR Digital Object (FAIR DO) concept aims to simplify the use of label information when composing Machine Learning (ML) training data.
Image data sets curated by different domain experts usually have non-identical label terms. This prevents images with similar labels from being easily assigned to the same category. Therefore, using the images...
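A minimal sketch of the underlying idea, with invented label terms: heterogeneous labels are mapped onto one canonical term from a shared vocabulary before images are grouped into training categories.

```python
# Illustrative sketch only: the synonym table below stands in for a
# shared vocabulary; it is not a real FAIR DO registry.
SYNONYMS = {
    "grain": "particle",
    "speck": "particle",
    "fibre": "fiber",
}

def canonical_label(term: str) -> str:
    """Map a curator-specific label term onto its canonical form."""
    t = term.strip().lower()
    return SYNONYMS.get(t, t)

raw_labels = ["Grain", "speck", "fiber"]
print({t: canonical_label(t) for t in raw_labels})
# {'Grain': 'particle', 'speck': 'particle', 'fiber': 'fiber'}
```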
The International Generic Sample Number (IGSN) is a unique and persistent identifier for – originally – geological samples. Recently, interest has grown in making the IGSN available for more sample types from further scientific communities in the Earth and Environment (E & E) domain. The IGSN Metadata Schema is modular: the mandatory registration schema is complemented by the IGSN Description Schema...
Get your hands dirty with semi-structured metadata in HMC’s remote training course “Fundamentals of scientific metadata: why context matters”!
Have you ever struggled to make sense of research data provided by a collaborator - or even to make sense of your own data 5 months after publication? Do you see difficulties in meeting data description requirements of your funding agency? Do you...
Biomolecules, such as DNA and RNA, provide a wealth of information about the distribution and function of marine organisms, and biomolecular research in the marine realm is pursued across several Helmholtz Centers. Biomolecular metadata, i.e. DNA and RNA sequences and all steps involved in their creation, exhibit great internal diversity and complexity. However, high-quality (meta)data...
The Helmholtz Metadata Collaboration (HMC) promotes the use of metadata in Research Data Management as a means of achieving data findability, accessibility, interoperability, and reusability (FAIR). These in turn enable or optimize software functionalities essential to automated research processes, such as multi-, inter- and transdisciplinary indexing and retrieval, versioning, provenance...
One of the prerequisites for FAIR data publication is the use of FAIR vocabularies. Currently, tools for the collaborative composition of such vocabularies are missing. For this reason, a universal manual and software for user-friendly vocabulary assembly are being composed in the HMC-funded MetaCook project. The project includes 4 separate test cases from 4 labs across KIT and Hereon, which...
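For illustration, a vocabulary entry of the kind such tools produce could look like the following SKOS sketch; rdflib and SKOS are real, while the namespace, concept and labels are invented.

```python
# Illustrative sketch: one SKOS vocabulary concept built with rdflib.
# SKOS is a real W3C standard; the namespace and concept are invented.
from rdflib import Graph, Literal, Namespace
from rdflib.namespace import RDF, SKOS

VOC = Namespace("http://example.org/vocab/")  # hypothetical namespace

g = Graph()
g.bind("skos", SKOS)
g.add((VOC.viscosity, RDF.type, SKOS.Concept))
g.add((VOC.viscosity, SKOS.prefLabel, Literal("viscosity", lang="en")))
g.add((VOC.viscosity, SKOS.altLabel, Literal("dynamic viscosity", lang="en")))
g.add((VOC.viscosity, SKOS.definition,
       Literal("A fluid's resistance to flow.", lang="en")))

print(g.serialize(format="turtle"))
```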
Improving research data management practices is both an organizational and a technical challenge: even within the same research field, (meta)data is often created, stored and processed in an ad hoc manner. This results in a lack of clear structure and standardization, and makes the metadata “unFAIR”. We present two tools that assist scientists in their research workflows to enrich, structure and...
Making research data reusable in an open and FAIR [1] way is part of good scientific practice and is increasingly becoming part of the scientific workflow. Where and how "FAIR" research data is published alongside a research paper is often not tracked by research institutes. In a pilot project of the Helmholtz Metadata Collaboration (HMC) Hub Matter, we developed an approach to automatically...
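One plausible building block for such automated tracking is the public DataCite REST API, sketched below; the endpoint is real, but the query fields and affiliation string are assumptions for illustration, not the pilot project's actual approach.

```python
# Hedged sketch: querying the public DataCite REST API for datasets.
# The endpoint exists; the query syntax and affiliation value below
# are illustrative assumptions.
import requests

resp = requests.get(
    "https://api.datacite.org/dois",
    params={
        "query": 'creators.affiliation.name:"Helmholtz"',  # assumed field
        "resource-type-id": "dataset",
    },
    timeout=30,
)
resp.raise_for_status()

for record in resp.json()["data"][:5]:
    attrs = record["attributes"]
    titles = attrs.get("titles") or [{}]
    print(attrs["doi"], "-", titles[0].get("title", "<untitled>"))
```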
Researchers in many fields rely on complex data from specialized instruments and large numbers of experiments. Metadata is key to efficiently documenting and describing data’s essential attributes, and helps to generate overviews of large datasets. Manually collecting and curating the extensive amounts of metadata required – some of which might even be inaccessible – is a major challenge. To support...
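As a toy example of what automated collection can replace, the sketch below harvests basic file-level metadata (name, size, timestamp, checksum) without manual input; the chosen fields are an assumption, not a specific tool's schema.

```python
# Toy sketch: automatically harvesting file-level metadata.
# The field selection is an illustrative assumption.
import hashlib
import json
from datetime import datetime, timezone
from pathlib import Path

def extract_metadata(path: Path) -> dict:
    """Collect basic descriptive metadata for a single file."""
    stat = path.stat()
    return {
        "filename": path.name,
        "size_bytes": stat.st_size,
        "modified_utc": datetime.fromtimestamp(
            stat.st_mtime, timezone.utc
        ).isoformat(),
        "sha256": hashlib.sha256(path.read_bytes()).hexdigest(),
    }

print(json.dumps(extract_metadata(Path(__file__)), indent=2))
```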
riaf is a repository infrastructure to accommodate files. It enables data to be held in accordance with the FAIR principles (see also fair-principles).
riaf is designed to enable provenance and reproducibility of research data in the early part of the data life cycle, i.e....
This poster presents the new HMC project Metamorphoses (“Metadata for the merging of diverse atmospheric data on common subspaces”). The project will develop enhanced standards for storage-efficient decomposed arrays, and tools for the automated generation of standardised Lagrange trajectory data files, thus enabling optimised and efficient synergetic merging of large remote sensing data sets....
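To give a flavour of standardised trajectory output (not the project's actual format), the sketch below writes a small CF-style file with xarray; "featureType: trajectory" is a real CF convention attribute, while the variables and values are invented.

```python
# Illustrative sketch: a tiny CF-style trajectory file written with xarray.
# featureType="trajectory" is a real CF attribute; the variable layout and
# values are invented and not Metamorphoses' actual standard.
import xarray as xr

ds = xr.Dataset(
    data_vars={
        "lat": ("obs", [52.0, 52.1, 52.2]),           # degrees north
        "lon": ("obs", [13.0, 13.1, 13.2]),           # degrees east
        "pressure": ("obs", [1000.0, 950.0, 900.0]),  # hPa
    },
    coords={"obs": [0, 1, 2]},
    attrs={"Conventions": "CF-1.8", "featureType": "trajectory"},
)
ds.to_netcdf("trajectory_example.nc")  # requires a netCDF backend
```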
Simulation is an essential pillar of knowledge generation in science. The numerical models used to describe, predict, and understand real-world systems are typically complex. Consequently, applying these models by means of simulation often poses high demands on computational resources, and requires high-performance computing (HPC) or other dedicated hardware architectures. Metadata describing...
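A minimal sketch of such descriptive metadata for a single simulation run might look as follows; every field and value is an illustrative assumption rather than an established schema.

```python
# Minimal sketch: recording metadata for one HPC simulation run.
# All fields and values are illustrative assumptions.
import json
import platform
from datetime import datetime, timezone

run_metadata = {
    "model": "example-climate-model",   # hypothetical model name
    "model_version": "1.2.0",
    "parameters": {"resolution_km": 10, "timestep_s": 600},
    "hardware": {"architecture": platform.machine(),
                 "nodes": 64},          # assumed allocation
    "started_utc": datetime.now(timezone.utc).isoformat(),
}

print(json.dumps(run_metadata, indent=2))
```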
Scientific image data sets can be continuously enriched with labels describing new features that are relevant for a specific task. This process can be automated by means of Machine Learning (ML) techniques. Although such an approach shows clear advantages, especially when applied to large datasets, it also poses an important challenge:
Relabeling image data sets curated by different...