Speaker
Description
Research in the Helmholtz Association is carried out in inter- and multidisciplinary collaborations that span between its 18 independently operating research centers across Germany. Helmholtz digital infrastructure is institutional, and thus Helmholtz's research data and other digital assets are stored and maintained in independent data infrastructures, lacking visibility and accessibility. As a consequence their full value remaines unavailable to scientists, managers, strategists, and policymakers.
The Helmholtz Metadata Collaboration (HMC) is taking on this challenge by establishing a Helmholtz FAIR Data Space. As part of this, we develop the Helmholtz Knowledge Graph (Helmholtz KG) [1] as a lightweight interoperability layer that connects Metadata Helmholtz digital assets, which are stored in a decentralized manner. With this KG, we envision (1) providing better cross-organizational access to Helmholtz's (meta)data and information assets on an upper semantic level, (2) harmonizing and optimizing the related metadata across the association, and (3) forming a basis from which the semantic quality and the depths of metadata descriptions is improved and extended into domain and application levels.
In the initial phase, we focused on establishing a working system that (1) contains harvesting pipelines [2] as demonstrators, (2) a User Interface [3] to explore the data, and (3) a SPARQL endpoint [4] to query the graph. Currently, the system harvests and aggregates data from more than 30 data providers. We are further developing the code base to reach a higher maturity level and increase the scalability of the infrastructure in order to accommodate further resources in the future – namely all open data repositories and infrastructures. At the same time, we work on the data-level in order to harmonize metadata representation across Helmholtz. This establishes common standards for Helmholtz data providers and harmonizes metadata where it is stored. For this, HMC established unHIDE: the Unified Helmholtz Information and Data Exchange as a network between Helmholtz data and infrastructure providers.
In the presentation, we will show the status quo of the Helmholtz KG as well as future development avenues and potentials to join forces with infrastructure providers and users.
Refrences/Links
[1] Broeder, J. ; Preuss, G. ; D'Mello, F. ; Fathalla, S. ; Hofmann, V. ; Sandfeld, S. (2024) The Helmholtz Knowledge Graph: driving the Transition towards a FAIR Data Ecosystem in the Helmholtz Association; The Semantic Web: ESWC 2024, Springer Computer Science Proceedings. doi:10.34734/FZJ-2024-03156
[2] https://codebase.helmholtz.cloud/hmc/hmc-public/unhide
[3] https://search.unhide.helmholtz-metadaten.de/
[4] https://sparql.unhide.helmholtz-metadaten.de/
Acknowledgements
This work was supported by (1) the Helmholtz Metadata Collaboration (HMC), an incubator-platform of the Helmholtz Association within the framework of the Information and Data Science strategic initiative
In addition, please add 3 to 5 keywords.
Knowledge Graph, Semantic Interoperability, Infrastructure,
Please assign yourself (presenting author) to one of the following groups. | Data professionals who provide and maintain data infrastructure |
---|---|
For whom will your contribution be of most interest? | Data professionals and stewards |