Speaker
Description
Computer simulations are an essential pillar of knowledge generation in science.
Understanding, reproducing, and exploring the results of simulations relies on tracking and organizing metadata describing numerical experiments.
However, the models used to understand real-world systems, and the computational machinery required to simulate them, are typically complex, and produce large amounts of heterogeneous metadata.
Here, we present general practices for acquiring and handling metadata that are agnostic to software and hardware, and highly flexible for the user.
These consist of two steps: 1) recording and storing raw metadata, and 2) selecting and structuring metadata.
As a proof of concept, we developed a Python tool to help with the second step, and use it to apply our practices to distinct high-performance computing use cases from hydrology and neuroscience.
Our practices and the tool support sustainable numerical workflows, facilitating reproducibility and data reuse in generic simulation-based research.
Acknowledgements
The project was funded by the Helmholtz Metadata Collaboration (HMC) ZT-I-PF-3-026, EU Grant 945539 (HBP), Helmholtz IVF Grant SO-092 (ACA), and Joint lab SMHB. Compute time was granted by VSR computation grant JINB33, Jülich. The work was carried out in part within the HMC Hub Information at the Forschungszentrum Juelich.
Preferred form of presentation | Poster & advertising flash talk |
---|---|
Topic area | Simulator technology and performance |
Keywords | Good practices; Metadata handling; Simulation workflows |
Speaker time zone | UTC+2 |
I agree to the copyright and license terms | Yes |
I agree to the declaration of honor | Yes |