HMC Conference 2025 | Workshop 2: STAMPLATE & the DataHub Digital Ecosystem: Towards a FAIR Research Data Infrastructure for Environmental Time-Series

11:00 - 13:00

Organizer: Dr Christof Lorenz, Karlsruhe Institute of Technology

Co-organizers: Dr Ulrich Loup, Forschungszentrum Jülich; Dr David Schäfer, UFZ; Dr Claas Faber, GEOMAR; Dr Mihir Rambhia, Hereon; Dr Nils Brinckmann, GFZ

Introduction: In environmental sciences, time-series data is crucial for monitoring environmental processes, validating earth system models, training data-driven methods, and understanding climate processes. However, a uniform standard and interface for making such data consistently available according to FAIR principles is still lacking. To address this, seven research centers from Helmholtz Earth & Environment initiated the HMC project STAMPLATE within the DataHub initiative. STAMPLATE aims to establish the Open Geospatial Consortium’s SensorThings API (STA) as the central interface, linking it to other community-driven tools and services to foster a digital ecosystem for environmental time-series data. Within STAMPLATE, we developed a thematic metadata profile for STA, enhancing the core data model with domain-specific information. STA has also been successfully integrated into tools for sensor metadata management, time-series management (TSM) systems, and an overarching (meta)data portal for data consolidation and visualization. Particular attention was given to the consistent description of data quality. To achieve this, we integrated the System for Automated Quality Control (SaQC) into our framework and extended the STA data model, enabling interoperable provision of quality information.

 

Workshop: This session provides an overview of our ecosystem, integrated services, and metadata schema, with hands-on tutorials for selected tools. The highlight will be a bring-your-own-data session, allowing participants to experiment with our tools, use SaQC for flagging their own data, and integrate the results into our infrastructures. Finally, the diverse applicability of our framework is demonstrated through use cases from different communities, such as the Boknis Eck and TERENO observatories. This tutorial is designed for researchers, technicians, and data professionals working with time-series data from any sensor system.

 

Main content:

-          Introduction to the digital DataHub ecosystem

-          STA as generic and modern interface for time-series data

-          Hands-on-tutorials of integrated tools and sub-systems

-          Bring-your-own-data session

 

Required previous knowledge for the hands-on and bring-your-own-data-sessions:

-          Basic skills with Python / Jupyter Notebooks

-          Some experience in data processing (e.g, Pandas, Xarray, etc.)

-          Basic knowledge about processing of JSON-data

 

Please note: This workshop is part of the HMC Conference 2025 taking place in Cologne from 12-14 May 2025. Participation must be in person and is only possible for people who are registered for the conference.

Starts
Ends
Europe/Berlin
Cologne