deRSE24 - Conference for Research Software Engineering in Germany

Name: deRSE24 - Conference for Research Software Engineering in Germany
Start: 2024-03-05T08:30:00+01:00
End: 2024-03-07T17:45:00+01:00
Location: Julius-Maximilians-Universität Würzburg

Mar 5 – 7, 2024

Julius-Maximilians-Universität Würzburg

Europe/Berlin timezone

Contact

derse24@listserv.dfn.de

Refactoring and isolation data pipelines through the use of software containerization and continuous integration

Not scheduled

20m

Mathematisch-Naturwissenschaftliches Hörsaalgebäude (Julius-Maximilians-Universität Würzburg)

Mathematisch-Naturwissenschaftliches Hörsaalgebäude

Julius-Maximilians-Universität Würzburg

Am Hubland 97074 Würzburg

Poster Computational Workflows Poster Session

Mr Benjamin Bruns (Forschungszentrum Jülich GmbH)

At the IAS-8 institute of Forschungszentrum Jülich, the accurate and complete collection of measurement and environmental data is essential for subsequent analyses and modeling in many projects. Although the Bayeos server (https://github.com/BayCEER/bayeos-server) used at FZJ provides an open and standardized data platform for such data, the import and transformation of data from different sources is often difficult in terms of provision, traceability and subsequent adjustments. To address this problem, a flexible import and transformation pipeline for time series data was developed based on Python and a PostgreSQL-based integration database. There is a clean separation of import, transformation and aggregation processes, which also allows for easy customization. Each individual step of the defined pipeline runs as a container in a Docker environment. There is a template for a basic pipeline, which can be easily customized to define additional pre- and post-processing steps. This template has been successfully adapted for different existing data pipelines. Once this has been done, the containers are built automatically using the CI/CD pipeline of the DevOps platform Gitlab. In addition, Gitlab's own container registry ensures easy deployment and updating of the pipeline elements.

Mr Benjamin Bruns (Forschungszentrum Jülich GmbH)

There are no materials yet.

deRSE24 - Conference for Research Software Engineering in Germany

Contact

Refactoring and isolation data pipelines through the use of software containerization and continuous integration

Mathematisch-Naturwissenschaftliches Hörsaalgebäude

Julius-Maximilians-Universität Würzburg

Speaker

Description

Primary author

Presentation materials

Choose timezone

deRSE24 - Conference for Research Software Engineering in Germany

Contact

Speaker

Description

Primary author

Presentation materials