organized in cooperation with Helmholtz Information & Data Science Academy (HIDA) and Helmholtz Federated IT Services (HIFIS)
Data Processing with Pandas and Visualization with Matplotlib
In this workshop you will learn how to employ the Python frameworks Pandas for loading, filtering and evaluating tabular data and matplotlib to generate paper-ready plots from this data. After an introduction of the basic concepts we will explore the necessary workflow from raw data to plot using a real-life dataset.
The course is taught in two alternating ways:
-
A live-coding lecture in which you will write the program along with your instructor while getting to know elements of the programming language and their use-cases
-
Hands-on exercises where you will solve posed tasks on your own, supported by the instructor for questions and feedback.
Learning Goals
By the end of the course, you will be familiar with the fundamental concepts of these frameworks and can employ them to create analysis and visualization scripts for your own datasets.
Prerequisites
To participate in this course, you need have a good understanding of the fundamental concepts of the Python programming language and be familiar with the programming tool you are using. Understanding of the following concepts is beneficial and helps to focus more on the core content:
-
Object-oriented Programming
-
Generator Expressions
-
Regular Expressions (Regex)
-
Assignment Expressions
Target Group
Learners of all academic fields who have to work with datasets: cleaning, investigating, modifying and plotting them.
Course Days & Times
Apr 7, 2025, 10 am - 4 pm
Apr 8, 2025, 10 am - 4 pm
Apr 9, 2025, 10 am - 2 pm
NOTE: Registration will open March 10, 2025, 12 pm.
Attendance & Certificates
The course content is coordinated, so we strongly recommend that you do not miss any part of the course. To receive a certificate we expect at least 80% attendance and active participation.
Registration & Cancellation
This course is open to individuals affiliated with Helmholtz or a HIDA Partner only. You may register for the course allocating yourself to one of the following groups:
- All Helmholtz affiliations
- Helmholtz Information & Data Science School (HIDSS) affiliation
- HIDA Partner affiliation
Please note that after the first two weeks of the registration period the unbooked seats from categories 2 and 3 will be opened for all Helmholtz affiliations (category 1).
Your registration for this course is binding. If you need to leave/miss the course for a period of time, please let us know in advance via hida-courses@helmholtz.de.
If you have to cancel the course for any reason, please do so as soon as possible to allow time for others to take your seat. To cancel, please withdraw your registration on the course site or write an email to hida-courses@helmholtz.de.
Additional Information
There is no waiting list for this course! If someone withdraws from a course, their place is automatically reopened. We therefore advise you to keep an eye on the registration in case the course is fully booked and you would like to attend. Also, this course will be offered again in the future - you can check our HIDA course catalog for updates.
This course is free of charge.