Data Processing with Pandas
Online Event

Europe/Berlin
Fredo Erxleben (Helmholtz-Zentrum Dresden-Rossendorf), HIFIS Noreply Bot (Functional Account)
Description

General Information

Language: English
Level: Intermediate
Cost: Free

Content

This course gives a hands-on introduction to the data processing framework Pandas.

The framework is written in Python and is very popular in data science thanks to its wide range of functionality and its usability.

The workshop starts with a fundamental introduction. Afterwards, a guided exercise lets participants gain experience at their own pace, with the instructors available for quick feedback and advice.

Required Tools/Knowledge

  • Basic knowledge of the Python language (variables, functions, loops, conditions)
  • A working Python setup (the instructors will use Thonny for the workshop, but you can use whatever suits you)
  • The pandas package should be installed
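
One way to check the last requirement before the workshop is a short script like the one below. It is only a sketch: the helper name `pandas_is_installed` is made up for illustration, and the suggested install command assumes that `pip` is available in your Python setup.

```python
import importlib.util


def pandas_is_installed() -> bool:
    """Return True if this Python interpreter can find the pandas package."""
    # find_spec returns None when no top-level package with that name is found
    return importlib.util.find_spec("pandas") is not None


if pandas_is_installed():
    import pandas as pd

    print("pandas version:", pd.__version__)
else:
    # Assumption: pip is available; other setups (e.g. conda) use their own installer
    print("pandas is not installed; try: python -m pip install pandas")
```

Thonny users can also install packages through the package manager in Thonny's Tools menu instead of using the command line.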

Registration

To register for the course, please log in via the Helmholtz AAI (Step-by-step Instructions). If you do not have an account yet, the AAI will guide you through the setup process.

Registration period: 2023-02-13 to 2023-02-19

    • 09:00 09:15
      Welcome and Introduction 15m
    • 09:15 10:30
      Theory
      • 09:15
        The Series Type 15m

        How to work with the data type to represent a series of data points

      • 09:30
        Data Frames 15m

        How to work with the data type to represent tabular data

      • 09:45
        Accessing Data 15m

        Learn about ways to retrieve the desired data from Series or DataFrames

      • 10:00
        Filtering Data 15m

        How to reduce given data collections to elements that fulfill the desired criteria

      • 10:15
        Modifying Data 15m

        How to change data in Series or DataFrames in various ways

    • 10:30 11:00
      Breakfast 30m
    • 11:00 11:15
      Questions & Answers 15m
    • 11:15 11:30
      Introduction to the Exercise 15m
    • 11:30 17:00
      Hands-on Session
      • 11:30
        Part 1 1h

        A warm-up to retrieve and load real-life scientific data

      • 12:30
        Lunch 1h
      • 13:30
        Part 2 1h 30m

        Cleaning and analyzing the retrieved data

      • 15:00
        Coffee 30m
      • 15:30
        Part 3 1h 30m

        Deriving new information and re-shaping the data to answer more complex queries
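
As a rough preview of the kind of operations named in the timetable, the sketch below touches each topic once. It is not taken from the course material: all data values, column names and variable names are invented for illustration, and the workshop itself may use different examples and methods.

```python
import pandas as pd

# Series: a labelled, one-dimensional sequence of data points (made-up values)
temperatures = pd.Series([12.5, 14.0, 9.5], index=["Mon", "Tue", "Wed"])

# DataFrame: tabular data with named columns (made-up measurements)
df = pd.DataFrame(
    {
        "station": ["A", "B", "A", "B"],
        "day": [1, 1, 2, 2],
        "temperature": [12.5, 14.0, 9.5, None],  # one missing value
    }
)

# Accessing data: by label with .loc, by position with .iloc
tuesday = temperatures.loc["Tue"]
first_row = df.iloc[0]

# Filtering data: keep only the rows that fulfil a condition
warm = df[df["temperature"] > 11.5]

# Modifying data: add a column computed from an existing one
df["temperature_f"] = df["temperature"] * 9 / 5 + 32

# Cleaning: drop the rows where the measurement is missing
clean = df.dropna(subset=["temperature"])

# Deriving new information: mean temperature per station
mean_per_station = clean.groupby("station")["temperature"].mean()

# Re-shaping: one row per day, one column per station
wide = clean.pivot(index="day", columns="station", values="temperature")

print(warm)
print(mean_per_station)
print(wide)
```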