sponsored by Helmholtz Information & Data Science Academy (HIDA)
in cooperation with Core Facility Statistical Consulting at Helmholtz Zentrum München - German Research Center for Environmental Health (Helmholtz Munich)
Basic Methods in Machine Learning
In the Basic Methods in Machine Learning course, we delve into the practical application of fundamental machine learning techniques for data analysis using Python. This course is designed for individuals who want to start using machine learning for data analysis, focusing less on traditional statistics and more on predictive modeling to classify data or predict outcomes. The course is taught interactively with live coding using Jupyter Notebook.
By the end of the course, you will be able to confidently select and utilize basic machine learning techniques, effectively interpret your findings, and apply them to real-world scenarios.
Topics:
-
Introduction to Machine Learning
-
Linear Regression
-
Classification Algorithms
-
Logistic regression
-
K-nearest neighbors
-
Decision Trees
-
-
Model Evaluation and Selection
-
Cross-validation
-
Performance metrics
-
Differences from "Introduction to Statistics" course:
-
Emphasis on predictive modeling rather than statistical inference.
-
Focus on understanding key ML terminology and practical applications.
-
Helmholtz Munich doctoral researchers cannot replace the mandatory course "Introduction to Statistics" with this course.
Methods:
The course consists of theoretical lessons on machine learning tools, how to apply machine learning techniques and how to evaluate the results. Theoretical lessons will be followed by hands-on examples with best-practice solutions in Python.
Learning Goals
1. Understand the Fundamentals of Machine Learning
- Define machine learning and its role in data analysis, distinguishing it from traditional statistical methods.
- Interpret the results of machine learning models and effectively communicate their insights for decision-making purposes.
2. Apply Linear Regression for Predictive Modeling
- Implement linear regression techniques to model relationships between variables and make predictions.
- Integrate automatic feature selection methods like shrinkage approaches.
3. Explore Classification Algorithms
- Apply and differentiate between logistic regression, k-nearest neighbors (KNN), and decision trees for classification tasks.
- Interpret the results of classification algorithms.
4. Evaluate and Select Models Effectively
- Use cross-validation techniques to assess model performance and select the best model for a given task.
- Understand and calculate performance metrics such as accuracy, precision, recall, and F1 score to evaluate classification models.
5. Implement Machine Learning Techniques Using Python
- Apply learned machine learning algorithms and techniques in Python to solve real-world data analysis problems.
- Focus on understanding key ML terminology and practical applications.
Prerequisites
Programming skills with Python, e.g. course First Steps in Python. Basic understanding of statistical methods, in particular regression analysis, is recommended, e.g. course Introduction to Statistics.
Target Group
Individuals keen on analyzing data in Python, particularly those interested in machine learning techniques, without having prior knowledge in ML.
Course Days & Times
April 1, 2025, 1:30 pm - 5 pm
April 2, 2025, 1:30 pm - 5 pm
April 10, 2025, 1:30 pm - 5 pm
April 11, 2025, 1:30 pm - 5 pm
NOTE: Registration will open March 4, 2025, 12 pm.
Attendance & Certificates
The course content is coordinated, so we strongly recommend that you do not miss any part of the course. To receive a certificate we expect full time and active participation.
Registration & Cancellation
This course is open to individuals affiliated with Helmholtz or a HIDA Partner only. You may register for the course allocating yourself to one of the following groups:
- All Helmholtz affiliations
- Helmholtz Information & Data Science School (HIDSS) affiliation
- HIDA Partner affiliation
Please note that after the first two weeks of the registration period the unbooked seats from categories 2 and 3 will be opened for all Helmholtz affiliations (category 1).
Your registration for this course is binding. If you need to leave/miss the course for a period of time, please let us know in advance via hida-courses@helmholtz.de.
If you have to cancel the course for any reason, please do so as soon as possible to allow time for others to take your seat. To cancel, please withdraw your registration on the course site or write an email to hida-courses@helmholtz.de.
Additional Information
There is no waiting list for this course! If someone withdraws from a course, their place is automatically reopened. We therefore advise you to keep an eye on the registration in case the course is fully booked and you would like to attend. Also, this course will be offered again in the future - you can check our HIDA course catalog for updates.
This course is free of charge.