Topic 1 Days

Name: Topic 1 Days
Start: 2023-06-21T12:00:00+02:00
End: 2023-06-23T14:15:00+02:00
Location: Telegrafenberg

Europe/Berlin timezone

Overfitting and overextending – reframing the potential of machine-learning techniques in calibrating low-cost sensors

22 Jun 2023, 15:50

15m

Building H (Telegrafenberg)

Invited Talk

Deep/Machine learning and data science

Speaker: Sean Schmitz

Machine-learning (ML) techniques have recently been applied to the calibration of low-cost sensors (LCS). Many studies report successful calibration with ML techniques such as random forests (RF), neural networks (NN), and support vector regression (SVR). We find that calibrating LCS for the measurement of nitrogen dioxide (NO2) and particulate matter (PM) with ML techniques is not as beneficial as previously reported. While some hierarchical tree-based methods, such as RF and gradient-boosting machines (GBM), have some success, they also have substantial limitations. Others, such as NN and SVR, are prone to overfitting, to the point that using these models for prediction on new data is inadvisable. Instead, we find that for calibrating NO2 and PM2.5 measurements, multiple linear regression (MLR) is the most reliable, transparent, and consistent method. Though many ML techniques have potential in a variety of applications, they may not always be appropriate, as shown here for the calibration of LCS.
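The contrast between a transparent linear calibration and a flexible model that overfits can be illustrated with a small sketch. Everything in it is an assumption made for illustration, not the data or method of the talk: the data are synthetic, the "true" relationship between the raw sensor signal, temperature, and the reference value is invented, and a 1-nearest-neighbour regressor stands in for a flexible learner (it is not one of the techniques named in the abstract). The sketch fits MLR by ordinary least squares and compares training and held-out error for both models.

```python
import math
import random


def fit_mlr(X, y):
    """Ordinary least squares with an intercept, via the normal equations."""
    rows = [[1.0] + list(x) for x in X]
    p = len(rows[0])
    # Augmented matrix [A^T A | A^T y]
    A = [[sum(r[i] * r[j] for r in rows) for j in range(p)]
         + [sum(r[i] * t for r, t in zip(rows, y))]
         for i in range(p)]
    # Gauss-Jordan elimination with partial pivoting
    for c in range(p):
        piv = max(range(c, p), key=lambda k: abs(A[k][c]))
        A[c], A[piv] = A[piv], A[c]
        for k in range(p):
            if k != c:
                f = A[k][c] / A[c][c]
                A[k] = [a - f * b for a, b in zip(A[k], A[c])]
    return [A[i][p] / A[i][i] for i in range(p)]


def predict_mlr(beta, X):
    """Apply the fitted coefficients (intercept first) to new inputs."""
    return [beta[0] + sum(b * v for b, v in zip(beta[1:], x)) for x in X]


def predict_1nn(X_train, y_train, X):
    """Return the target of the closest training point (memorises the data)."""
    out = []
    for x in X:
        idx = min(range(len(X_train)), key=lambda i: math.dist(x, X_train[i]))
        out.append(y_train[idx])
    return out


def rmse(y_true, y_pred):
    return math.sqrt(
        sum((a - b) ** 2 for a, b in zip(y_true, y_pred)) / len(y_true)
    )


def make_data(n, rng):
    """Synthetic, hypothetical data: reference = 5 + 0.8*raw - 0.3*temp + noise."""
    X, y = [], []
    for _ in range(n):
        raw = rng.uniform(0.0, 50.0)   # hypothetical raw sensor signal
        temp = rng.uniform(0.0, 30.0)  # hypothetical temperature covariate
        X.append((raw, temp))
        y.append(5.0 + 0.8 * raw - 0.3 * temp + rng.gauss(0.0, 3.0))
    return X, y


rng = random.Random(0)
X_train, y_train = make_data(400, rng)
X_test, y_test = make_data(400, rng)

beta = fit_mlr(X_train, y_train)
mlr_train = rmse(y_train, predict_mlr(beta, X_train))
mlr_test = rmse(y_test, predict_mlr(beta, X_test))
nn_train = rmse(y_train, predict_1nn(X_train, y_train, X_train))
nn_test = rmse(y_test, predict_1nn(X_train, y_train, X_test))

print("MLR coefficients (intercept, raw, temp):", [round(b, 3) for b in beta])
print(f"MLR  train RMSE {mlr_train:.2f} | test RMSE {mlr_test:.2f}")
print(f"1-NN train RMSE {nn_train:.2f} | test RMSE {nn_test:.2f}")
```

A model that reproduces its training data exactly while doing worse than MLR on held-out data is the overfitting pattern the abstract warns about. The MLR coefficients can also be read directly as the sensitivity of the calibrated value to each input, which is one sense in which the linear model is transparent.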

Author: Sean Schmitz

Presentation materials: 22_-_Schmitz_2023_overfitting and overextending_v2.pdf
