Concept Drift Mitigation in Low-Cost Air Quality Monitoring Networks

Gerardo D'Elia; Matteo Ferro; Paolo Sommella; Sergio Ferlito; Saverio De Vito; Girolamo Di Francia

doi:10.3390/s24092786

Concept Drift Mitigation in Low-Cost Air Quality Monitoring Networks

Sensors (Basel). 2024 Apr 27;24(9):2786. doi: 10.3390/s24092786.

Authors

Gerardo D'Elia^{1

2}, Matteo Ferro³, Paolo Sommella², Sergio Ferlito¹, Saverio De Vito¹, Girolamo Di Francia¹

Affiliations

¹ TERIN-SSI-EDS Laboratory, ENEA CR-Portici, P. le E. Fermi 1, 80055 Portici, Italy.
² Department of Industrial Engineering (DIIn), University of Salerno, Via Giovanni Paolo II, 132, 84084 Fisciano, Italy.
³ Hippocratica Imaging S.r.l., Via Giulio Pastore, 32, 84131 Salerno, Italy.

Abstract

Future air quality monitoring networks will integrate fleets of low-cost gas and particulate matter sensors that are calibrated using machine learning techniques. Unfortunately, it is well known that concept drift is one of the primary causes of data quality loss in machine learning application operational scenarios. The present study focuses on addressing the calibration model update of low-cost NO₂ sensors once they are triggered by a concept drift detector. It also defines which data are the most appropriate to use in the model updating process to gain compliance with the relative expanded uncertainty (REU) limits established by the European Directive. As the examined methodologies, the general/global and the importance weighting calibration models were applied for concept drift effects mitigation. Overall, for all the devices under test, the experimental results show the inadequacy of both models when performed independently. On the other hand, the results from the application of both models through a stacking ensemble strategy were able to extend the temporal validity of the used calibration model by three weeks at least for all the sensor devices under test. Thus, the usefulness of the whole information content gathered throughout the original co-location process was maximized.

Keywords: air quality network; calibration model update; concept drift; general calibration; global calibration; importance weighting; relative expanded uncertainty.

Grants and funding

B57H22003350007/POR Campania FESR Research and Innovation programs, ARMONIA Project