Impact analysis of environmental and social factors on early-stage COVID-19 transmission in China by machine learning

Environ Res. 2022 May 15:208:112761. doi: 10.1016/j.envres.2022.112761. Epub 2022 Jan 21.

Abstract

As a highly contagious disease, COVID-19 caused a worldwide pandemic and it is still ongoing. However, the infection in China has been successfully controlled although its initial transmission was also nationwide and has caused a serious public health crisis. The analysis on the early-stage COVID-19 transmission in China is worth investigating for its guiding significance on prevention to other countries and regions. In this study, we conducted the experiments from the perspectives of COVID-19 occurrence and intensity. We eliminated unimportant factors from 113 variables and applied four machine learning-based classification and regression models to predict COVID-19 occurrence and intensity, respectively. The influence of each important factor was analysed when applicable. Our optimal model on COVID-19 occurrence prediction presented an accuracy of 91.91% and the best R2 of intensity prediction reached 0.778. Linear regression-based model was identified as unable to fit and predict the intensity, and thus only the variable influence on COVID-19 occurrence can be explained. We found that (1) CO VID-19 was more likely to occur in prosperous cities closer to the epicentre and located on higher altitudes, (2) and the occurrence was higher under extreme weather and high minimum relative humidity. (3) Most air pollutants increased the risk of COVID-19 occurrence except NO2 and O3, and there existed a lag effect of 6-7 days. (4) NPIs (non-pharmaceutical interventions) did not show apparent effect until two weeks after.

Keywords: Air pollutants; COVID-19; Machine learning; Meteorology; Non-pharmaceutical interventions; Social data.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Air Pollutants* / analysis
  • Air Pollution* / analysis
  • COVID-19* / epidemiology
  • China / epidemiology
  • Cities
  • Humans
  • Machine Learning
  • Particulate Matter / analysis
  • SARS-CoV-2
  • Social Factors

Substances

  • Air Pollutants
  • Particulate Matter