Machine Learning-Based Integration of High-Resolution Wildfire Smoke Simulations and Observations for Regional Health Impact Assessment

Int J Environ Res Public Health. 2019 Jun 17;16(12):2137. doi: 10.3390/ijerph16122137.


Large wildfires are an increasing threat to the western U.S. In the 2017 fire season, extensive wildfires occurred across the Pacific Northwest (PNW). To evaluate the public health impacts of wildfire smoke, we integrated numerical simulations and observations for regional fire events in August and September 2017. A one-way coupled Weather Research and Forecasting and Community Multiscale Air Quality modeling system was used to simulate fire smoke transport and dispersion. To reduce modeling bias in fine particulate matter (PM2.5) and to optimize smoke exposure estimates, we integrated the modeling results with high-resolution Multi-Angle Implementation of Atmospheric Correction satellite aerosol optical depth and U.S. Environmental Protection Agency AirNow ground-level PM2.5 monitoring concentrations. Three machine learning-based data fusion algorithms were applied: an ordinary multi-linear regression method, a generalized boosting method, and a random forest (RF) method. Ten-fold cross-validation showed that data integration and bias correction improved surface PM2.5 estimation, especially with the RF method. Lastly, to assess transient health effects of fire smoke, we applied the optimized high-resolution PM2.5 exposure estimate in a short-term exposure-response function. Total estimated regional mortality attributable to PM2.5 exposure during the smoke episode was 183 deaths (95% confidence interval: 0, 432), with fire emissions contributing 85% of the PM2.5 pollution and 95% of the consequent multiple-cause mortality. This application demonstrates both the profound health impacts of fire smoke over the PNW and the need for a high-performance fire smoke forecasting and reanalysis system to reduce public health risks of smoke hazards in fire-prone regions.
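The abstract does not give the form of the short-term exposure-response function. As an illustration only, the sketch below assumes the log-linear health impact function commonly used in air quality health assessments, in which attributable deaths equal baseline deaths times 1 - exp(-beta * delta C). The function names, the grid-cell layout, the baseline mortality rate, the coefficient beta, and all input values are hypothetical and are not taken from the paper.

```python
import math


def attributable_deaths(population, baseline_daily_rate, beta, delta_pm25):
    """Deaths attributable to a PM2.5 increment for one cell on one day.

    Assumed log-linear form: y0 * pop * (1 - exp(-beta * delta_pm25)).
    A non-positive increment contributes nothing.
    """
    if delta_pm25 <= 0:
        return 0.0
    baseline_deaths = population * baseline_daily_rate
    return baseline_deaths * (1.0 - math.exp(-beta * delta_pm25))


def episode_deaths(cells, baseline_daily_rate, beta, reference_pm25=0.0):
    """Sum attributable deaths over all grid cells and all days."""
    total = 0.0
    for cell in cells:
        for daily_pm25 in cell["pm25"]:
            total += attributable_deaths(
                cell["population"],
                baseline_daily_rate,
                beta,
                daily_pm25 - reference_pm25,
            )
    return total


# Hypothetical inputs for illustration; none of these values come from the paper.
GRID_CELLS = [
    {"population": 50_000, "pm25": [12.0, 80.0, 150.0]},   # daily means, ug/m3
    {"population": 200_000, "pm25": [8.0, 35.0, 60.0]},
]
BASELINE_DAILY_RATE = 2.0e-5  # assumed deaths per person per day
BETA = 0.001                  # assumed: roughly 1% excess risk per 10 ug/m3

if __name__ == "__main__":
    total = episode_deaths(GRID_CELLS, BASELINE_DAILY_RATE, BETA)
    print(f"Illustrative attributable deaths: {total:.3f}")
```

Running the same calculation once with total PM2.5 and once with the fire contribution removed would give the kind of fire-attributable share the abstract reports. A confidence interval would follow from repeating it with the lower and upper bounds of the coefficient.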

Keywords: PM2.5 air pollution; fire smoke modeling; health impact assessment; machine learning-based data fusion.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Air Pollutants / analysis*
  • Air Pollution / analysis*
  • Algorithms
  • Environmental Monitoring / methods*
  • Health Impact Assessment / methods*
  • Humans
  • Machine Learning*
  • Northwestern United States
  • Smoke / analysis*
  • Wildfires*

Substances

  • Air Pollutants
  • Smoke