Estimate of soil heavy metal in a mining region using PCC-SVM-RFECV-AdaBoost combined with reflectance spectroscopy

Environ Geochem Health. 2023 Dec;45(12):9103-9121. doi: 10.1007/s10653-023-01488-w. Epub 2023 Mar 4.

Abstract

Soil contamination with heavy metals is a relatively serious issue in China. Traditional soil heavy metal survey methods cannot meet the demand for rapid and real-time large-scale area soil heavy metal surveys. We chose a typical mining area in Henan Province as the study area, collected 124 soil samples in the field and obtained their soil hyperspectral data indoors using a spectrometer. After different spectral transformations of the soil spectral curves, Pearson correlation coefficients (PCC) between them and the heavy metals Cd, Cr, Cu, and Ni were calculated, and after correlation evaluation, the best spectral transformations for each heavy metal were determined and preselected characteristic wavebands were obtained. Then the support vector machine recursive feature elimination cross-validation (SVM-RFECV) is used to select among the preselected feature wavebands to obtain the final modeled wavebands, and the Adaptive Boosting (AdaBoost), Gradient Boosting Decision Tree (GBDT), Random Forest (RF), and Partial Least Squares (PLS) methods were used to establish the inversion model. The results showed that the PCC-SVM-RFECV can effectively select characteristic wavebands with high contribution to modeling from high-dimensional data. Spectral transformations methods can improve the correlation of spectra with heavy metals. The location and quantity of characteristic wavebands for the four heavy metals were different. The accuracy of AdaBoost was significantly better than that of GBDT, RF, and PLS (i.e., Ni: [Formula: see text]). This study can provide a technical reference for the use of hyperspectral inversion models for large-scale monitoring of soil heavy metal content.

Keywords: Characteristic wavebands select; Hyperspectral; Inversion modeling; Soil heavy metal; Spectral transformation.

MeSH terms

  • China
  • Environmental Monitoring / methods
  • Metals, Heavy* / analysis
  • Soil / chemistry
  • Soil Pollutants* / analysis
  • Spectrum Analysis
  • Support Vector Machine

Substances

  • Soil
  • Soil Pollutants
  • Metals, Heavy