Tree-based algorithms for spatial modeling of soil particle distribution in arid and semi-arid region

Environ Monit Assess. 2024 Feb 14;196(3):264. doi: 10.1007/s10661-024-12431-6.

Abstract

Accurate estimation of particle size distribution across a large area is crucial for proper soil management and conservation, ensuring compatibility with capabilities and enabling better selection and adaptation of precision agricultural techniques. The study investigated the performance of tree-based models, ranging from simpler options like CART to sophisticated ones like XGBoost, in predicting soil texture over a wide geographic region. Models were constructed using remotely sensed plant and soil indexes as covariates. Variable selection employed the Boruta approach. Training and testing data for machine learning models consisted of particle size distribution results from 622 surface soil samples collected in southeastern Turkey. The XGBoostClay model emerged as the most accurate predictor, with an R2 value of 0.74. Its superiority was further underlined by a 21.36% relative improvement in XGBoostClay RMSE compared to RFClay and 44.5% compared to CARTClay. Similarly, the R2 values for XGBoostSilt and XGBoostSand models reached 0.71 and 0.75 in predicting sand and silt content, respectively. Among the considered covariates, the normalized ratio vegetation index and slope angle had the highest impact on clay content (21%), followed by topographic position index and simple ratio clay index (20%), while terrain ruggedness index had the least impact (18%). These results highlight the effectiveness of Boruta approach in selecting an adequate number of variables for digital mapping, suggesting its potential as a viable option in this field. Furthermore, the findings of this study suggest that remote sensing data can effectively contribute to digital soil mapping, with tree-based model development leading to improved prediction performance.

Keywords: Clay; Data mining; Digital soil mapping; Model; Particle size distribution; Remote sensing; Sand; Silt.

MeSH terms

  • Algorithms
  • Clay
  • Environmental Monitoring / methods
  • Sand*
  • Soil*

Substances

  • Soil
  • Clay
  • Sand