Skip to main page content
Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 May 2;12(5):e0176684.
doi: 10.1371/journal.pone.0176684. eCollection 2017.

Remote sensing-based measurement of Living Environment Deprivation: Improving classical approaches with machine learning

Affiliations
Free PMC article

Remote sensing-based measurement of Living Environment Deprivation: Improving classical approaches with machine learning

Daniel Arribas-Bel et al. PLoS One. .
Free PMC article

Abstract

This paper provides evidence on the usefulness of very high spatial resolution (VHR) imagery in gathering socioeconomic information in urban settlements. We use land cover, spectral, structure and texture features extracted from a Google Earth image of Liverpool (UK) to evaluate their potential to predict Living Environment Deprivation at a small statistical area level. We also contribute to the methodological literature on the estimation of socioeconomic indices with remote-sensing data by introducing elements from modern machine learning. In addition to classical approaches such as Ordinary Least Squares (OLS) regression and a spatial lag model, we explore the potential of the Gradient Boost Regressor and Random Forests to improve predictive performance and accuracy. In addition to novel predicting methods, we also introduce tools for model interpretation and evaluation such as feature importance and partial dependence plots, or cross-validation. Our results show that Random Forest proved to be the best model with an R2 of around 0.54, followed by Gradient Boost Regressor with 0.5. Both the spatial lag model and the OLS fall behind with significantly lower performances of 0.43 and 0.3, respectively.

Conflict of interest statement

Competing Interests: The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1. Location of Liverpool city and LED index values at LSOA level, shown over a Stamen Terrain base map (base map tiles by Stamen Design, under a CC BY 3.0, data by OpenStreetMap, under CC BY SA).
Fig 2
Fig 2. Feature importance plot (Random Forest).
Fig 3
Fig 3. Partial dependence plots for the four most relevant variables (Gradient Boost Regressor): RSF, percentage of vegetation and water, and RMM.
Fig 4
Fig 4. Cross-validated R2 (median values in parenthesis).

Similar articles

Cited by

References

    1. Jain S. Remote sensing application for property tax evaluation. International Journal of Applied Earth Observation and Geoinformation. 2008;10(1):109–121. 10.1016/j.jag.2007.10.008 - DOI
    1. Taubenböck H, Wegmann M, Roth A, Mehl H, Dech S. Urbanization in India—Spatiotemporal analysis using remote sensing data. Computers, Environment and Urban Systems. 2009;33(3):179–188. 10.1016/j.compenvurbsys.2008.09.003 - DOI
    1. Steele JE, Sundsøy PR, Pezzulo C, Alegana VA, Bird TJ, Blumenstock J, et al. Mapping poverty using mobile phone and satellite data. J R Soc Interface. 2017;14(20160690). - PMC - PubMed
    1. Grace K, Nagle NN, Husak G. Can Small-Scale Agricultural Production Improve Children’s Health? Examining Stunting Vulnerability among Very Young Children in Mali, West Africa. Annals of the American Association of Geographers. 2016;4452(March):1–16.
    1. Jean N, Burke M, Xie M, Davis WM, Lobell DB, Ermon S. Combining satellite imagery and machine learning to predict poverty. Science (New York, NY). 2016;353(6301):790–794. 10.1126/science.aaf7894 - DOI - PubMed

Grant support

The author(s) received no specific funding for this work.

LinkOut - more resources