The predictive performance and stability of six species distribution models

PLoS One. 2014 Nov 10;9(11):e112764. doi: 10.1371/journal.pone.0112764. eCollection 2014.

Abstract

Background: Predicting species' potential geographical ranges with species distribution models (SDMs) is central to understanding their ecological requirements. However, the effects of using different modeling techniques need further investigation. To improve predictions, we need to assess the predictive performance and stability of different SDMs.

Methodology: We collected distribution data for five common tree species (Pinus massoniana, Betula platyphylla, Quercus wutaishanica, Quercus mongolica and Quercus variabilis) and simulated their potential distribution areas using 13 environmental variables and six widely used SDMs: BIOCLIM, DOMAIN, MAHAL, RF, MAXENT, and SVM. Each model run was repeated 100 times (trials). We compared predictive performance by testing the consistency between observed and simulated distributions, and assessed stability using the standard deviation, coefficient of variation, and 99% confidence interval of the Kappa and AUC values.
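The stability measures named above can be sketched in a few lines of Python. This is an illustration only, not the authors' code: the function name `stability_summary` is hypothetical, and the confidence interval uses a normal approximation for the mean (z = 2.5758), which is an assumption because the abstract does not state how the interval was computed.

```python
import math
import statistics


def stability_summary(scores, z=2.5758):
    """Summarize per-trial scores (e.g. AUC or Kappa) from repeated model runs.

    Assumption: the 99% confidence interval is a normal-approximation
    interval for the mean; the abstract does not specify the method used.
    """
    n = len(scores)
    mean = statistics.fmean(scores)
    sd = statistics.stdev(scores)          # sample standard deviation
    cv = sd / mean                         # coefficient of variation
    half_width = z * sd / math.sqrt(n)     # half-width of the 99% interval
    return {
        "mean": mean,
        "sd": sd,
        "cv": cv,
        "ci99": (mean - half_width, mean + half_width),
    }


# Hypothetical AUC values from three trials of one model.
summary = stability_summary([0.90, 0.92, 0.94])
```

Under this reading, a model with a smaller standard deviation, a smaller coefficient of variation, and a narrower interval across its trials counts as more stable.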

Results: The mean values of AUC and Kappa from MAHAL, RF, MAXENT, and SVM trials were similar and significantly higher than those from BIOCLIM and DOMAIN trials (p<0.05), while the associated standard deviations and coefficients of variation were larger for BIOCLIM and DOMAIN trials (p<0.05), and the 99% confidence intervals for AUC and Kappa values were narrower for MAHAL, RF, MAXENT, and SVM. Compared to BIOCLIM and DOMAIN, the other SDMs (MAHAL, RF, MAXENT, and SVM) had higher prediction accuracy and narrower confidence intervals, and were more stable and less affected by the random variable (randomly selected pseudo-absence points).
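For readers unfamiliar with the two accuracy measures compared above, the following minimal sketch shows how AUC and Cohen's Kappa could be computed for a single trial from model scores at presence points and at pseudo-absence points. The function names, the example scores, and the 0.5 threshold are all hypothetical; the abstract does not say how the authors thresholded predictions or which software computed these values.

```python
def auc(presence_scores, absence_scores):
    """AUC as the fraction of presence/pseudo-absence pairs ranked correctly.

    Ties count as half a correct ranking (Mann-Whitney formulation).
    """
    wins = 0.0
    for p in presence_scores:
        for a in absence_scores:
            if p > a:
                wins += 1.0
            elif p == a:
                wins += 0.5
    return wins / (len(presence_scores) * len(absence_scores))


def cohen_kappa(presence_scores, absence_scores, threshold):
    """Cohen's Kappa after converting scores to presence/absence at a threshold."""
    tp = sum(s >= threshold for s in presence_scores)   # presences predicted present
    fn = len(presence_scores) - tp                      # presences predicted absent
    fp = sum(s >= threshold for s in absence_scores)    # pseudo-absences predicted present
    tn = len(absence_scores) - fp                       # pseudo-absences predicted absent
    n = tp + fn + fp + tn
    observed = (tp + tn) / n
    expected = ((tp + fp) * (tp + fn) + (fn + tn) * (fp + tn)) / n ** 2
    if expected == 1.0:
        return 0.0  # agreement is entirely attributable to chance
    return (observed - expected) / (1.0 - expected)


# Hypothetical suitability scores for one trial.
presence = [0.9, 0.8, 0.4]
pseudo_absence = [0.7, 0.3, 0.2]
trial_auc = auc(presence, pseudo_absence)
trial_kappa = cohen_kappa(presence, pseudo_absence, threshold=0.5)
```

Because the pseudo-absence points are drawn at random, repeating this calculation over many trials yields a distribution of AUC and Kappa values for each model, and the spread of that distribution is what the stability comparison refers to.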

Conclusions: Based on their predictive performance and stability, the six SDMs fall into two categories: a high-performance, high-stability group comprising MAHAL, RF, MAXENT, and SVM, and a low-performance, low-stability group consisting of BIOCLIM and DOMAIN. We highlight that choosing appropriate SDMs to address a specific problem is an important part of the modeling process.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Betula / physiology
  • Magnoliopsida / classification
  • Magnoliopsida / physiology*
  • Pinus / physiology*
  • Quercus / classification
  • Quercus / physiology
  • Species Specificity
  • Statistical Distributions*

Grants and funding

This study was jointly funded by the National Natural Science Foundation of China (NSFC 31100311) and the excellent provincial young fund of Anhui (2012SQRL113ZD). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.