Equivalence of improvement in area under ROC curve and linear discriminant analysis coefficient under assumption of normality

Stat Med. 2011 May 30;30(12):1410-8. doi: 10.1002/sim.4196. Epub 2011 Feb 21.


In this paper we investigate the addition of new variables to an existing risk prediction model and the subsequent impact on discrimination quantified by the area under the receiver operating characteristics curve (AUC of ROC). Based on practical experience, concerns have emerged that the significance of association of the variable under study with the outcome in the risk model does not correspond to the significance of the change in AUC: that is, often the variable is significant, but the change in AUC is not. This paper demonstrates that under the assumption of multivariate normality and employing linear discriminant analysis (LDA) to construct the risk prediction tool, statistical significance of the new predictor(s) is equivalent to the statistical significance of the increase in AUC. Under these assumptions the result extends asymptotically to logistic regression. We further show that equality of variance-covariance matrices of predictors within cases and non-cases is not necessary when LDA is used. However, our practical example from the Framingham Heart Study data suggests that the finding might be sensitive to the assumption of normality.

MeSH terms

  • Computer Simulation
  • Coronary Disease / etiology
  • Discriminant Analysis
  • Humans
  • Models, Statistical*
  • ROC Curve*
  • Risk Assessment / methods*
  • Risk Factors