Comparing ROC curves derived from regression models

Venkatraman E Seshan; Mithat Gönen; Colin B Begg

doi:10.1002/sim.5648

Comparing ROC curves derived from regression models

Stat Med. 2013 Apr 30;32(9):1483-93. doi: 10.1002/sim.5648. Epub 2012 Oct 3.

Authors

Venkatraman E Seshan¹, Mithat Gönen, Colin B Begg

Affiliation

¹ Department of Epidemiology and Biostatistics, Memorial Sloan-Kettering Cancer Center, New York, NY 10065, USA.

Abstract

In constructing predictive models, investigators frequently assess the incremental value of a predictive marker by comparing the ROC curve generated from the predictive model including the new marker with the ROC curve from the model excluding the new marker. Many commentators have noticed empirically that a test of the two ROC areas often produces a non-significant result when a corresponding Wald test from the underlying regression model is significant. A recent article showed using simulations that the widely used ROC area test produces exceptionally conservative test size and extremely low power. In this article, we demonstrate that both the test statistic and its estimated variance are seriously biased when predictions from nested regression models are used as data inputs for the test, and we examine in detail the reasons for these problems. Although it is possible to create a test reference distribution by resampling that removes these biases, Wald or likelihood ratio tests remain the preferred approach for testing the incremental contribution of a new marker.

Publication types

Comparative Study
Research Support, N.I.H., Extramural

MeSH terms

Area Under Curve
Biomarkers / analysis*
Humans
Likelihood Functions
Models, Statistical*
ROC Curve*
Regression Analysis*

Substances

Biomarkers

Abstract

Publication types

MeSH terms

Substances

Grants and funding