One Model, Several Results: The Paradox of the Hosmer-Lemeshow Goodness-Of-Fit Test for the Logistic Regression Model

J Epidemiol Biostat. 2000;5(4):251-3.

Abstract

Background: The Hosmer-Lemeshow test, used extensively to assess the fit of the logistic regression model, is performed by several statistical packages. Recent studies have shown some problems in the use of this test when ties are present. These problems were attributed merely to the test implementation.

Methods: We analysed the order of the observations as an alternative explanation of the problem of ties. Using a data-set of 1393 intensive care unit (ICU) patients we performed the Hosmer-Lemeshow test with all possible subjects dispositions.

Results: We obtained about one million different P values, ranging from 0.01 to 0.95.

Discussion: It is already known that when the Hosmer-Lemeshow goodness-of-fit test is performed with a number of covariate patterns lower than the number of subjects, its result may be inaccurate. We showed that the extent of this problem could be relevant under particular conditions. We also suggest a strategy for estimating the extent of the problem and subsequent interpretation.

MeSH terms

  • Hospital Mortality
  • Humans
  • Intensive Care Units
  • Italy
  • Logistic Models*
  • Predictive Value of Tests
  • Severity of Illness Index
  • Statistics as Topic / methods