A solution to the problem of separation in logistic regression

Stat Med. 2002 Aug 30;21(16):2409-19. doi: 10.1002/sim.1047.

Abstract

The phenomenon of separation or monotone likelihood is observed in the fitting process of a logistic model if the likelihood converges while at least one parameter estimate diverges to +/- infinity. Separation primarily occurs in small samples with several unbalanced and highly predictive risk factors. A procedure by Firth originally developed to reduce the bias of maximum likelihood estimates is shown to provide an ideal solution to separation. It produces finite parameter estimates by means of penalized maximum likelihood estimation. Corresponding Wald tests and confidence intervals are available but it is shown that penalized likelihood ratio tests and profile penalized likelihood confidence intervals are often preferable. The clear advantage of the procedure over previous options of analysis is impressively demonstrated by the statistical analysis of two cancer studies.

Publication types

  • Comparative Study

MeSH terms

  • Breast Neoplasms / radiotherapy
  • Case-Control Studies*
  • Computer Simulation
  • Endometrial Neoplasms / pathology
  • Female
  • Humans
  • Likelihood Functions*
  • Logistic Models*
  • Lung Neoplasms / etiology
  • Monte Carlo Method
  • Neovascularization, Pathologic / pathology
  • Radiotherapy / adverse effects
  • Smoking / adverse effects