Variable selection in the cox regression model with covariates missing at random

Biometrics. 2010 Mar;66(1):97-104. doi: 10.1111/j.1541-0420.2009.01274.x. Epub 2009 May 18.

Abstract

We consider variable selection in the Cox regression model (Cox, 1975, Biometrika 362, 269-276) with covariates missing at random. We investigate the smoothly clipped absolute deviation penalty and adaptive least absolute shrinkage and selection operator (LASSO) penalty, and propose a unified model selection and estimation procedure. A computationally attractive algorithm is developed, which simultaneously optimizes the penalized likelihood function and penalty parameters. We also optimize a model selection criterion, called the IC(Q) statistic (Ibrahim, Zhu, and Tang, 2008, Journal of the American Statistical Association 103, 1648-1658), to estimate the penalty parameters and show that it consistently selects all important covariates. Simulations are performed to evaluate the finite sample performance of the penalty estimates. Also, two lung cancer data sets are analyzed to demonstrate the proposed methodology.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Computer Simulation
  • Data Interpretation, Statistical*
  • Humans
  • Lung Neoplasms / mortality*
  • Models, Statistical*
  • Multivariate Analysis
  • Proportional Hazards Models*
  • Sample Size
  • Survival Analysis*
  • Survival Rate*