Impact of the model-building strategy on inference about nonlinear and time-dependent covariate effects in survival analysis

Stat Med. 2014 Aug 30;33(19):3318-37. doi: 10.1002/sim.6178. Epub 2014 Apr 22.

Abstract

Cox's proportional hazards (PH) model assumes constant-over-time covariate effects. Furthermore, most applications assume linear effects of continuous covariates on the logarithm of the hazard. Yet, many prognostic factors have time-dependent (TD) and/or nonlinear (NL) effects, that is, violate these conventional assumptions. Detection of such complex effects could affect prognosis and clinical decisions. However, assessing the effects of each of the multiple, often correlated, covariates in flexible multivariable analyses is challenging. In simulations, we investigated the impact of the approach used to build the flexible multivariable model on inference about the TD and NL covariate effects. Results demonstrate that the conclusions regarding the statistical significance of the TD/NL effects depend heavily on the strategy used to decide which effects of the other covariates should be adjusted for. Both a failure to adjust for true TD and NL effects of relevant covariates and inclusion of spurious effects of covariates that conform to the PH and linearity assumptions increase the risk of incorrect conclusions regarding other covariates. In this context, iterative backward elimination of nonsignificant NL and TD effects from the multivariable model, which initially includes all these effects, may help discriminate between true and spurious effects. The practical importance of these issues was illustrated in an example that reassessed the predictive ability of selected biomarkers for survival in advanced non-small-cell lung cancer. In conclusion, a careful model-building strategy and flexible modeling of multivariable survival data can yield new insights about predictors' roles and improve the validity of analyses.

Keywords: Cox's proportional hazards model; flexible modeling; hypothesis testing; model-building strategy; residual confounding.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Biomarkers, Tumor / blood
  • Biostatistics
  • C-Reactive Protein / metabolism
  • Carcinoma, Non-Small-Cell Lung / blood
  • Carcinoma, Non-Small-Cell Lung / mortality
  • Computer Simulation
  • Humans
  • Leukocyte Count
  • Likelihood Functions
  • Linear Models
  • Lung Neoplasms / blood
  • Lung Neoplasms / mortality
  • Models, Statistical
  • Multivariate Analysis
  • Neutrophils
  • Nonlinear Dynamics
  • Prognosis
  • Proportional Hazards Models
  • Serum Albumin / metabolism
  • Smoking / adverse effects
  • Survival Analysis*
  • Time Factors

Substances

  • Biomarkers, Tumor
  • Serum Albumin
  • C-Reactive Protein