Survival ensembles

Biostatistics. 2006 Jul;7(3):355-73. doi: 10.1093/biostatistics/kxj011. Epub 2005 Dec 12.


We propose a unified and flexible framework for ensemble learning in the presence of censoring. For right-censored data, we introduce a random forest algorithm and a generic gradient boosting algorithm for the construction of prognostic and diagnostic models. The methodology is utilized for predicting the survival time of patients suffering from acute myeloid leukemia based on clinical and genetic covariates. Furthermore, we compare the diagnostic capabilities of the proposed censored data random forest and boosting methods, applied to the recurrence-free survival time of node-positive breast cancer patients, with previously published findings.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Breast Neoplasms / pathology
  • Female
  • Humans
  • Leukemia, Myelomonocytic, Acute / diagnosis
  • Lymphatic Metastasis / pathology
  • Models, Statistical
  • Prognosis
  • Survival Analysis*