I-Boost: an integrative boosting approach for predicting survival time with multiple genomics platforms

Genome Biol. 2019 Mar 7;20(1):52. doi: 10.1186/s13059-019-1640-4.


We propose a statistical boosting method, termed I-Boost, to integrate multiple types of high-dimensional genomics data with clinical data for predicting survival time. I-Boost provides substantially higher prediction accuracy than existing methods. By applying I-Boost to data from The Cancer Genome Atlas, we show that (1) integrating multiple genomics platforms with clinical variables improves the prediction of survival time over the use of clinical variables alone; (2) gene expression values are typically more prognostic of survival time than other genomics data types; and (3) gene modules/signatures are at least as prognostic as the collection of individual gene expression data.
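The abstract does not spell out the I-Boost algorithm, so the following is only a minimal sketch of the general idea behind boosting for survival data, not the authors' method. It implements generic component-wise gradient boosting for the Cox partial likelihood: at each step the gradient with respect to the linear predictor is computed, each feature is fit to it by least squares, and only the best-fitting feature has its coefficient updated. The function names (`cox_gradient`, `componentwise_cox_boost`), the toy data, the step size, and the number of steps are all illustrative assumptions; features are assumed to be centered and event times untied.

```python
import math


def cox_gradient(eta, time, event):
    """Gradient of the Cox partial log-likelihood w.r.t. each linear predictor.

    Assumes no tied event times (illustrative simplification).
    """
    n = len(eta)
    w = [math.exp(e) for e in eta]
    grad = [float(event[k]) for k in range(n)]
    for i in range(n):
        if not event[i]:
            continue  # censored subjects define no risk set of their own
        # Risk set of subject i: everyone still under observation at time[i].
        risk_sum = sum(w[j] for j in range(n) if time[j] >= time[i])
        for k in range(n):
            if time[k] >= time[i]:
                grad[k] -= w[k] / risk_sum
    return grad


def componentwise_cox_boost(X, time, event, n_steps=50, step_size=0.1):
    """Generic component-wise boosting (hypothetical helper, not I-Boost itself).

    X is a list of rows; columns are assumed centered, so no intercept is fit.
    """
    n, p = len(X), len(X[0])
    beta = [0.0] * p
    eta = [0.0] * n
    for _ in range(n_steps):
        u = cox_gradient(eta, time, event)
        best_j, best_coef, best_gain = None, 0.0, 0.0
        for j in range(p):
            sxx = sum(X[i][j] ** 2 for i in range(n))
            if sxx == 0.0:
                continue
            sxu = sum(X[i][j] * u[i] for i in range(n))
            gain = sxu * sxu / sxx  # squared-error reduction from fitting u on feature j
            if gain > best_gain:
                best_j, best_coef, best_gain = j, sxu / sxx, gain
        if best_j is None:
            break  # gradient is orthogonal to every feature
        # Shrunken update of the single selected coefficient.
        beta[best_j] += step_size * best_coef
        for i in range(n):
            eta[i] += step_size * best_coef * X[i][best_j]
    return beta


# Toy data (made up for illustration): feature 0 orders the survival times,
# feature 1 is orthogonal to it.
X = [[-2.5, 1.0], [-1.5, -1.0], [-0.5, 0.0], [0.5, 0.0], [1.5, -1.0], [2.5, 1.0]]
time = [6.0, 5.0, 4.0, 3.0, 2.0, 1.0]
event = [1, 1, 1, 1, 1, 1]

beta = componentwise_cox_boost(X, time, event)
print(beta)
```

In an integrative setting, the columns of `X` would come from several blocks (clinical variables plus each genomics platform). How the updates are chosen across those blocks is the part specific to I-Boost and is not reproduced here.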

Keywords: Cancer genomics; Data integration; Gene modules; Variable selection.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Gene Expression Profiling / methods*
  • Gene Expression Regulation, Neoplastic*
  • Gene Regulatory Networks*
  • Genomics / methods*
  • Humans
  • Models, Statistical
  • Neoplasms / genetics
  • Neoplasms / mortality*
  • Prognosis
  • Software*
  • Survival Rate