Nonparametric competing risks analysis using Bayesian Additive Regression Trees

Stat Methods Med Res. 2020 Jan;29(1):57-77. doi: 10.1177/0962280218822140. Epub 2019 Jan 7.

Abstract

Many time-to-event studies are complicated by the presence of competing risks. Such data are often analyzed using Cox models for the cause-specific hazard function or Fine and Gray models for the subdistribution hazard. In practice, regression relationships in competing risks data are often complex and may include nonlinear functions of covariates, interactions, high-dimensional parameter spaces and nonproportional cause-specific, or subdistribution, hazards. Model misspecification can lead to poor predictive performance. To address these issues, we propose a novel approach: flexible prediction modeling of competing risks data using Bayesian Additive Regression Trees (BART). We study the simulation performance in two-sample scenarios as well as a complex regression setting, and benchmark its performance against standard regression techniques as well as random survival forests. We illustrate the use of the proposed method on a recently published study of patients undergoing hematopoietic stem cell transplantation.

Keywords: Cumulative incidence; graft-versus-host disease (GVHD); hematopoietic stem cell transplant; machine learning; nonproportional; treatment heterogeneity; variable selection.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Bayes Theorem*
  • Benchmarking
  • Computer Simulation
  • Graft vs Host Disease / epidemiology*
  • Hematopoietic Stem Cell Transplantation*
  • Humans
  • Incidence
  • Machine Learning
  • Regression Analysis