Joint modeling of multivariate longitudinal data and survival data in several observational studies of Huntington's disease

BMC Med Res Methodol. 2018 Nov 16;18(1):138. doi: 10.1186/s12874-018-0592-9.


Background: Joint modeling is appropriate when one wants to predict the time to an event with covariates that are measured longitudinally and are related to the event. An underlying random effects structure links the survival and longitudinal submodels and allows for individual-specific predictions. Multiple time-varying and time-invariant covariates can be included to potentially increase prediction accuracy. The goal of this study was to estimate a multivariate joint model on several longitudinal observational studies of Huntington's disease, examine external validity performance, and compute individual-specific predictions for characterizing disease progression. Emphasis was on the survival submodel for predicting the hazard of motor diagnosis.
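For orientation, a generic shared random-effects joint model can be written as below. This is a textbook-style sketch and not necessarily the exact specification used in the study. The symbols are assumptions introduced here for illustration: y_{ik}(t) is the k-th longitudinal covariate of individual i, b_{ik} the individual-specific random effects, w_i the time-invariant covariates, and alpha_k the association parameters linking the two submodels.

```latex
% Longitudinal submodel (one linear mixed model per time-varying covariate k)
y_{ik}(t) = m_{ik}(t) + \varepsilon_{ik}(t), \qquad
m_{ik}(t) = x_{ik}(t)^{\top} \beta_k + z_{ik}(t)^{\top} b_{ik}

% Survival submodel (proportional hazards for time to motor diagnosis)
h_i(t) = h_0(t) \exp\!\Big( \gamma^{\top} w_i + \sum_{k} \alpha_k \, m_{ik}(t) \Big)

% Cumulative hazard, the quantity on which individual-specific predictions are based
H_i(t) = \int_0^{t} h_i(s) \, ds
```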

Methods: Data from four observational studies were analyzed: Enroll-HD, PREDICT-HD, REGISTRY, and Track-HD. A Bayesian approach to estimation was adopted, and external validation was performed using a time-varying AUC measure. Individual-specific cumulative hazard predictions were computed based on a simulation approach. The cumulative hazard was used for computing predicted age of motor onset and also for a deviance residual indicating the discrepancy between observed diagnosis status and model-based status.
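As an illustration of how a deviance residual can be derived from a cumulative hazard, the following minimal sketch uses the standard martingale-based form from survival analysis. It is an assumption that the study's residual follows this form; the function name `deviance_residual` and its arguments are hypothetical and chosen only for this example.

```python
import math


def deviance_residual(event: int, cum_hazard: float) -> float:
    """Standard martingale-based deviance residual (assumed form).

    event      -- 1 if the individual was diagnosed, 0 if not yet diagnosed
    cum_hazard -- model-based cumulative hazard H at the individual's last
                  observed time (must be > 0 when event == 1)
    """
    if event not in (0, 1):
        raise ValueError("event must be 0 or 1")
    if cum_hazard < 0 or (event == 1 and cum_hazard == 0):
        raise ValueError("cum_hazard must be >= 0, and > 0 for an event")

    # Martingale residual: observed status minus expected number of events.
    martingale = event - cum_hazard

    # delta * log(delta - M) reduces to log(H) for an event and to 0 otherwise.
    log_term = math.log(cum_hazard) if event == 1 else 0.0

    # The quantity under the square root is non-negative in theory;
    # max() guards against tiny negative values from floating-point rounding.
    squared = max(-2.0 * (martingale + log_term), 0.0)
    return math.copysign(math.sqrt(squared), martingale)


# Diagnosed despite low model-based risk: large positive residual.
print(deviance_residual(1, 0.1))
# Not yet diagnosed despite high model-based risk: large negative residual.
print(deviance_residual(0, 2.0))
```

Under this form, a residual near zero means the observed diagnosis status agrees with the model-based risk, while residuals far from zero in either direction flag individuals whose status is discrepant with the model.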

Results: The joint model trained in a single study had very good performance in discriminating between diagnosed and pre-diagnosed participants in the remaining test studies, with the 5-year mean AUC = .83 (range .77-.90), and the 10-year mean AUC = .86 (range .82-.92). Graphical analysis of the predicted age of motor diagnosis showed an expected strong relationship with the trinucleotide expansion that causes Huntington's disease. Graphical analysis of the deviance-type residual revealed there were individuals who converted to a diagnosis despite having relatively low model-based risk, others who had not yet converted despite having relatively high risk, and the majority falling between the two extremes.

Conclusions: Joint modeling is an improvement over traditional survival modeling because it considers all the longitudinal observations of covariates that are predictive of an event. Predictions from joint models can have greater accuracy because they are tailored to account for individual variability. These predictions can provide relatively accurate characterizations of individual disease progression, which might be important in the timing of interventions, determining the qualification for appropriate clinical trials, and general genotypic analysis.

Keywords: Joint modeling (JM) - survival analysis - linear mixed modeling (LMM) - external validation - proportional hazards model - Huntington’s disease (HD).

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adult
  • Algorithms*
  • Bayes Theorem
  • Disease Progression
  • Female
  • Humans
  • Huntington Disease / diagnosis*
  • Huntington Disease / genetics
  • Longitudinal Studies
  • Male
  • Middle Aged
  • Multivariate Analysis*
  • Observational Studies as Topic*
  • Patient-Specific Modeling*
  • Reproducibility of Results
  • Survival Analysis