An additive Gaussian process regression model for interpretable non-parametric analysis of longitudinal data

Nat Commun. 2019 Apr 17;10(1):1798. doi: 10.1038/s41467-019-09785-8.

Abstract

Biomedical research typically involves longitudinal study designs where samples from individuals are measured repeatedly over time and the goal is to identify risk factors (covariates) that are associated with an outcome value. General linear mixed effect models are the standard workhorse for statistical analysis of longitudinal data. However, analysis of longitudinal data can be complicated for reasons such as difficulties in modelling correlated outcome values, functional (time-varying) covariates, nonlinear and non-stationary effects, and model inference. We present LonGP, an additive Gaussian process regression model that is specifically designed for statistical analysis of longitudinal data, which solves these commonly faced challenges. LonGP can model time-varying random effects and non-stationary signals, incorporate multiple kernel learning, and provide interpretable results for the effects of individual covariates and their interactions. We demonstrate LonGP's performance and accuracy by analysing various simulated and real longitudinal -omics datasets.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Biomedical Research / methods*
  • Case-Control Studies
  • Data Interpretation, Statistical*
  • Datasets as Topic
  • Feasibility Studies
  • Female
  • Humans
  • Linear Models
  • Longitudinal Studies*
  • Male
  • Metagenomics / methods
  • Normal Distribution
  • Proteomics / methods
  • Research Design*
  • Software