Stochastic imputation for integrated transcriptome association analysis of a longitudinally measured trait

Stat Methods Med Res. 2020 Apr;29(4):1167-1180. doi: 10.1177/0962280219852720. Epub 2019 Jun 7.

Abstract

The mechanistic pathways linking genetic polymorphisms and complex disease traits remain largely uncharacterized. At the same time, expansive new transcriptome data resources offer unprecedented opportunity to unravel the mechanistic underpinnings of complex disease associations. Two-stage strategies involving conditioning on a single, penalized regression imputation for transcriptome association analysis have been described for cross-sectional traits. In this manuscript, we propose an alternative two-stage approach based on stochastic regression imputation that additionally incorporates error in the predictive model. Application of a bootstrap procedure offers flexibility when a closed form predictive distribution is not available. The two-stage strategy is also generalized to longitudinally measured traits, using a linear mixed effects modeling framework and a composite test statistic to evaluate whether the genetic component of gene-level expression modifies the biomarker trajectory over time. Simulations studies are performed to evaluate relative performance with respect to type-1 error rates, coverage, estimation error, and power under a range of conditions. A case study is presented to investigate the association between whole blood expression for each of five inflammasome genes with inflammatory response over time after endotoxin challenge.

Keywords: Biomarker response; genome-wide association studies; multiple imputation; repeated measures; transcriptome-wide association studies.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Cross-Sectional Studies
  • Gene Expression Profiling
  • Genome-Wide Association Study*
  • Phenotype
  • Polymorphism, Single Nucleotide / genetics
  • Transcriptome*