Stochastic imputation for integrated transcriptome association analysis of a longitudinally measured trait

Evan L Ray; Jing Qian; Regina Brecha; Muredach P Reilly; Andrea S Foulkes

doi:10.1177/0962280219852720

Stochastic imputation for integrated transcriptome association analysis of a longitudinally measured trait

Stat Methods Med Res. 2020 Apr;29(4):1167-1180. doi: 10.1177/0962280219852720. Epub 2019 Jun 7.

Authors

Evan L Ray¹, Jing Qian², Regina Brecha¹, Muredach P Reilly³, Andrea S Foulkes¹

Affiliations

¹ Department of Mathematics and Statistics, Mount Holyoke College, South Hadley, MA, USA.
² Department of Biostatistics and Epidemiology, School of Public Health and Health Sciences, University of Massachusetts, Amherst, MA, USA.
³ Department of Medicine, Columbia University, NY, USA.

Abstract

The mechanistic pathways linking genetic polymorphisms and complex disease traits remain largely uncharacterized. At the same time, expansive new transcriptome data resources offer unprecedented opportunity to unravel the mechanistic underpinnings of complex disease associations. Two-stage strategies involving conditioning on a single, penalized regression imputation for transcriptome association analysis have been described for cross-sectional traits. In this manuscript, we propose an alternative two-stage approach based on stochastic regression imputation that additionally incorporates error in the predictive model. Application of a bootstrap procedure offers flexibility when a closed form predictive distribution is not available. The two-stage strategy is also generalized to longitudinally measured traits, using a linear mixed effects modeling framework and a composite test statistic to evaluate whether the genetic component of gene-level expression modifies the biomarker trajectory over time. Simulations studies are performed to evaluate relative performance with respect to type-1 error rates, coverage, estimation error, and power under a range of conditions. A case study is presented to investigate the association between whole blood expression for each of five inflammasome genes with inflammatory response over time after endotoxin challenge.

Keywords: Biomarker response; genome-wide association studies; multiple imputation; repeated measures; transcriptome-wide association studies.

Publication types

Research Support, N.I.H., Extramural

MeSH terms

Cross-Sectional Studies
Gene Expression Profiling
Genome-Wide Association Study*
Phenotype
Polymorphism, Single Nucleotide / genetics
Transcriptome*

Abstract

Publication types

MeSH terms

Grants and funding