In this study, we conduct a comparison of three most recent statistical methods for joint variable selection and covariance estimation with application of detecting expression quantitative trait loci (eQTL) and gene network estimation, and introduce a new hierarchical Bayesian method to be included in the comparison. Unlike the traditional univariate regression approach in eQTL, all four methods correlate phenotypes and genotypes by multivariate regression models that incorporate the dependence information among phenotypes, and use Bayesian multiplicity adjustment to avoid multiple testing burdens raised by traditional multiple testing correction methods. We presented the performance of three methods (MSSL - Multivariate Spike and Slab Lasso, SSUR - Sparse Seemingly Unrelated Bayesian Regression, and OBFBF - Objective Bayes Fractional Bayes Factor), along with the proposed, JDAG (Joint estimation via a Gaussian Directed Acyclic Graph model) method through simulation experiments, and publicly available HapMap real data, taking asthma as an example. Compared with existing methods, JDAG identified networks with higher sensitivity and specificity under row-wise sparse settings. JDAG requires less execution in small-to-moderate dimensions, but is not currently applicable to high dimensional data. The eQTL analysis in asthma data showed a number of known gene regulations such as STARD3, IKZF3 and PGAP3, all reported in asthma studies. The code of the proposed method is freely available at GitHub (https://github.com/xuan-cao/Joint-estimation-for-eQTL).
Keywords: eQTL; join analysis; network modeling; variable selection.
Joint high-dimensional Bayesian variable and covariance selection with an application to eQTL analysis.Biometrics. 2013 Jun;69(2):447-57. doi: 10.1111/biom.12021. Epub 2013 Apr 22. Biometrics. 2013. PMID: 23607608
Exploring Bayesian Approaches to eQTL Mapping Through Probabilistic Programming.Methods Mol Biol. 2020;2082:123-146. doi: 10.1007/978-1-0716-0026-9_9. Methods Mol Biol. 2020. PMID: 31849012
Part 1. Statistical Learning Methods for the Effects of Multiple Air Pollution Constituents.Res Rep Health Eff Inst. 2015 Jun;(183 Pt 1-2):5-50. Res Rep Health Eff Inst. 2015. PMID: 26333238
Expression QTLs Mapping and Analysis: A Bayesian Perspective.Methods Mol Biol. 2017;1488:189-215. doi: 10.1007/978-1-4939-6427-7_8. Methods Mol Biol. 2017. PMID: 27933525 Review.
Bayesian Inference for Mixed Model-Based Genome-Wide Analysis of Expression Quantitative Trait Loci by Gibbs Sampling.Front Genet. 2019 Mar 22;10:199. doi: 10.3389/fgene.2019.00199. eCollection 2019. Front Genet. 2019. PMID: 30967893 Free PMC article. Review.