Multiple SNP Set Analysis for Genome-Wide Association Studies Through Bayesian Latent Variable Selection

Genet Epidemiol. 2015 Dec;39(8):664-77. doi: 10.1002/gepi.21932. Epub 2015 Oct 30.

Abstract

The power of genome-wide association studies (GWAS) for mapping complex traits with single-SNP (single-nucleotide polymorphism) analysis may be undermined by modest SNP effect sizes, unobserved causal SNPs, correlation among adjacent SNPs, and SNP-SNP interactions. Alternative approaches that test the association between a single SNP set and individual phenotypes have shown promise for improving the power of GWAS. We propose a Bayesian latent variable selection (BLVS) method to simultaneously model the joint association mapping between a large number of SNP sets and complex traits. Compared with single SNP set analysis, such joint association mapping not only accounts for the correlation among SNP sets but can also detect causal SNP sets that are marginally uncorrelated with traits. The spike-and-slab prior assigned to the effects of SNP sets can greatly reduce the dimension of effective SNP sets, while speeding up computation. An efficient Markov chain Monte Carlo algorithm is developed. Simulations demonstrate that BLVS outperforms several competing variable selection methods in some important scenarios.
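
To make the spike-and-slab idea concrete, the following is a minimal sketch of a generic spike-and-slab Gibbs sampler for ordinary linear regression. It is not the BLVS algorithm from the paper: the latent-variable structure for SNP sets is omitted, each column of X is assumed to stand in for one SNP-set-level summary, and the function name spike_slab_gibbs, the fixed hyperparameters (pi, tau2, sigma2), and the simulated data in the demo are all illustrative assumptions. The sketch only shows how a point mass at zero (spike) combined with a normal prior (slab) yields posterior inclusion probabilities through Markov chain Monte Carlo.

```python
import numpy as np


def spike_slab_gibbs(X, y, pi=0.1, tau2=1.0, sigma2=1.0,
                     n_iter=600, burn_in=100, seed=0):
    """Generic spike-and-slab Gibbs sampler for y = X @ beta + noise.

    Illustrative prior (an assumption, not the paper's specification):
      beta_j = 0            with probability 1 - pi   (spike)
      beta_j ~ N(0, tau2)   with probability pi       (slab)
    The noise variance sigma2 is treated as known for simplicity.
    Returns the posterior inclusion probability of each column of X.
    """
    rng = np.random.default_rng(seed)
    n, p = X.shape
    beta = np.zeros(p)
    col_ss = np.sum(X ** 2, axis=0)      # x_j' x_j for every column
    resid = y - X @ beta                 # residual under the current beta
    incl_count = np.zeros(p)
    prior_logodds = np.log(pi / (1.0 - pi))

    for it in range(n_iter):
        for j in range(p):
            # Remove column j's current contribution from the fit.
            resid += X[:, j] * beta[j]
            # Conditional posterior of beta_j given that it is included.
            v = 1.0 / (col_ss[j] / sigma2 + 1.0 / tau2)
            m = v * (X[:, j] @ resid) / sigma2
            # Log odds of inclusion, with beta_j integrated out.
            logodds = prior_logodds + 0.5 * np.log(v / tau2) + 0.5 * m * m / v
            prob = 1.0 / (1.0 + np.exp(-logodds))
            included = rng.random() < prob
            beta[j] = rng.normal(m, np.sqrt(v)) if included else 0.0
            # Put the (possibly updated) contribution back.
            resid -= X[:, j] * beta[j]
            if it >= burn_in and included:
                incl_count[j] += 1.0

    return incl_count / (n_iter - burn_in)


if __name__ == "__main__":
    # Simulated toy data: 10 hypothetical predictors, 2 with real effects.
    rng = np.random.default_rng(1)
    X = rng.normal(size=(200, 10))
    beta_true = np.zeros(10)
    beta_true[[0, 3]] = [1.0, -1.0]
    y = X @ beta_true + rng.normal(size=200)
    print(np.round(spike_slab_gibbs(X, y), 2))
```

Because every coefficient is updated conditionally on all the others, a predictor is evaluated against the residual left by the rest of the model rather than against the raw trait, which is the general mechanism by which joint modeling can account for correlated predictors. In a realistic analysis sigma2, pi, and tau2 would be given priors and sampled rather than fixed.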

Keywords: Bayesian variable selection; GWAS; imaging phenotypes; linkage disequilibrium blocks.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Bayes Theorem
  • Gene Frequency / genetics*
  • Genome-Wide Association Study / methods*
  • Humans
  • Linkage Disequilibrium / genetics
  • Markov Chains
  • Models, Genetic
  • Monte Carlo Method
  • Phenotype
  • Polymorphism, Single Nucleotide / genetics*
  • Quantitative Trait, Heritable*
  • Schizophrenia / epidemiology
  • Schizophrenia / genetics*
  • Sweden / epidemiology