Incorporating Functional Genomic Information in Genetic Association Studies Using an Empirical Bayes Approach

Genet Epidemiol. 2016 Apr;40(3):176-87. doi: 10.1002/gepi.21956. Epub 2016 Feb 1.

Abstract

A large amount of functional genetic data is available and can be used to inform fine-mapping association studies, particularly in diseases with well-characterised disease pathways. Single nucleotide polymorphism (SNP) prioritization via Bayes factors is attractive because prior information can inform either the effect size or the prior probability of causal association. This approach requires a prior distribution to be specified for the effect size. If the information needed to estimate a priori the probability density of the effect sizes for causal SNPs in a genomic region is inconsistent or unavailable, then specifying a prior variance for the effect sizes is challenging. We propose both an empirical method to estimate this prior variance and a coherent approach to using SNP-level functional data to inform the prior probability of causal association. Through simulation we show that when SNPs in a fine-mapping study are ranked by our empirical Bayes factor, the rank of the causal SNP is generally as high as or higher than its rank under Bayes factors computed with other plausible values of the prior variance. Importantly, we also show that assigning SNP-specific prior probabilities of association based on expert prior functional knowledge of the disease mechanism can improve causal SNP ranks compared with ranking under identical prior probabilities of association. We demonstrate our methods by applying them to the fine mapping of the CASP8 region of chromosome 2 using genotype data from the Collaborative Oncological Gene-Environment Study (COGS) Consortium. The data we analysed included approximately 46,000 breast cancer cases and 43,000 healthy controls.
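
To make the ranking idea concrete, the following is a minimal sketch and not the authors' implementation. It assumes a Wakefield-style approximate Bayes factor with a normal prior N(0, W) on the effect size, a simple method-of-moments estimate of the prior variance W (swapped in here for illustration; the abstract does not state the estimator used), and a single causal SNP in the region. All function names and the toy numbers are hypothetical.

```python
import math


def approx_bayes_factor(beta_hat, se, prior_var):
    """Approximate Bayes factor (association vs. null) for one SNP.

    Assumes beta_hat ~ N(beta, se^2) and a prior beta ~ N(0, prior_var).
    """
    v = se ** 2
    z2 = (beta_hat / se) ** 2
    shrink = prior_var / (v + prior_var)  # W / (V + W)
    return math.sqrt(1.0 - shrink) * math.exp(0.5 * z2 * shrink)


def empirical_prior_var(beta_hats, ses):
    """Illustrative method-of-moments estimate of the prior variance W.

    Assumption for this sketch only: E[beta_hat^2] = W + se^2, so W is
    estimated by the mean excess of beta_hat^2 over se^2, floored at zero.
    """
    excess = [b * b - s * s for b, s in zip(beta_hats, ses)]
    return max(sum(excess) / len(excess), 0.0)


def posterior_probs(bayes_factors, priors):
    """Posterior probability that each SNP is the (single) causal SNP."""
    weights = [p * bf for p, bf in zip(priors, bayes_factors)]
    total = sum(weights)
    return [w / total for w in weights]


# Toy inputs (made up): per-SNP log odds ratio estimates, standard errors,
# and SNP-specific prior probabilities reflecting functional annotation.
beta_hats = [0.02, 0.11, 0.05]
ses = [0.03, 0.03, 0.03]
priors = [0.2, 0.3, 0.5]

w_hat = empirical_prior_var(beta_hats, ses)
bfs = [approx_bayes_factor(b, s, w_hat) for b, s in zip(beta_hats, ses)]
post = posterior_probs(bfs, priors)

# Rank SNP indices from most to least probable causal variant.
ranking = sorted(range(len(post)), key=lambda i: post[i], reverse=True)
print(ranking, post)
```

Passing identical values in `priors` corresponds to ranking with equal prior probabilities of association, while unequal values let functional knowledge shift the ranking.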

Keywords: Bayes factors; empirical Bayes; fine mapping; functional information; genetic association.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Bayes Theorem*
  • Breast Neoplasms / genetics
  • Case-Control Studies
  • Caspase 8 / genetics
  • Chromosomes, Human, Pair 2 / genetics
  • Female
  • Genetic Association Studies / methods*
  • Genome, Human / genetics*
  • Genomics*
  • Genotype
  • Humans
  • Models, Genetic
  • Polymorphism, Single Nucleotide / genetics
  • Probability

Substances

  • Caspase 8