Skip to main page content
Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Jul 13;547(7662):173-178.
doi: 10.1038/nature22969. Epub 2017 Jun 28.

Fine-mapping Inflammatory Bowel Disease Loci to Single-Variant Resolution

Affiliations
Free PMC article

Fine-mapping Inflammatory Bowel Disease Loci to Single-Variant Resolution

Hailiang Huang et al. Nature. .
Free PMC article

Abstract

Inflammatory bowel diseases are chronic gastrointestinal inflammatory disorders that affect millions of people worldwide. Genome-wide association studies have identified 200 inflammatory bowel disease-associated loci, but few have been conclusively resolved to specific functional variants. Here we report fine-mapping of 94 inflammatory bowel disease loci using high-density genotyping in 67,852 individuals. We pinpoint 18 associations to a single causal variant with greater than 95% certainty, and an additional 27 associations to a single variant with greater than 50% certainty. These 45 variants are significantly enriched for protein-coding changes (n = 13), direct disruption of transcription-factor binding sites (n = 3), and tissue-specific epigenetic marks (n = 10), with the last category showing enrichment in specific immune cells among associations stronger in Crohn's disease and in gut mucosa among associations stronger in ulcerative colitis. The results of this study suggest that high-resolution fine-mapping in large samples can convert many discoveries from genome-wide association studies into statistically convincing causal variants, providing a powerful substrate for experimental elucidation of disease mechanisms.

Conflict of interest statement

The authors declare no competing financial interests.

Figures

Extended Data Figure 1
Extended Data Figure 1. Power of the fine-mapping analysis.
Power (y axis) to identify the causal variant in a correlated pair (strength of correlation shown by color) increases with the significance of the association (x axis), and therefore with sample size and effect size. The vertical dashed line shows the genome-wide significance level. To estimate the relationship between the strength of association and our ability to fine-map it, we assumed that the association has only two causal variant candidates, and we defined the signal as successfully fine-mapped if the ratio of Bayes factors between the true causal variant and the non-causal variant is greater than 10 (a 91% posterior, assuming equal priors for the two candidate variants). Using equation (8) in Supplementary Methods, we have logBF=logPr(Y|SNP1)Pr(Y|SNP2)logPr(Y|SNP1,θ1*)Pr(Y|SNP2,θ2*) in which θ* is maximum likelihood estimate of the parameter values. The log-likelihood ratio follows a chi-square distribution: logBF12(χSNP12χSNP22)=12λ(1r2) in which λ is the chi-square statistic of the lead variant and r is the correlation coefficient between the two variants. Because of the additive property of the chi-square distribution, logBF follows a non-central chi-square distribution with 1 degree of freedom and non-centrality parameter λ(1 – r2)/2. Therefore, the power can calculated as the probability that logBF > log(10), given by the cumulative distribution function of the non-central chi-squared distribution.
Extended Data Figure 2
Extended Data Figure 2. Procedures in the fine-mapping analysis.
Details for each stage are described in Methods. The dashed line means the imputation was performed only once after the manual inspection (not iteratively).
Extended Data Figure 3
Extended Data Figure 3. Variance explained.
Variance explained by secondary, tertiary, … variants as a fraction of the primary signal at each locus.
Extended Data Figure 4
Extended Data Figure 4. Functional annotations.
a, Functional annotation for 45 variants having posterior probability > 50%. b, Functional annotation for 116 association signals that are fine-mapped to ≤ 50 variants. Annotations are defined in Methods. We additionally grouped eQTLs into “Immune/Blood” (CD4+, CD8+, CD19+, CD14+ CD15+, platelets) and “Gut” (ileum, transverse colon and rectum). The eQTLs were generated from the ULg dataset using the “frequentist colocalization using conditional P values” approach (Methods).
Extended Data Figure 5
Extended Data Figure 5. Size of credible sets.
Comparison of credible set sizes for primary signals using each of our fine-mapping methods (methods 1, 2 and 3), the combined approach (as adopted in final results) and the approach described in Maller et al. (y axis) and the R2 > 0.6 cut-off (x axis). Fine-mapping maps most signals to smaller numbers of variants.
Extended Data Figure 6
Extended Data Figure 6. Distributions of the allele frequency and the imputation quality.
Panels a-c: distribution of the risk allele frequency for 45 variants having > 50% posterior probability plotted against (a) posterior probability, (b) significance of the association as –log10(P), and (c) odds ratio of the association. Variants are color coded according to their functions. Odds ratio for IBD associations was the larger of odds ratios for CD and UC. Panels d-f: distribution of imputation quality (INFO measure from the IMPUTE2 program) for variants having MAF ≥5% (d), between 5% and 1% (e) and <1% (f).
Extended Data Figure 7
Extended Data Figure 7. Merging and adjudicating signals across methods.
The number of signals for each method is shown in the brackets, and for each method a black bar indicates a signal with p < 1.35 x 10–6, and a grey bar a signal that does not reach that threshold. The colored bar shows the final status of each signal after merging and model selection (Methods). “Low info” corresponds to INFO < 0.8 (the threshold used for signals reported by 1 or 2 methods) and “rare and imputed” to MAF < 0.01 and no genotyped variants in the credible set, regardless of INFO (Methods).
Figure 1
Figure 1. Fine-mapping procedure and output using the SMAD3 region as an example.
a, 1) We merge overlapping signals across methods; 2) select a lead variant (black triangle) and phenotype (color); and 3) choose the best model. Details for each step are available in Methods. b, Example fine-mapping output. This region has been mapped to two independent signals. For each signal, we report the phenotype it is associated with (colored), the variants in the credible set, and their posterior probabilities.
Figure 2
Figure 2. Summary of fine-mapped associations.
a, Independent signals. Sixty-eight loci containing one association and 26 loci containing multiple associations. b, Number of variants in credible sets. 18 associations were fine-mapped to a single variant, and 116 to ≤ 50 variants. c, Distribution of the posterior probability of the variants in credible sets having ≤ 50 variants.
Figure 3
Figure 3. Functional annotation of causal variants.
a, Proportion of credible variants that are protein coding, disrupt/create transcription factor binding motif sites (TFBS) or are synonymous, sorted by posterior probability. b, Epigenetic peaks overlapping credible variants in cell and tissue types from the Roadmap Epigenomics Consortium. Significant enrichment has been marked with asterisks. Proportion of credible variants that overlap (c) core immune peaks for H4K4me1or (d) core gut peaks for H3K27ac (Methods). In panels a, c and d, the vertical dotted lines mark 50% posterior probability and the horizontal dashed lines show the background proportions of each functional category.
Figure 4
Figure 4. Number of credible sets that colocalize eQTLs.
Distributions of the number of colocalizations by chance (violins) and observed number of colocalizations with p-values (dots). Both the background and the observed numbers were calculated using the “Frequentist colocalization using conditional P values” approach (Methods).

Similar articles

  • Characterization of genetic loci that affect susceptibility to inflammatory bowel diseases in African Americans.
    Huang C, Haritunians T, Okou DT, Cutler DJ, Zwick ME, Taylor KD, Datta LW, Maranville JC, Liu Z, Ellis S, Chopra P, Alexander JS, Baldassano RN, Cross RK, Dassopoulos T, Dhere TA, Duerr RH, Hanson JS, Hou JK, Hussain SZ, Isaacs KL, Kachelries KE, Kader H, Kappelman MD, Katz J, Kellermayer R, Kirschner BS, Kuemmerle JF, Kumar A, Kwon JH, Lazarev M, Mannon P, Moulton DE, Osuntokun BO, Patel A, Rioux JD, Rotter JI, Saeed S, Scherl EJ, Silverberg MS, Silverman A, Targan SR, Valentine JF, Wang MH, Simpson CL, Bridges SL, Kimberly RP, Rich SS, Cho JH, Rienzo AD, Kao LWH, McGovern DPB, Brant SR, Kugathasan S. Huang C, et al. Gastroenterology. 2015 Nov;149(6):1575-1586. doi: 10.1053/j.gastro.2015.07.065. Epub 2015 Aug 14. Gastroenterology. 2015. PMID: 26278503 Free PMC article. Clinical Trial.
  • Population-specific genome-wide mapping of expression quantitative trait loci in the colon of Han Chinese.
    Guo CC, Wei N, Liang SH, Wang BL, Sha SM, Wu KC. Guo CC, et al. J Dig Dis. 2016 Sep;17(9):600-609. doi: 10.1111/1751-2980.12399. J Dig Dis. 2016. PMID: 27534592
  • Inherited determinants of Crohn's disease and ulcerative colitis phenotypes: a genetic association study.
    Cleynen I, Boucher G, Jostins L, Schumm LP, Zeissig S, Ahmad T, Andersen V, Andrews JM, Annese V, Brand S, Brant SR, Cho JH, Daly MJ, Dubinsky M, Duerr RH, Ferguson LR, Franke A, Gearry RB, Goyette P, Hakonarson H, Halfvarson J, Hov JR, Huang H, Kennedy NA, Kupcinskas L, Lawrance IC, Lee JC, Satsangi J, Schreiber S, Théâtre E, van der Meulen-de Jong AE, Weersma RK, Wilson DC; International Inflammatory Bowel Disease Genetics Consortium, Parkes M, Vermeire S, Rioux JD, Mansfield J, Silverberg MS, Radford-Smith G, McGovern DP, Barrett JC, Lees CW. Cleynen I, et al. Lancet. 2016 Jan 9;387(10014):156-67. doi: 10.1016/S0140-6736(15)00465-1. Epub 2015 Oct 18. Lancet. 2016. PMID: 26490195 Free PMC article. Clinical Trial.
  • The genetic background of inflammatory bowel disease: from correlation to causality.
    Uniken Venema WT, Voskuil MD, Dijkstra G, Weersma RK, Festen EA. Uniken Venema WT, et al. J Pathol. 2017 Jan;241(2):146-158. doi: 10.1002/path.4817. Epub 2016 Nov 15. J Pathol. 2017. PMID: 27785786 Review.
  • Genetics of inflammatory bowel disease.
    Satsangi J, Parkes M, Jewell DP, Bell JI. Satsangi J, et al. Clin Sci (Lond). 1998 May;94(5):473-8. doi: 10.1042/cs0940473. Clin Sci (Lond). 1998. PMID: 9682668 Review.
See all similar articles

Cited by 84 articles

See all "Cited by" articles

References

    1. Kappelman MD, et al. Direct health care costs of Crohn's disease and ulcerative colitis in US children and adults. Gastroenterology. 2008;135:1907–1913. - PMC - PubMed
    1. Molodecky NA, et al. Increasing incidence and prevalence of the inflammatory bowel diseases with time, based on systematic review. Gastroenterology. 2012;142:46–54.e42. quiz e30. - PubMed
    1. Jostins L, et al. Host–microbe interactions have shaped the genetic architecture of inflammatory bowel disease. Nature. 2012;491:119–124. - PMC - PubMed
    1. Liu JZ, et al. Association analyses identify 38 susceptibility loci for inflammatory bowel disease and highlight shared genetic risk across populations. Nat Genet. 2015 doi: 10.1038/ng.3359. - DOI - PMC - PubMed
    1. van de Bunt M, et al. Evaluating the Performance of Fine-Mapping Strategies at Common Variant GWAS Loci. PLoS Genet. 2015;11:e1005535. - PMC - PubMed

Publication types

MeSH terms

Feedback