Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2012;8(8):e1002908.
doi: 10.1371/journal.pgen.1002908. Epub 2012 Aug 16.

Use of allele-specific FAIRE to determine functional regulatory polymorphism using large-scale genotyping arrays

Affiliations

Use of allele-specific FAIRE to determine functional regulatory polymorphism using large-scale genotyping arrays

Andrew J P Smith et al. PLoS Genet. 2012.

Abstract

Following the widespread use of genome-wide association studies (GWAS), focus is turning towards identification of causal variants rather than simply genetic markers of diseases and traits. As a step towards a high-throughput method to identify genome-wide, non-coding, functional regulatory variants, we describe the technique of allele-specific FAIRE, utilising large-scale genotyping technology (FAIRE-gen) to determine allelic effects on chromatin accessibility and regulatory potential. FAIRE-gen was explored using lymphoblastoid cells and the 50,000 SNP Illumina CVD BeadChip. The technique identified an allele-specific regulatory polymorphism within NR1H3 (coding for LXR-α), rs7120118, coinciding with a previously GWAS-identified SNP for HDL-C levels. This finding was confirmed using FAIRE-gen with the 200,000 SNP Illumina Metabochip and verified with the established method of TaqMan allelic discrimination. Examination of this SNP in two prospective Caucasian cohorts comprising 15,000 individuals confirmed the association with HDL-C levels (combined beta = 0.016; p = 0.0006), and analysis of gene expression identified an allelic association with LXR-α expression in heart tissue. Using increasingly comprehensive genotyping chips and distinct tissues for examination, FAIRE-gen has the potential to aid the identification of many causal SNPs associated with disease from GWAS.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Figure 1
Figure 1. Principle of High-Throughput Analysis of Open Chromatin Using FAIRE-gen.
In this example, two potentially functional GWAS SNPs which are in complete LD are illustrated: SNP 1, a T>C, where the C allele occurs in a region of open chromatin, relative to allele T; and SNP 2, a G>A, where both SNPs occur within open chromatin. Following formaldehyde-fixing and sonication, the T allele from SNP 1 remains tightly bound within the nucleosome. Upon phenol:chloroform extraction, this DNA-bound nucleosome transfers to the solvent layer, whilst the C allele within open chromatin remains in the aqueous layer and is purified. Upon genotyping with a gene chip, the C allele is enriched compared to the T allele. For SNP 2, the polymorphism does not affect chromatin structure; both alleles are equally enriched following FAIRE. This would suggest that SNP 1 was the more likely causal SNP for the GWAS association, conferring a greater allele-specific regulatory potential.
Figure 2
Figure 2. Correlation of FAIRE LogR Ratio with FAIRE-seq Peaks.
The graph shows the correlation of FAIRE-enriched SNP intensity using Illumina CVD BeadChip with FAIRE-seq peak intensity from the GM12878 lymphoblast cell line. The (mean log R ratio from three FAIRE-enriched DNAs) – (mean log R ratio from their respective control DNAs) were compared with known FAIRE-seq intensities (0 = no FAIRE-seq enrichment; 1–200 = lowest level of FAIRE-seq enrichment; 800–1000 = highest FAIRE-seq enrichment). There is a strong correlation between SNP intensity and FAIRE-seq peak intensity (p = 2.34×10−82).
Figure 3
Figure 3. Allele-Specific Open Chromatin Signals from Heterozygous Lymphoblast Cell Lines.
Manhattan plot showing allele-specific signals of open chromatin using the Human CVD beadchip. The BAF of chromosome 11 SNP, rs7120118 (C allele), is significantly enriched following FAIRE-gen, in an examination of 3,129 SNP heterozygous SNPs in 3 lymphoblast cell lines. No other SNP showed significant allele-specific effects for open chromatin.
Figure 4
Figure 4. UCSC Genome Browser Annotation of rs7120118 Locus on Human Mar. 2009 (NCBI37/hg19) Assembly.
The annotated region surrounding rs7120118 (SNP highlighted in red box) reveals the location of a putative enhancer, with typical features including H3K4me3 signatures, DNase I hypersensitivity and FAIRE-seq enrichment in a number of tissues. The SNP lies between two regions of transcription factor binding sites, including a c-Fos/c-Jun (AP-1 heterodimers), p300 (a transcriptional co-activator), YY1, SRF, GATA-2 complex, and a USF-1 binding site.
Figure 5
Figure 5. Replication of Allele-Specific Effect of rs7120118 from 9 Heterozygous IL-1β Stimulated Lymphoblast Cell Lines Using Illumina Metabochip.
The boxplots indicate the effect size of allele-specific differences in open chromatin. Included are the 7 SNPs in high LD. The B allele (rs7120118 C) is enriched in open chromatin, as is the adjacent SNP, rs2279239 with less statistical significance.
Figure 6
Figure 6. Replication of the Allele-Specific Effect of rs7120118 from 9 Heterozygous Lymphoblast Cell Lines Using the TaqMan Platform.
The effect of C allele-enrichment from the Illumina Metabochip is confirmed using an alternative method of allele-quantification from the TaqMan platform.
Figure 7
Figure 7. Effect of rs7120118 on NR1H3 Gene Expression in Tissue.
A) Effect of rs7120118 genotype on NR1H3 gene expression in heart samples (n = 127). The minor allele (C) is associated with increased expression at p = 0.0127. B) Effect of rs7120118 genotype on NR1H3 gene expression in aortic adventitia samples (n = 133). There is a trend towards increased expression of NR1H3 with the C allele (p = 0.154).
Figure 8
Figure 8. Association of rs7120118 with HDL-C in More Than 100,000 Individuals.
Association of rs7120118 with HDL-C levels was examined using a published dataset from a study in more than 100,000 individuals using LocusZoom to plot the SNPs examined and imputed from 1000 Genomes Project dataset. A high level of LD is present within the locus, with at least 29 genes implicated with HDL. rs7120118 is indicated in purple (p = 1.297×10−14), with the SNPs in strongest LD marked in red. Although not statistically the lead SNP at this region, with the additional effects of this SNP on open chromatin, NR1H3 gene expression, and proximity to NR1H3, rs7120118 is represents a good functional candidate.

Similar articles

Cited by

References

    1. Hindorff LA, Sethupathy P, Junkins HA, Ramos EM, Mehta JP, et al. (2009) Potential etiologic and functional implications of genome-wide association loci for human diseases and traits. Proceedings of the National Academy of Sciences of the United States of America 106: 9362–9367. - PMC - PubMed
    1. Dimas AS, Dermitzakis ET (2009) Genetic variation of regulatory systems. Current opinion in genetics & development 19: 586–590. - PubMed
    1. Birney E, Stamatoyannopoulos JA, Dutta A, Guigo R, Gingeras TR, et al. (2007) Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project. Nature 447: 799–816. - PMC - PubMed
    1. Park PJ (2009) ChIP-seq: advantages and challenges of a maturing technology. Nature reviews Genetics 10: 669–680. - PMC - PubMed
    1. Mikkelsen TS, Ku M, Jaffe DB, Issac B, Lieberman E, et al. (2007) Genome-wide maps of chromatin state in pluripotent and lineage-committed cells. Nature 448: 553–560. - PMC - PubMed

Publication types

MeSH terms