Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Jan 17;11(1):343.
doi: 10.1038/s41467-019-13762-6.

Using regulatory variants to detect gene-gene interactions identifies networks of genes linked to cell immortalisation

Affiliations

Using regulatory variants to detect gene-gene interactions identifies networks of genes linked to cell immortalisation

D Wragg et al. Nat Commun. .

Abstract

The extent to which the impact of regulatory genetic variants may depend on other factors, such as the expression levels of upstream transcription factors, remains poorly understood. Here we report a framework in which regulatory variants are first aggregated into sets, and using these as estimates of the total cis-genetic effects on a gene we model their non-additive interactions with the expression of other genes in the genome. Using 1220 lymphoblastoid cell lines across platforms and independent datasets we identify 74 genes where the impact of their regulatory variant-set is linked to the expression levels of networks of distal genes. We show that these networks are predominantly associated with tumourigenesis pathways, through which immortalised cells are able to rapidly proliferate. We consequently present an approach to define gene interaction networks underlying important cellular pathways such as cell immortalisation.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Fig. 1
Fig. 1. Employing a gene’s cis-eQTL complement to identify genetic interactions.
a SNPs within 1 Mb of a gene’s transcription start and termination sites were used to train an expression level prediction model using PredictDB. Predictions from these models correspond to the additive effect of all cis-regulatory variants for a given gene. b The observed expression of geneA can then be modelled as an interaction (X) between its cis-eQTL (predicted expression) and the observed expression of other genes in the genome (geneB). The error term represents, for example, uncaptured environmental or trans effects linked to variation in the expression of geneA, not capture by these other terms.
Fig. 2
Fig. 2. Model prediction performance and validation.
a R2 of predicted versus observed gene expression levels in the Lothian Birth Cohort (LBC1936) against the rank of the R2 value. Only the 5033 genes that passed the kurtosis threshold are shown. The horizontal line indicates an R2 = 0.1, for which the 1205 genes exceeding this threshold were retained for further analysis. b Validation of the LBC1936 gene prediction models using the 1000 Genomes data. Genes with an R2 > 0.1 across both datasets are plotted, with genes with an R2 > 0.5 in both datasets labelled. Genes annotated twice have more than one probe on the array used in the LBC1936 dataset.
Fig. 3
Fig. 3. Interactions can lead to genotype-dependent and regulatory variant-set-dependent variance in a gene’s expression levels.
Simulated data illustrating that a gene A’s expression is only highest when the cis-eQTL genotype is 1/1 and the expression level of a distal gene (geneB) is low. b Replotting the data in a on different axes illustrates how this interaction is associated with genotype-dependent variance in the expression of geneA, with greater variance in expression between individuals when the eQTL genotype is 1/1. c The interaction leads to genotype-dependent differences in the slope of the relationship between geneA and geneB’s expression levels. df The same principle extended to sets of cis-regulatory variants. P-values derived from Spearman’s rho test.
Fig. 4
Fig. 4. Detecting regulatory variant-set veQTL.
An example regulatory variant-set-based veQTL identified at the SLFN5 locus is shown. a SLFN5 shows a strong correlation between its predicted and observed expression levels in the Lothian Birth Cohort suggesting this gene is under the strong control of nearby regulatory variants. b Plotting the residual after regressing out this cis-eQTL effect against predicted expression levels suggests that this gene also shows set-dependent variance in expression. c Correlating the gene’s predicted expression level to the square of the residuals from b confirms the presence of a veQTL at this locus. df the same as ac but resulting from analysis of the 1000 Genomes data. P-values derived from Spearman’s rho test.
Fig. 5
Fig. 5. Significant regulatory variant-set-dependent interaction between SLFN5 and DANCR.
Example of a significant set-dependent interaction, showing patterns consistent with the simulated data presented in Fig. 3. The expression of SLFN5 is linked to the combination of cis-regulatory variant-set and expression of DANCR. This is shown to be the case for the Lothian Birth Cohort (ad) and is reproduced in the 1000 Genomes data (eh), with both datasets exhibiting statistical significance (FDR < 0.05) and sign concordance for the interaction term coefficient. P and FDR values derived from ANOVA F-test.
Fig. 6
Fig. 6. Binding of key EBV-associated transcription factors at the promoters of genes in the interaction networks.
The median binding levels of four EBV nuclear antigens (EBNA2, EBNA3A, EBNA3C and EBNALP) and five NF-κB subunits (cREL, p50, p52, RelA and RelB) at the promoters of interacting genes (i.e., interacting geneBs. Blue line) as well as genes expressed and tested for an interaction but for which one was not observed (red line). The grey shaded area around the red line represents the 95% confidence interval calculated by sampling 100 times the same number of genes from this background set as were in the foreground set of interacting genes. The processed binding data used in this plot was obtained from Jiang et al..
Fig. 7
Fig. 7. The proportion of variance explained by additive and interaction effects between cis-regulatory variants and the expression of distal genes.
A given interaction explains on average a an additional 0.68% of the variation in the gene’s expression in the Lothian Birth Cohort, and b an additional 2.0% in the GEUVADIS dataset. c The total set of non-redundant interacting genes identified by stepwise regression explained 13.2% of the variation in gene expression in the Lothian Birth Cohort and 17.6% in the GEUVADIS dataset, with the interaction effects alone explaining 2 and 5% of the variation on average respectively (points are coloured by gene name).

Similar articles

Cited by

References

    1. Gusev A, et al. Partitioning heritability of regulatory and cell-type-specific variants across 11 common diseases. Am. J. Hum. Genet. 2014;95:535–552. doi: 10.1016/j.ajhg.2014.10.004. - DOI - PMC - PubMed
    1. Aschard H, et al. Challenges and opportunities in genome-wide environmental interaction (GWEI) studies. Hum. Genet. 2012;131:1591–1613. doi: 10.1007/s00439-012-1192-0. - DOI - PMC - PubMed
    1. Montgomery SB, Dermitzakis ET. From expression QTLs to personalized transcriptomics. Nat. Rev. Genet. 2011;12:277–282. doi: 10.1038/nrg2969. - DOI - PubMed
    1. GTEx Consortium. The Genotype-Tissue Expression (GTEx) project. Nat. Genet. 45, 580–585 (2013). - PMC - PubMed
    1. Lappalainen T, et al. Transcriptome and genome sequencing uncovers functional variation in humans. Nature. 2013;501:506–511. doi: 10.1038/nature12531. - DOI - PMC - PubMed

Publication types