Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010 Sep 21:6:412.
doi: 10.1038/msb.2010.64.

Identifying the genetic determinants of transcription factor activity

Affiliations

Identifying the genetic determinants of transcription factor activity

Eunjee Lee et al. Mol Syst Biol. .

Abstract

Analysis of parallel genotyping and expression profiling data has shown that mRNA expression levels are highly heritable. Currently, only a tiny fraction of this genetic variance can be mechanistically accounted for. The influence of trans-acting polymorphisms on gene expression traits is often mediated by transcription factors (TFs). We present a method that exploits prior knowledge about the in vitro DNA-binding specificity of a TF in order to map the loci ('aQTLs') whose inheritance modulates its protein-level regulatory activity. Genome-wide regression of differential mRNA expression on predicted promoter affinity is used to estimate segregant-specific TF activity, which is subsequently mapped as a quantitative phenotype. In budding yeast, our method identifies six times as many locus-TF associations and more than twice as many trans-acting loci as all existing methods combined. Application to mouse data from an F2 intercross identified an aQTL on chromosome VII modulating the activity of Zscan4 in liver cells. Our method has greatly improved statistical power over existing methods, is mechanism based, strictly causal, computationally efficient, and generally applicable.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no conflict of interest.

Figures

Figure 1
Figure 1
Overview of our transcription-factor-centric approach to detecting trans-acting sequence variation. (A) We construct a matrix containing the promoter-binding affinity for each combination of the upstream non-coding sequence of a particular gene and the position-specific affinity matrix (PSAM) of a particular transcription factor (TF). (B) The promoter-binding affinity matrix is interpreted as a regulatory connectivity matrix and used to infer a matrix containing the regulatory activity of each TF in each segregant. For each segregant independently, multivariate genome-wide linear regression of segregant-specific differential mRNA expression on the matrix of promoter affinity for all TFs is performed. The coefficients from this linear fit represent (differential) protein-level TF activities. (C) For each TF independently, we treat the inferred activity as a quantitative phenotype and use genetic linkage analysis across all segregants to identify loci that genetically modulate TF activity. Whenever TF activity is statistically associated with genotype at a particular genetic marker, this shows as a high log-odds (LOD) score indicating the presence of a TF activity quantitative trait locus, or ‘aQTL’.
Figure 2
Figure 2
Inferred differences in TF activity between the BY and RM parental strains. Shown are the t-values corresponding to the regression coefficients in a multivariate linear model that predicts genome-wide differential mRNA expression from predicted binding affinity of upstream promoter regions.
Figure 3
Figure 3
Overview of the trans-acting genetic modulators of TF activity mapped using our method. All transcription factors that have at least one significant aQTL region at a 5% FDR are shown. Transcription factors are sorted according to the chromosomal position of their maximum LOD score. Putative causal gene assignments are indicated in green (local aQTL: TF encoded by gene in aQTL) or red (protein–protein interaction identified between TF and gene in aQTL).
Figure 4
Figure 4
(A) Inferred activity of Stb5p in parental strains and segregants. The first and second columns show the activity of Stb5 in six replicates of a BY-reference comparison and six replicates of a RM-reference comparison. The third and fourth columns show the activity of Stb5p for segregants that inherited the BY and RM allele, respectively, at the STB5 locus. (B) LOD score profile for the activity of Stb5p. An asterisk denotes the STB5 locus.
Figure 5
Figure 5
(A) Activity of Fkh1p and Fkh2p across all segregants. The activity of Fkh1p is negatively correlated with that of Fkh2p. The yellow dots correspond to segregants carrying the BY allele at the CDC28 locus, the green dots to those carrying the RM allele. (B) Schematic diagram illustrating the antagonistic modulation of Fkh1p and Fkh2p by Cdc28p. Although the transcriptional targets of Fkh1p are more highly expressed in segregants carrying the BY allele at the CDC28 locus, the opposite is true for the targets of Fkh1p.
Figure 6
Figure 6
(A) Inferred activity of Zscan4p across all F2 mouse population. Each column shows the activity of Zscan4 in homozygous C57BL/6J (BB), heterozygous (BD), and homozygous DBA/2J (DD) mice at aQTL positions, respectively. (B) LOD score profile for Zscan4p.

Similar articles

Cited by

References

    1. Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 25: 3389–3402 - PMC - PubMed
    1. Badis G, Berger MF, Philippakis AA, Talukder S, Gehrke AR, Jaeger SA, Chan ET, Metzler G, Vedenko A, Chen X, Kuznetsov H, Wang CF, Coburn D, Newburger DE, Morris Q, Hughes TR, Bulyk ML (2009) Diversity and complexity in DNA recognition by transcription factors. Science 324: 1720–1723 - PMC - PubMed
    1. Benjamini Y, Hochberg Y (1995) Controlling the false discovery rate: a practical and powerful approach to multiple testing. J Roy Stat Soc B (Methodol) 57: 289–300
    1. Biswas S, Storey JD, Akey JM (2008) Mapping gene expression quantitative trait loci by singular value decomposition and independent component analysis. BMC Bioinformatics 9: 244. - PMC - PubMed
    1. Boorsma A, Lu XJ, Zakrzewska A, Klis FM, Bussemaker HJ (2008) Inferring condition-specific modulation of transcription factor activity in yeast through regulon-based analysis of genomewide expression. PLoS ONE 3: e3112. - PMC - PubMed

Publication types

Substances

LinkOut - more resources