Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010 Oct 19:11:581.
doi: 10.1186/1471-2164-11-581.

The spatial distribution of cis regulatory elements in yeast promoters and its implications for transcriptional regulation

Affiliations

The spatial distribution of cis regulatory elements in yeast promoters and its implications for transcriptional regulation

Zhenguo Lin et al. BMC Genomics. .

Abstract

Background: How the transcription factor binding sites (TFBSs) are distributed in the promoter region have implications for gene regulation. Previous studies used the translation start codon as the reference point to infer the TFBS distribution. However, it is biologically more relevant to use the transcription start site (TSS) as the reference point. In this study, we reexamined the spatial distribution of TFBSs, investigated various promoter features that may affect the distribution, and studied the effect of TFBS distribution on transcriptional regulation.

Results: We found a sharp peak for the distribution of TFBSs at ~115 bp upstream of the TSS, but no clear peak when the translation start codon was used as the reference point. Our analysis of sequence variation data among 63 yeast strains revealed very low deletion polymorphisms in the region between the distribution peak and the TSS, suggesting that the distances between TFBSs and the TSS have been selectively constrained in evolution. As in previous studies, we found that the nucleosome occupancy and the presence/absence of TATA-box in the promoter region affect the TFBS distribution pattern. In addition, we found that there exists a correlation between the 5'UTR length and the TFBS distribution pattern and we showed that the TFBS distribution pattern affects gene transcription level and plasticity.

Conclusions: The spatial distribution of TFBSs obtained using the TSS as the reference point shows a much sharper peak than does the distribution obtained using the translation start codon as the reference point. The TFBS distribution pattern is affected by nucleosome occupancy and presence of TATA-box and it affects the transcription level and transcription plasticity of the gene.

PubMed Disclaimer

Figures

Figure 1
Figure 1
The distribution of TFBSs in yeast promoters. (A) The frequency distribution of the distances from the TFBSs to the transcription starting site (TSS) in genes. This figure shows the spatial distribution of TFBSs in the promoters of 4369 genes (using TFBS dataset IV). The value at each position relative to the TSS is a moving average of a window of 41 bp. The distribution has a very sharp peak ~115 bp upstream of the TSS and the TFBSs are strongly concentrated in the ~ 100 bp region from 180 to 80 bp upstream of the TSS. The blue solid line represents observed values; The red solid and dotted lines represent the mean of randomization and 95% confidence intervals for 1000 randomized tests (B) No sharp peak was found in the frequency distribution of TFBSs relative to the translation start codon. This figure was generated using the same data as in (A) except that the translation start codon instead of the TSS was used as the reference point. (C) The frequency of deletion polymorphisms in the TSS-proximal region is the lowest in the promoter region and is significant lower than random expectation. This figure was generated using the same data as in (A).
Figure 2
Figure 2
The effects of promoter architecture features on the TFBS distribution. (A) Strong negative correlation between the TFBS frequency and the genome-wide average of nucleosome occupancy in the ~200 bp region upstream of the TSS. The intergenic regions were aligned with reference to TSS. (B) The yeast genes were clustered into four groups by k-means clustering based on a 1201 bp region surrounding the TSS. (C) The frequency distributions of TFBSs relative to the TSS in four clusters of genes with diverse nucleosome occupancy patterns. The figure is presented in the same way as Figure 1A. (D) The TFBS distribution differs between TATA box-containing and TATA box-less genes. The TATA box-containing genes have more broadly distributed TFBSs and higher TFBS frequency in the promoter region. (E) The distributions of TFBSs relative to the TSS in three categories of 5'UTR length: short, medium and long. The long 5'UTR genes have more broadly distributed TFBSs and a higher TFBS frequency in the promoter region.
Figure 3
Figure 3
Intrinsic correlations among nucleosome positioning, presence/absence of TATA box and 5'UTR length. (A) Average 5'UTR lengths among the four gene groups clustered by k-mean clustering based on their nucleosome occupancy patterns. The mean value of each group is indicated by a bar and the error bars indicate one standard error. (B) The nucleosome occupancy in the 1201 bp window surrounding the TSS in the three groups of genes with different 5'UTR lengths. The TSS proximal region of long 5'UTR genes has the highest nucleosome occupancy. (C) The proportion of TATA box-containing genes and TATA box-less genes in each 5'UTR length group of genes. The long 5'UTR genes have the highest proportion of TATA box-containing genes, but the lowest proportion of TATA box-less genes. (D) TATA box-containing genes have slightly longer 5'UTRs than do TATA box-less genes.
Figure 4
Figure 4
The distribution patterns of TFBSs among genes with different expression profiles. (A) High expression plasticity genes have distinct TFBS distribution patterns compared to the low plasticity genes. (B) The density of TFBSs has a positive correlation with gene expression level under rich media.

Similar articles

Cited by

References

    1. Chen CY, Tsai HK, Hsu CM, May Chen MJ, Hung HG, Huang GT, Li WH. Discovering gapped binding sites of yeast transcription factors. Proc Natl Acad Sci USA. 2008;105:2527–2532. doi: 10.1073/pnas.0712188105. - DOI - PMC - PubMed
    1. Hansen L, Marino-Ramirez L, Landsman D. Many sequence-specific chromatin modifying protein-binding motifs show strong positional preferences for potential regulatory regions in the Saccharomyces cerevisiae genome. Nucleic Acids Res. 2010;38:1772–1779. doi: 10.1093/nar/gkp1195. - DOI - PMC - PubMed
    1. Harbison CT, Gordon DB, Lee TI, Rinaldi NJ, Macisaac KD, Danford TW, Hannett NM, Tagne JB, Reynolds DB, Yoo J, Jennings EG, Zeitlinger J, Pokholok DK, Kellis M, Rolfe PA, Takusagawa KT, Lander ES, Gifford DK, Fraenkel E, Young RA. Transcriptional regulatory code of a eukaryotic genome. Nature. 2004;431:99–104. doi: 10.1038/nature02800. - DOI - PMC - PubMed
    1. Hughes JD, Estep PW, Tavazoie S, Church GM. Computational identification of cis-regulatory elements associated with groups of functionally related genes in Saccharomyces cerevisiae. J Mol Biol. 2000;296:1205–1214. doi: 10.1006/jmbi.2000.3519. - DOI - PubMed
    1. Lee TI, Rinaldi NJ, Robert F, Odom DT, Bar-Joseph Z, Gerber GK, Hannett NM, Harbison CT, Thompson CM, Simon I, Zeitlinger J, Jennings EG, Murray HL, Gordon DB, Ren B, Wyrick JJ, Tagne JB, Volkert TL, Fraenkel E, Gifford DK, Young RA. Transcriptional regulatory networks in Saccharomyces cerevisiae. Science. 2002;298:799–804. doi: 10.1126/science.1075090. - DOI - PubMed

Publication types

MeSH terms