Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2008 Feb;18(2):233-41.
doi: 10.1101/gr.6905408. Epub 2007 Dec 20.

Genome-wide analysis reveals regulatory role of G4 DNA in gene transcription

Affiliations
Comparative Study

Genome-wide analysis reveals regulatory role of G4 DNA in gene transcription

Zhuo Du et al. Genome Res. 2008 Feb.

Erratum in

  • Genome Res. 2008 Mar;18(3):516

Abstract

G-quadruplex or G4 DNA, a four-stranded DNA structure formed in G-rich sequences, has been hypothesized to be a structural motif involved in gene regulation. In this study, we examined the regulatory role of potential G4 DNA motifs (PG4Ms) located in the putative transcriptional regulatory region (TRR, -500 to +500) of genes across the human genome. We found that PG4Ms in the 500-bp region downstream of the annotated transcription start site (TSS; PG4M(D500)) are associated with gene expression. Generally, PG4M(D500)-positive genes are expressed at higher levels than PG4M(D500)-negative genes, and an increased number of PG4M(D500) provides a cumulative effect. This observation was validated by controlling for attributes, including gene family, function, and promoter similarity. We also observed an asymmetric pattern of PG4M(D500) distribution between strands, whereby the frequency of PG4M(D500) in the coding strand is generally higher than that in the template strand. Further analysis showed that the presence of PG4M(D500) and its strand asymmetry are associated with significant enrichment of RNAP II at the putative TRR. On the basis of these results, we propose a model of G4 DNA-mediated stimulation of transcription with the hypothesis that PG4M(D500) contributes to gene transcription by maintaining the DNA in an open conformation, while the asymmetric distribution of PG4M(D500) considerably reduces the probability of blocking the progression of the RNA polymerase complex on the template strand. Our findings provide a comprehensive view of the regulatory function of G4 DNA in gene transcription.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Relationship between PG4Ms and gene expression level. Changes in the frequency of PG4MU500 (A) and PG4MD500 (B) with the increased gene expression level are plotted. Genes are ranked by expression level, with each point representing the mean expression level and the frequency of PG4Ms calculated for every 100 genes.
Figure 2.
Figure 2.
Influence of PG4MD500 on gene expression for each tissue/cell. Comparison of the gene expression levels between PG4MD500-positive (black squares) and PG4MD500-negative genes (open squares) in each human tissue/cell type. Error bars represent the 95% confidence interval of the mean expression level.
Figure 3.
Figure 3.
Comparison of expression levels between PG4MD500-positive and PG4MD500-negative genes when gene family and function are controlled. Human genes are clustered according to gene family (A) and gene function (B), and mean expression levels are compared between PG4MD500-positive (X-axis) and PG4MD500-negative genes (Y-axis) in each cluster. The angle bisector represents equal expression levels. The squares below the angle bisector indicate that the expression level of PG4MD500-positive genes is higher than that of PG4MD500-negative genes, and vice versa.
Figure 4.
Figure 4.
Strand asymmetry of PG4MD500. (A) Comparison of the frequency between PG4Mcod (gray line) and PG4Mtem (black line) in the10-kb TSS-flanking region of human RefSeq genes (Ref) (n = 13,276) and pseudogenes (Pse) (n = 824). (B) Ratios of genes containing (1) PG4Mcod but not PG4Mtem (PG4Mcod+tem–); (2) PG4Mtem but not PG4Mcod (PG4Mcod–tem+); (3) more PG4Mcod than PG4Mtem (PG4Mcod>tem); and (4) fewer PG4Mcod than PG4Mtem (PG4Mcod<tem) in the D500 regions (black squares). The corresponding values for PG4MU500 are plotted as a control (gray squares).
Figure 5.
Figure 5.
A model of G4 DNA-mediated stimulation of transcription. Double-stranded DNA is denatured locally during transcription, forming a transcription bubble and exposing the template strand for nascent RNA synthesis (A, top). As RNAP II machinery (gray dashed box) moves along the template strand, the bubble moves with it, and the DNA rewinds to form the duplex behind the bubble (A, bottom). For the PG4MD500-positive genes, the transient separation of the duplex DNA during transcription and transcription-derived negative supercoiling considerably increases the opportunities for the formation of G4 DNA. Such high-order structures are extremely stable, and thus, probably block rehybridization with the complementary strand, holding the structure open and thereby enabling a higher rate of transcription (B). For the same reason, the presence of high numbers of PG4MD500 can help maintain the initial region of the transcript unpaired, which could facilitate the reinitiation of transcription (black dashed box) and also contribute to a higher level of transcription (B, bottom). The presence of PG4Ms in the template strand may hinder the progression of RNAP II. However, the intrinsic asymmetric distribution of PG4Ms between strands in the D500 region (PG4MD500cod > PG4MD500tem, represented in gray and black, respectively; see also Fig. 4) achieves a balance and minimizes the negative effects.

Similar articles

Cited by

References

    1. Ambrus A., Chen D., Dai J., Jones R.A., Yang D., Chen D., Dai J., Jones R.A., Yang D., Dai J., Jones R.A., Yang D., Jones R.A., Yang D., Yang D. Solution structure of the biologically relevant G-quadruplex element in the human c-MYC promoter. Implications for G-quadruplex stabilization. Biochemistry. 2005;44:2048–2058. - PubMed
    1. Barski A., Cuddapah S., Cui K., Roh T.Y., Schones D.E., Wang Z., Wei G., Chepelev I., Zhao K., Cuddapah S., Cui K., Roh T.Y., Schones D.E., Wang Z., Wei G., Chepelev I., Zhao K., Cui K., Roh T.Y., Schones D.E., Wang Z., Wei G., Chepelev I., Zhao K., Roh T.Y., Schones D.E., Wang Z., Wei G., Chepelev I., Zhao K., Schones D.E., Wang Z., Wei G., Chepelev I., Zhao K., Wang Z., Wei G., Chepelev I., Zhao K., Wei G., Chepelev I., Zhao K., Chepelev I., Zhao K., Zhao K. High-resolution profiling of histone methylations in the human genome. Cell. 2007;129:823–837. - PubMed
    1. Brodsky A.S., Meyer C.A., Swinburne I.A., Hall G., Keenan B.J., Liu X.S., Fox E.A., Silver P.A., Meyer C.A., Swinburne I.A., Hall G., Keenan B.J., Liu X.S., Fox E.A., Silver P.A., Swinburne I.A., Hall G., Keenan B.J., Liu X.S., Fox E.A., Silver P.A., Hall G., Keenan B.J., Liu X.S., Fox E.A., Silver P.A., Keenan B.J., Liu X.S., Fox E.A., Silver P.A., Liu X.S., Fox E.A., Silver P.A., Fox E.A., Silver P.A., Silver P.A. Genomic mapping of RNA polymerase II reveals sites of co-transcriptional regulation in human cells. Genome Biol. 2005;6:R64. doi: 10.1186/gb-2005-6-8-r64. - DOI - PMC - PubMed
    1. Burge S., Parkinson G.N., Hazel P., Todd A.K., Neidle S., Parkinson G.N., Hazel P., Todd A.K., Neidle S., Hazel P., Todd A.K., Neidle S., Todd A.K., Neidle S., Neidle S. Quadruplex DNA: Sequence, topology and structure. Nucleic Acids Res. 2006;34:5402–5415. doi: 10.1093/nar/gkl655. - DOI - PMC - PubMed
    1. Catasti P., Chen X., Moyzis R.K., Bradbury E.M., Gupta G., Chen X., Moyzis R.K., Bradbury E.M., Gupta G., Moyzis R.K., Bradbury E.M., Gupta G., Bradbury E.M., Gupta G., Gupta G. Structure-function correlations of the insulin-linked polymorphic region. J. Mol. Biol. 1996;264:534–545. - PubMed

Publication types

MeSH terms

Substances

LinkOut - more resources