Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2011 Nov;1(6):515-22.
doi: 10.1534/g3.111.000927. Epub 2011 Nov 1.

Genome-Scale Analysis of Programmed DNA Elimination Sites in Tetrahymena thermophila

Genome-Scale Analysis of Programmed DNA Elimination Sites in Tetrahymena thermophila

Joseph N Fass et al. G3 (Bethesda). 2011 Nov.

Abstract

Genetically programmed DNA rearrangements can regulate mRNA expression at an individual locus or, for some organisms, on a genome-wide scale. Ciliates rely on a remarkable process of whole-genome remodeling by DNA elimination to differentiate an expressed macronucleus (MAC) from a copy of the germline micronucleus (MIC) in each cycle of sexual reproduction. Here we describe results from the first high-throughput sequencing effort to investigate ciliate genome restructuring, comparing Sanger long-read sequences from a Tetrahymena thermophila MIC genome library to the MAC genome assembly. With almost 25% coverage of the unique-sequence MAC genome by MIC genome sequence reads, we created a resource for positional analysis of MIC-specific DNA removal that pinpoints MAC genome sites of DNA elimination at nucleotide resolution. The widespread distribution of internal eliminated sequences (IES) in promoter regions and introns suggests that MAC genome restructuring is essential not only for what it removes (for example, active transposons) but also for what it creates (for example, splicing-competent introns). Consistent with the heterogeneous boundaries and epigenetically modulated efficiency of individual IES deletions studied to date, we find that IES sites are dramatically under-represented in the ∼25% of the MAC genome encoding exons. As an exception to this general rule, we discovered a previously unknown class of small (<500 bp) IES with precise elimination boundaries that can contribute the 3' exon of an mRNA expressed during genome restructuring, providing a new mechanism for expanding mRNA complexity in a developmentally regulated manner.

Keywords: DNA breakage and joining; Tetrahymena; ciliate nuclear dualism; community sequencing project; genome rearrangement.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Annotation of MAC positions of IES excision. (A) A representative high-confidence win1 IES site prediction in the MAC genome. The candidate IES site (IES D) falls within an internal exon of the gene model for TTHERM_00198180, which is displayed in its entirety (see scale bar at top). The genome browser track “Putative IES sites using uniquely mapped Sanger reads” indicates positions of the win1, win2, and win3 IES predictions described in the text. The track “Sanger Reads Left (unique)” shows the extent of MAC-matching sequence within a MIC genome read that maps to the MAC with its left end but has a nonmapping sequence on its right end; the read matched a unique MAC sequence, in contrast to reads segregated to the track “Sanger Reads Left (multi).” The track “Sanger Reads Right (unique)” shows the extent of MAC-matching sequence within a MIC genome read that maps to the MAC with its right end but has a nonmapping sequence on its left end; the read matched a unique MAC sequence, in contrast to reads segregated to the track “Sanger Reads Right (multi).” The tracks “Sanger Reads Fully Mapped” show read alignments that matched the MAC without a left-end-only or right-end-only extension of nonmapping sequence. (B) Widespread MAC chromosome distribution of IES sites predicted from Sanger L and/or R read alignments. Note that the conMAC region containing the smallest MAC genome contigs at right is also overrepresented in the 10 kbp spacer blocks of N that were added between contigs; therefore, the bp amount of MAC assembly in this region is exaggerated as a proportion of conMAC length. The upper track shows the entire conMAC assembly of MAC genome contigs joined in order of decreasing length; in the browser, this track is designated as the fakeasome.
Figure 2
Figure 2
IES validation by PCR. Genomic DNA isolated from strain SB210, CU428, or B2086 was amplified by PCR using primers flanking the putative IES site in MAC-destined DNA, as schematized at right. PCR products are visualized here as the negative image of an agarose gel stained with ethidium bromide. The smaller panels below the main panels for IES 2, A, and E show IES-specific PCR amplification using primer(s) that overlap or are internal to the IES, as also schematized. Relevant DNA standards are indicated. Expected MAC genome amplification products are labeled with “formula image”; IES-containing PCR products are labeled with “formula image”; note that IES size could be underestimated for IES C, D, and E if the IES contains internal repeat(s).
Figure 3
Figure 3
IES excision requirements. For each IES, PCR was done with the schematized primer pairs using genomic DNA isolated from the strain SB210 in vegetative growth or from the polyclonal pool of cells arrested 28 hr after initiation of conjugation for gene knockout strains lacking DCL1, PDD1, or LIA1. PCR products are visualized as the negative image of an agarose gel stained with ethidium bromide. DNA standard lanes have the 500 bp marker denoted with “o”; expected MAC genome amplification products are labeled with “formula image”; and IES-containing PCR products are labeled with “formula image” (these labels are included in a subset of adjacent gel lanes for clarity).
Figure 4
Figure 4
Sequence alignment of MAC junctions resulting from removal of IES C. A multiple sequence alignment is shown for cloned MAC junctions resulting from the removal of IES C, which together have three lengths and four distinct sequences. Clones were sequenced for junctions amplified from DNA of the inbred strains B2086, CU428, and SB210 (lines 1–3) or amplified from the DNA of noninbred strain crosses used in Figure 3 (lines 4–5; conj1 and conj2 indicate the two distinct sequences obtained from many cloned DNA fragment sequences). Below the alignment, “formula image” indicates the region of consensus sequence.
Figure 5
Figure 5
IES host gene expression. (A) Total RNA was isolated from strain SB210 in asexual vegetative growth (V) or at the indicated times after initiation of conjugation using strains SB210 and CU428 (in hours). Northern blot hybridization was performed to detect the putative mRNA region adjacent to the indicated IES. Blots are shown in size register; nonspecific signal left from a previous probing of the same membrane is indicated by “NS.” (B) Transcripts from the region of the LIA2 locus hosting IES B were isolated by RT-PCR and sequenced. Boxes denote exons, and thick lines denote introns or untranslated regions. Gray boxes and solid lines indicate MAC-destined sequence, and the open box and dashed line indicate IES. Translation termination codons (TGA) and positions of mRNA polyadenylation (pA) are shown.

Similar articles

Cited by

References

    1. Baudry C., Malinsky S., Restituito M., Kapusta A., Rosa S., et al. , 2009. PiggyMac, a domesticated piggyBac transposase involved in programmed genome rearrangements in the ciliate Paramecium tetraurelia. Genes Dev. 23: 2478–2483 - PMC - PubMed
    1. Calzone F. J., Angerer R. C., Gorovsky M. A., 1983a. Regulation of protein synthesis in Tetrahymena. Quantitative estimates of the parameters determining the rates of protein synthesis in growing, starved, and starved-deciliated cells. J. Biol. Chem. 258: 6887–6898 - PubMed
    1. Calzone F. J., Stathopoulos V. A., Grass D., Gorovsky M. A., Angerer R. C., 1983b. Regulation of protein synthesis in Tetrahymena. RNA sequence sets of growing and starved cells. J. Biol. Chem. 258: 6899–6905 - PubMed
    1. Chalker D. L., Yao M. C., 1996. Non-Mendelian, heritable blocks to DNA rearrangement are induced by loading the somatic nucleus of Tetrahymena thermophila with germ line-limited DNA. Mol. Cell. Biol. 16: 3658–3667 - PMC - PubMed
    1. Chalker D. L., Yao M. C., 2001. Nongenic, bidirectional transcription precedes and may promote developmental DNA deletion in Tetrahymena thermophila. Genes Dev. 15: 1287–1298 - PMC - PubMed