Refined annotation and assembly of the Tetrahymena thermophila genome sequence through EST analysis, comparative genomic hybridization, and targeted gap closure
- PMID: 19036158
- PMCID: PMC2612030
- DOI: 10.1186/1471-2164-9-562
Refined annotation and assembly of the Tetrahymena thermophila genome sequence through EST analysis, comparative genomic hybridization, and targeted gap closure
Abstract
Background: Tetrahymena thermophila, a widely studied model for cellular and molecular biology, is a binucleated single-celled organism with a germline micronucleus (MIC) and somatic macronucleus (MAC). The recent draft MAC genome assembly revealed low sequence repetitiveness, a result of the epigenetic removal of invasive DNA elements found only in the MIC genome. Such low repetitiveness makes complete closure of the MAC genome a feasible goal, which to achieve would require standard closure methods as well as removal of minor MIC contamination of the MAC genome assembly. Highly accurate preliminary annotation of Tetrahymena's coding potential was hindered by the lack of both comparative genomic sequence information from close relatives and significant amounts of cDNA evidence, thus limiting the value of the genomic information and also leaving unanswered certain questions, such as the frequency of alternative splicing.
Results: We addressed the problem of MIC contamination using comparative genomic hybridization with purified MIC and MAC DNA probes against a whole genome oligonucleotide microarray, allowing the identification of 763 genome scaffolds likely to contain MIC-limited DNA sequences. We also employed standard genome closure methods to essentially finish over 60% of the MAC genome. For the improvement of annotation, we have sequenced and analyzed over 60,000 verified EST reads from a variety of cellular growth and development conditions. Using this EST evidence, a combination of automated and manual reannotation efforts led to updates that affect 16% of the current protein-coding gene models. By comparing EST abundance, many genes showing apparent differential expression between these conditions were identified. Rare instances of alternative splicing and uses of the non-standard amino acid selenocysteine were also identified.
Conclusion: We report here significant progress in genome closure and reannotation of Tetrahymena thermophila. Our experience to date suggests that complete closure of the MAC genome is attainable. Using the new EST evidence, automated and manual curation has resulted in substantial improvements to the over 24,000 gene models, which will be valuable to researchers studying this model organism as well as for comparative genomics purposes.
Figures
Similar articles
-
Macronuclear genome sequence of the ciliate Tetrahymena thermophila, a model eukaryote.PLoS Biol. 2006 Sep;4(9):e286. doi: 10.1371/journal.pbio.0040286. PLoS Biol. 2006. PMID: 16933976 Free PMC article.
-
New contribution to epigenetic studies: Isolation of micronuclei with high purity and DNA integrity in the model ciliated protist, Tetrahymena thermophila.Eur J Protistol. 2021 Aug;80:125804. doi: 10.1016/j.ejop.2021.125804. Epub 2021 May 9. Eur J Protistol. 2021. PMID: 34062315
-
Tetrahymena micronuclear genome mapping. a high-resolution meiotic map of chromosome 1l.Genetics. 2000 Mar;154(3):1141-53. doi: 10.1093/genetics/154.3.1141. Genetics. 2000. PMID: 10757759 Free PMC article.
-
Dynamic nuclear reorganization during genome remodeling of Tetrahymena.Biochim Biophys Acta. 2008 Nov;1783(11):2130-6. doi: 10.1016/j.bbamcr.2008.07.012. Epub 2008 Jul 28. Biochim Biophys Acta. 2008. PMID: 18706458 Free PMC article. Review.
-
Functional genomics: the coming of age for Tetrahymena thermophila.Trends Genet. 2002 Jan;18(1):35-40. doi: 10.1016/s0168-9525(01)02560-4. Trends Genet. 2002. PMID: 11750699 Review.
Cited by
-
Comprehensive analysis reveals dynamic and evolutionary plasticity of Rab GTPases and membrane traffic in Tetrahymena thermophila.PLoS Genet. 2010 Oct 14;6(10):e1001155. doi: 10.1371/journal.pgen.1001155. PLoS Genet. 2010. PMID: 20976245 Free PMC article.
-
Detection of fused genes in eukaryotic genomes using gene deFuser: analysis of the Tetrahymena thermophila genome.BMC Bioinformatics. 2011 Jul 11;12:279. doi: 10.1186/1471-2105-12-279. BMC Bioinformatics. 2011. PMID: 21745395 Free PMC article.
-
Shiga toxin as a bacterial defense against a eukaryotic predator, Tetrahymena thermophila.J Bacteriol. 2009 Aug;191(16):5116-22. doi: 10.1128/JB.00508-09. Epub 2009 Jun 5. J Bacteriol. 2009. PMID: 19502393 Free PMC article.
-
Tetrahymena Glutathione Peroxidase Family: A Comparative Analysis of These Antioxidant Enzymes and Differential Gene Expression to Metals and Oxidizing Agents.Microorganisms. 2020 Jul 5;8(7):1008. doi: 10.3390/microorganisms8071008. Microorganisms. 2020. PMID: 32635666 Free PMC article.
-
Selecting one of several mating types through gene segment joining and deletion in Tetrahymena thermophila.PLoS Biol. 2013;11(3):e1001518. doi: 10.1371/journal.pbio.1001518. Epub 2013 Mar 26. PLoS Biol. 2013. PMID: 23555191 Free PMC article.
References
-
- Asai DJ, Forney JD, Eds . Tetrahymena thermophila. San Diego, CA: Academic Press; 2000.
-
- Collins K, Gorovsky MA. Tetrahymena thermophila. Curr Biol. 2005;15:R317–318. - PubMed
-
- Karrer KM. Tetrahymena genetics: two nuclei are better than one. Methods Cell Biol. 2000;62:127–186. - PubMed
-
- Aury JM, Jaillon O, Duret L, Noel B, Jubin C, Porcel BM, Segurens B, Daubin V, Anthouard V, Aiach N, et al. Global trends of whole-genome duplications revealed by the ciliate Paramecium tetraurelia. Nature. 2006 - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Research Materials
Miscellaneous
