tRNADB-CE: tRNA gene database well-timed in the era of big sequence data
- PMID: 24822057
- PMCID: PMC4013482
- DOI: 10.3389/fgene.2014.00114
tRNADB-CE: tRNA gene database well-timed in the era of big sequence data
Abstract
The tRNA gene data base curated by experts "tRNADB-CE" (http://trna.ie.niigata-u.ac.jp) was constructed by analyzing 1,966 complete and 5,272 draft genomes of prokaryotes, 171 viruses', 121 chloroplasts', and 12 eukaryotes' genomes plus fragment sequences obtained by metagenome studies of environmental samples. 595,115 tRNA genes in total, and thus two times of genes compiled previously, have been registered, for which sequence, clover-leaf structure, and results of sequence-similarity and oligonucleotide-pattern searches can be browsed. To provide collective knowledge with help from experts in tRNA researches, we added a column for enregistering comments to each tRNA. By grouping bacterial tRNAs with an identical sequence, we have found high phylogenetic preservation of tRNA sequences, especially at the phylum level. Since many species-unknown tRNAs from metagenomic sequences have sequences identical to those found in species-known prokaryotes, the identical sequence group (ISG) can provide phylogenetic markers to investigate the microbial community in an environmental ecosystem. This strategy can be applied to a huge amount of short sequences obtained from next-generation sequencers, as showing that tRNADB-CE is a well-timed database in the era of big sequence data. It is also discussed that batch-learning self-organizing-map with oligonucleotide composition is useful for efficient knowledge discovery from big sequence data.
Keywords: BLSOM; big data; database; metagenome; phylogenic maker; tRNA.
Figures
Similar articles
-
tRNADB-CE 2011: tRNA gene database curated manually by experts.Nucleic Acids Res. 2011 Jan;39(Database issue):D210-3. doi: 10.1093/nar/gkq1007. Epub 2010 Nov 11. Nucleic Acids Res. 2011. PMID: 21071414 Free PMC article.
-
tRNADB-CE: tRNA gene database curated manually by experts.Nucleic Acids Res. 2009 Jan;37(Database issue):D163-8. doi: 10.1093/nar/gkn692. Epub 2008 Oct 8. Nucleic Acids Res. 2009. PMID: 18842632 Free PMC article.
-
An artificial intelligence approach fit for tRNA gene studies in the era of big sequence data.Genes Genet Syst. 2017 Sep 12;92(1):43-54. doi: 10.1266/ggs.16-00068. Epub 2017 Mar 24. Genes Genet Syst. 2017. PMID: 28344190
-
A Novel Bioinformatics Strategy to Analyze Microbial Big Sequence Data for Efficient Knowledge Discovery: Batch-Learning Self-Organizing Map (BLSOM).Microorganisms. 2013 Nov 20;1(1):137-157. doi: 10.3390/microorganisms1010137. Microorganisms. 2013. PMID: 27694768 Free PMC article. Review.
-
AI for the collective analysis of a massive number of genome sequences: various examples from the small genome of pandemic SARS-CoV-2 to the human genome.Genes Genet Syst. 2021 Dec 16;96(4):165-176. doi: 10.1266/ggs.21-00025. Epub 2021 Sep 27. Genes Genet Syst. 2021. PMID: 34565757 Review.
Cited by
-
Inferring targeting modes of Argonaute-loaded tRNA fragments.RNA Biol. 2020 Aug;17(8):1070-1080. doi: 10.1080/15476286.2019.1676633. Epub 2019 Oct 15. RNA Biol. 2020. PMID: 31613177 Free PMC article.
-
Global In-Silico Scenario of tRNA Genes and Their Organization in Virus Genomes.Viruses. 2019 Feb 21;11(2):180. doi: 10.3390/v11020180. Viruses. 2019. PMID: 30795514 Free PMC article.
-
FAST: FAST Analysis of Sequences Toolbox.Front Genet. 2015 May 19;6:172. doi: 10.3389/fgene.2015.00172. eCollection 2015. Front Genet. 2015. PMID: 26042145 Free PMC article.
-
Codon Adaptation of Plastid Genes.PLoS One. 2016 May 19;11(5):e0154306. doi: 10.1371/journal.pone.0154306. eCollection 2016. PLoS One. 2016. PMID: 27196606 Free PMC article.
-
Initiator tRNA genes template the 3' CCA end at high frequencies in bacteria.BMC Genomics. 2016 Dec 8;17(1):1003. doi: 10.1186/s12864-016-3314-x. BMC Genomics. 2016. PMID: 27927177 Free PMC article.
References
Publication types
LinkOut - more resources
Full Text Sources
Other Literature Sources
