Int J Syst Evol Microbiol. 2018 Jul;68(7):2386-2392.
doi: 10.1099/ijsem.0.002809. Epub 2018 May 24.

Using Average Nucleotide Identity to Improve Taxonomic Assignments in Prokaryotic Genomes at the NCBI

Free PMC article

Stacy Ciufo et al. Int J Syst Evol Microbiol.

Abstract

Average nucleotide identity analysis is a useful tool to verify taxonomic identities in prokaryotic genomes, for both complete and draft assemblies. Using optimum threshold ranges appropriate for different prokaryotic taxa, we have reviewed all prokaryotic genome assemblies in GenBank with regard to their taxonomic identity. We present the methods used to make such comparisons, the current status of GenBank verifications, and recent developments in confirming species assignments in new genome submissions.

Keywords: GenBank; RefSeq; taxonomy; type strains.
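
The threshold-based check described in the abstract can be illustrated with a minimal sketch. It is not NCBI's pipeline: the names `AniHit`, `check_assignment`, `SPECIES_CUTOFFS` and `DEFAULT_CUTOFF` are hypothetical, and the default cutoff value is an assumption. Only the 93.5 % and 96 % figures appear on this page (Fig. 4 caption, for Lactobacillus gasseri). The idea shown is simply that an assembly's declared species is compared against its best ANI match among type-strain assemblies, using a cutoff that may differ per taxon.

```python
from dataclasses import dataclass

# Assumed default ANI cutoff in percent (illustrative, not taken from the abstract).
DEFAULT_CUTOFF = 96.0

# Hypothetical per-species overrides. The 93.5 value is the looser of the two
# cutoffs mentioned for Lactobacillus gasseri in the Fig. 4 caption.
SPECIES_CUTOFFS = {"Lactobacillus gasseri": 93.5}


@dataclass
class AniHit:
    type_species: str  # species name of the type-strain assembly compared against
    ani: float         # average nucleotide identity to that assembly, in percent


def check_assignment(declared: str, hits: list[AniHit]) -> str:
    """Classify a declared species as 'confirmed', 'mismatch' or 'inconclusive'."""
    # Keep only hits that meet the cutoff for the species they point to.
    passing = [
        h for h in hits
        if h.ani >= SPECIES_CUTOFFS.get(h.type_species, DEFAULT_CUTOFF)
    ]
    if not passing:
        return "inconclusive"  # no type assembly is close enough to decide
    best = max(passing, key=lambda h: h.ani)
    return "confirmed" if best.type_species == declared else "mismatch"
```

With these assumed values, an assembly declared as Lactobacillus gasseri with 94.2 % ANI to the type assembly would be confirmed under the 93.5 % cutoff but would fall below a 96 % cutoff, which is the kind of threshold sensitivity the Fig. 4 caption describes.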

Figures

Fig. 1. ANI process workflow for processing of pre-submission genomes.
Fig. 2. Example of taxonomy correction markup on a GenBank record. This information was added after working closely with the submitter to correct the identification of the genome entry.
Fig. 3. K-mer tree showing the distribution of Shigella genomes among those of E. coli. The bar indicates the percentage nucleotide substitution rate over the length of the genome.
Fig. 4. K-mer tree showing genome variability amongst Lactobacillus gasseri assemblies. The ANI cutoff of 93.5 % includes both groups of assemblies (a and b), whilst a 96 % cutoff will separate them. Type assemblies are highlighted. The bar indicates the percentage nucleotide substitution rate over the length of the genome.
