The taxonomic name resolution service: an online tool for automated standardization of plant names
- PMID: 23324024
- PMCID: PMC3554605
- DOI: 10.1186/1471-2105-14-16
The taxonomic name resolution service: an online tool for automated standardization of plant names
Abstract
Background: The digitization of biodiversity data is leading to the widespread application of taxon names that are superfluous, ambiguous or incorrect, resulting in mismatched records and inflated species numbers. The ultimate consequences of misspelled names and bad taxonomy are erroneous scientific conclusions and faulty policy decisions. The lack of tools for correcting this 'names problem' has become a fundamental obstacle to integrating disparate data sources and advancing the progress of biodiversity science.
Results: The TNRS, or Taxonomic Name Resolution Service, is an online application for automated and user-supervised standardization of plant scientific names. The TNRS builds upon and extends existing open-source applications for name parsing and fuzzy matching. Names are standardized against multiple reference taxonomies, including the Missouri Botanical Garden's Tropicos database. Capable of processing thousands of names in a single operation, the TNRS parses and corrects misspelled names and authorities, standardizes variant spellings, and converts nomenclatural synonyms to accepted names. Family names can be included to increase match accuracy and resolve many types of homonyms. Partial matching of higher taxa combined with extraction of annotations, accession numbers and morphospecies allows the TNRS to standardize taxonomy across a broad range of active and legacy datasets.
Conclusions: We show how the TNRS can resolve many forms of taxonomic semantic heterogeneity, correct spelling errors and eliminate spurious names. As a result, the TNRS can aid the integration of disparate biological datasets. Although the TNRS was developed to aid in standardizing plant names, its underlying algorithms and design can be extended to all organisms and nomenclatural codes. The TNRS is accessible via a web interface at http://tnrs.iplantcollaborative.org/ and as a RESTful web service and application programming interface. Source code is available at https://github.com/iPlantCollaborativeOpenSource/TNRS/.
Figures
Similar articles
-
Solr-Plant: efficient extraction of plant names from text.BMC Bioinformatics. 2019 May 22;20(1):263. doi: 10.1186/s12859-019-2874-6. BMC Bioinformatics. 2019. PMID: 31117932 Free PMC article.
-
"gnparser": a powerful parser for scientific names based on Parsing Expression Grammar.BMC Bioinformatics. 2017 May 26;18(1):279. doi: 10.1186/s12859-017-1663-3. BMC Bioinformatics. 2017. PMID: 28549446 Free PMC article.
-
Geographic name resolution service: A tool for the standardization and indexing of world political division names, with applications to species distribution modeling.PLoS One. 2022 Nov 14;17(11):e0268162. doi: 10.1371/journal.pone.0268162. eCollection 2022. PLoS One. 2022. PMID: 36374834 Free PMC article.
-
Does the name really matter? The importance of botanical nomenclature and plant taxonomy in biomedical research.J Ethnopharmacol. 2014 Mar 28;152(3):387-92. doi: 10.1016/j.jep.2013.11.042. Epub 2013 Dec 7. J Ethnopharmacol. 2014. PMID: 24321863 Review.
-
Building essential biodiversity variables (EBVs) of species distribution and abundance at a global scale.Biol Rev Camb Philos Soc. 2018 Feb;93(1):600-625. doi: 10.1111/brv.12359. Epub 2017 Aug 2. Biol Rev Camb Philos Soc. 2018. PMID: 28766908 Review.
Cited by
-
Treemendous: an R package for integrating taxonomic information across backbones.PeerJ. 2024 Feb 28;12:e16896. doi: 10.7717/peerj.16896. eCollection 2024. PeerJ. 2024. PMID: 38436026 Free PMC article.
-
Plant trait and vegetation data along a 1314 m elevation gradient with fire history in Puna grasslands, Perú.Sci Data. 2024 Feb 21;11(1):225. doi: 10.1038/s41597-024-02980-3. Sci Data. 2024. PMID: 38383609 Free PMC article.
-
Consistent patterns of common species across tropical tree communities.Nature. 2024 Jan;625(7996):728-734. doi: 10.1038/s41586-023-06820-z. Epub 2024 Jan 10. Nature. 2024. PMID: 38200314 Free PMC article.
-
Integrated global assessment of the natural forest carbon potential.Nature. 2023 Dec;624(7990):92-101. doi: 10.1038/s41586-023-06723-z. Epub 2023 Nov 13. Nature. 2023. PMID: 37957399 Free PMC article.
-
Climate change and land use threaten global hotspots of phylogenetic endemism for trees.Nat Commun. 2023 Oct 31;14(1):6950. doi: 10.1038/s41467-023-42671-y. Nat Commun. 2023. PMID: 37907453 Free PMC article.
References
-
- Global biodiversity information facility. http://www.gbif.org/
-
- Tropicos. http://www.tropicos.org.
-
- REMIB - Red mundial de informacion sobre biodiversidad. http://www.conabio.gob.mx/remib/doctos/remib_esp.html.
-
- OBIS. http://www.iobis.org/
-
- VertNet. http://vertnet.org/index.php.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
Research Materials
