GinJinn: An object-detection pipeline for automated feature extraction from herbarium specimens
- PMID: 32626606
- PMCID: PMC7328649
- DOI: 10.1002/aps3.11351
GinJinn: An object-detection pipeline for automated feature extraction from herbarium specimens
Abstract
Premise: The generation of morphological data in evolutionary, taxonomic, and ecological studies of plants using herbarium material has traditionally been a labor-intensive task. Recent progress in machine learning using deep artificial neural networks (deep learning) for image classification and object detection has facilitated the establishment of a pipeline for the automatic recognition and extraction of relevant structures in images of herbarium specimens.
Methods and results: We implemented an extendable pipeline based on state-of-the-art deep-learning object-detection methods to collect leaf images from herbarium specimens of two species of the genus Leucanthemum. Using 183 specimens as the training data set, our pipeline extracted one or more intact leaves in 95% of the 61 test images.
Conclusions: We establish GinJinn as a deep-learning object-detection tool for the automatic recognition and extraction of individual leaves or other structures from herbarium specimens. Our pipeline offers greater flexibility and a lower entrance barrier than previous image-processing approaches based on hand-crafted features.
Keywords: TensorFlow; deep learning; herbarium specimens; object detection; visual recognition.
© 2020 Ott et al. Applications in Plant Sciences published by Wiley Periodicals LLC on behalf of Botanical Society of America.
Figures
Similar articles
-
Automated Extraction of Phenotypic Leaf Traits of Individual Intact Herbarium Leaves from Herbarium Specimen Images Using Deep Learning Based Semantic Segmentation.Sensors (Basel). 2021 Jul 2;21(13):4549. doi: 10.3390/s21134549. Sensors (Basel). 2021. PMID: 34283110 Free PMC article.
-
Analyzing trait-climate relationships within and among taxa using machine learning and herbarium specimens.Am J Bot. 2023 May;110(5):e16167. doi: 10.1002/ajb2.16167. Epub 2023 May 21. Am J Bot. 2023. PMID: 37043678
-
Generating segmentation masks of herbarium specimens and a data set for training segmentation models using deep learning.Appl Plant Sci. 2020 Jul 1;8(6):e11352. doi: 10.1002/aps3.11352. eCollection 2020 Jun. Appl Plant Sci. 2020. PMID: 32626607 Free PMC article.
-
Deep learning in digital pathology image analysis: a survey.Front Med. 2020 Aug;14(4):470-487. doi: 10.1007/s11684-020-0782-9. Epub 2020 Jul 29. Front Med. 2020. PMID: 32728875 Review.
-
Medical Image Analysis using Convolutional Neural Networks: A Review.J Med Syst. 2018 Oct 8;42(11):226. doi: 10.1007/s10916-018-1088-1. J Med Syst. 2018. PMID: 30298337 Review.
Cited by
-
From leaves to labels: Building modular machine learning networks for rapid herbarium specimen analysis with LeafMachine2.Appl Plant Sci. 2023 Oct 16;11(5):e11548. doi: 10.1002/aps3.11548. eCollection 2023 Sep-Oct. Appl Plant Sci. 2023. PMID: 37915430 Free PMC article.
-
Identification of herbarium specimen sheet components from high-resolution images using deep learning.Ecol Evol. 2023 Aug 14;13(8):e10395. doi: 10.1002/ece3.10395. eCollection 2023 Aug. Ecol Evol. 2023. PMID: 37589042 Free PMC article.
-
A General-Purpose Machine Learning R Library for Sparse Kernels Methods With an Application for Genome-Based Prediction.Front Genet. 2022 Jun 3;13:887643. doi: 10.3389/fgene.2022.887643. eCollection 2022. Front Genet. 2022. PMID: 35719365 Free PMC article.
-
Automated Extraction of Phenotypic Leaf Traits of Individual Intact Herbarium Leaves from Herbarium Specimen Images Using Deep Learning Based Semantic Segmentation.Sensors (Basel). 2021 Jul 2;21(13):4549. doi: 10.3390/s21134549. Sensors (Basel). 2021. PMID: 34283110 Free PMC article.
-
Is color data from citizen science photographs reliable for biodiversity research?Ecol Evol. 2021 Mar 30;11(9):4071-4083. doi: 10.1002/ece3.7307. eCollection 2021 May. Ecol Evol. 2021. PMID: 33976795 Free PMC article.
References
-
- Abadi, M. , Agarwal A., Barham P., Brevdo E., Chen Z., Citro C., Corrado G. S., et al. 2016. TensorFlow: Large‐scale machine learning on heterogeneous distributed systems. arXiv 1603.04467 [cs] [Preprint]. Published 14 March 2016 [accessed 6 May 2020]. Available at: https://arxiv.org/abs/1603.04467.
-
- Abhishek, D. , Zisserman, A. 2019. The VIA annotation software for images, audio and video In Proceedings of the 27th ACM International Conference on Multimedia (MM ’19), October 21–25, 2019, Nice, France. New York: ACM.
-
- Bonhomme, V. , Picq S., and Gaucherel C.. 2014. Momocs: Outline analysis using R. Journal of Statistical Software 56(13): 1–24.
LinkOut - more resources
Full Text Sources
