Automatic pollen images recognition is crucial for pollinosis symptoms prevention and treatment. The problem of pollen recognition can be efficiently solved using deep learning, however neural networks require tens of thousands of images to generalize. At the same time, the existing open pollen images datasets are very small. In this paper, we present a novel open pollen dataset annotated for both detection and classification tasks. Based on our dataset we study learning from a small data using different state-of-the-art approaches. For the detection task we propose to use our new Bayesian RetinaNet network, which models aleatoric uncertainty. We compare it with the baseline RetinaNet and demonstrate that our model allows for higher detection precision. For the classification task we compare the impact of pre-training on the synthetic images from generative adversarial networks (GANs) and metric-based few-shot learning. Namely, we pre-trained our small convolutional neural network and Siamese neural network classifiers on synthetic pollen images generated by two GANs: StyleGAN and Self-attention GAN. The best classifier is the convolutional neural network pre-trained on StyleGAN images. Our best models achieved 96.3% of mean average precision for the detection task and 97.7% of F1 measure for the classification task on 13 pollen plant species. We have implemented the best models in our pollen recognition web service, which is available for palynologists on request.
Keywords: Few-shot learning; Generative adversarial network; Image generation; Image recognition; Metric learning; Object detection; Pollen recognition; Pollinosis prevention; Siamese network.
Copyright © 2021. Published by Elsevier Ltd.