An Improved Character Recognition Framework for Containers Based on DETR Algorithm
- PMID: 34283160
- PMCID: PMC8272209
- DOI: 10.3390/s21134612
Abstract
An improved DETR (detection with transformers) object detection framework is proposed for accurate detection and recognition of characters on shipping containers. ResNeSt, a backbone network with split attention, extracts features of different dimensions through multi-channel weighted convolution, which increases the backbone's overall feature extraction ability. In addition, multi-scale location encoding is introduced on top of the original sinusoidal position encoding, making the transformer structure more sensitive to input position information. Compared with the original DETR framework, the model detects characters with higher confidence and improves detection accuracy by 2.6%. In a character detection and recognition test on a self-built dataset, overall accuracy reaches 98.6%, which meets the requirements of logistics information acquisition.
Keywords: DETR (detection with transformers); character recognition; multi-scale location coding; split-attention.
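The abstract builds on sinusoidal position encoding, so a small illustration of that baseline may help. The sketch below computes the standard one-dimensional sinusoidal table (sine on even channels, cosine on odd channels) using only the Python standard library. The function names, the `temperature` parameter, and the `encodings_per_scale` helper are assumptions made for illustration; in particular, producing one table per feature-map length is only a guess at what "multi-scale" could look like and is not the authors' published method.

```python
import math


def sinusoidal_position_encoding(num_positions, dim, temperature=10000.0):
    """Return a num_positions x dim table (list of lists) of sinusoidal encodings.

    Channel 2k holds sin(pos / temperature**(2k/dim)) and channel 2k+1 holds
    the matching cosine, following the standard transformer formulation.
    """
    if dim % 2 != 0:
        raise ValueError("dim must be even so sine/cosine channels pair up")
    table = []
    for pos in range(num_positions):
        row = []
        for even_index in range(0, dim, 2):
            angle = pos / (temperature ** (even_index / dim))
            row.append(math.sin(angle))
            row.append(math.cos(angle))
        table.append(row)
    return table


def encodings_per_scale(feature_map_lengths, dim):
    """Hypothetical helper: one encoding table per feature-map length.

    This is an illustrative assumption, not the paper's multi-scale scheme.
    """
    return {
        length: sinusoidal_position_encoding(length, dim)
        for length in feature_map_lengths
    }


if __name__ == "__main__":
    table = sinusoidal_position_encoding(num_positions=4, dim=8)
    print(len(table), len(table[0]))
    scales = encodings_per_scale([4, 2], dim=8)
    print(sorted(scales))
```

Position 0 always encodes to alternating 0 and 1, since sin(0) = 0 and cos(0) = 1, which is a quick sanity check when adapting the sketch.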
Conflict of interest statement
The authors declare no conflict of interest.
