Artificial neural networks for document analysis and recognition

Simone Marinai; Marco Gori; Giovanni Soda; Computer Society

doi:10.1109/TPAMI.2005.4

Artificial neural networks for document analysis and recognition

IEEE Trans Pattern Anal Mach Intell. 2005 Jan;27(1):23-35. doi: 10.1109/TPAMI.2005.4.

Authors

Simone Marinai¹, Marco Gori, Giovanni Soda, Computer Society

Affiliation

¹ Dipartimento di Sistenmi e Informatica, Università di Firenze, via di S. Marta, 3, 50139 Firenze, Italy. marinai@dsi.unifi.it

PMID: 15628266
DOI: 10.1109/TPAMI.2005.4

Abstract

Artificial neural networks have been extensively applied to document analysis and recognition. Most efforts have been devoted to the recognition of isolated handwritten and printed characters with widely recognized successful results. However, many other document processing tasks, like preprocessing, layout analysis, character segmentation, word recognition, and signature verification, have been effectively faced with very promising results. This paper surveys the most significant problems in the area of offline document image processing, where connectionist-based approaches have been applied. Similarities and differences between approaches belonging to different categories are discussed. A particular emphasis is given on the crucial role of prior knowledge for the conception of both appropriate architectures and learning algorithms. Finally, the paper provides a critical analysis on the reviewed approaches and depicts the most promising research guidelines in the field. In particular, a second generation of connectionist-based models are foreseen which are based on appropriate graphical representations of the learning environment.

Publication types

Comparative Study
Evaluation Study
Validation Study

MeSH terms

Algorithms*
Artificial Intelligence
Computer Graphics
Documentation
Electronic Data Processing / methods*
Handwriting*
Image Enhancement / methods
Image Interpretation, Computer-Assisted / methods*
Information Storage and Retrieval / methods*
Neural Networks, Computer*
Numerical Analysis, Computer-Assisted
Pattern Recognition, Automated / methods*
Reading
Reproducibility of Results
Sensitivity and Specificity
Signal Processing, Computer-Assisted
User-Computer Interface