2018
DOI: 10.3390/jimaging4010015
|View full text |Cite
|
Sign up to set email alerts
|

Transcription of Spanish Historical Handwritten Documents with Deep Neural Networks

Abstract: Abstract:The digitization of historical handwritten document images is important for the preservation of cultural heritage. Moreover, the transcription of text images obtained from digitization is necessary to provide efficient information access to the content of these documents. Handwritten Text Recognition (HTR) has become an important research topic in the areas of image and computational language processing that allows us to obtain transcriptions from text images. State-of-the-art HTR systems are, however… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
20
2

Year Published

2018
2018
2023
2023

Publication Types

Select...
4
3
1

Relationship

1
7

Authors

Journals

citations
Cited by 27 publications
(24 citation statements)
references
References 30 publications
0
20
2
Order By: Relevance
“…Details can be seen in the scripts provided for this specific dataset in GitHub. 10 Given the relatively scarce text data available in the training transcripts, a character 7-gram was used in this case.…”
Section: Summary Of Results Obtained With the Icfhr-2014 Datasetmentioning
confidence: 99%
See 2 more Smart Citations
“…Details can be seen in the scripts provided for this specific dataset in GitHub. 10 Given the relatively scarce text data available in the training transcripts, a character 7-gram was used in this case.…”
Section: Summary Of Results Obtained With the Icfhr-2014 Datasetmentioning
confidence: 99%
“…Full architecture and training details are included in the scripts available for each dataset in GitHub. 10 As previously discussed, for a given text line image, a trained CRNN estimates a sequence of character posterior probability vectors (often referred to as "ConfMat"). While raw images can be directly accepted as input, results can often be improved if images are previously deskewed, deslanted, cleaned, contrast-enhanced, and/or size-normalized [15,16,17,18].…”
Section: Convolutional and Recurrent Neural Network Optical Modelingmentioning
confidence: 99%
See 1 more Smart Citation
“…State of the art handwriting systems are built with a Multidimensional Long Short-Term Memory (MDLSTM) network [3] or even with a Convolutional Recurrent Neural Network (CRNN) stacked with a Bidirectional Long Short-Term Memory (BLSTM) [4]. The neural network training includes a Connectionist Temporal Classification (CTC) cost function proposed by [5].…”
Section: Introductionmentioning
confidence: 99%
“…Several binarization and text-line segmentation approaches are also benchmarked on these specific documents. The work by Granell et al [9] describes an efficient text-line recognition system, based on CNN and stacks of RNNs, that has been developed for the recognition of historical Spanish documents. These documents include out-of-vocabulary ancient words which are handled by a language model based on sub-lexical units.…”
mentioning
confidence: 99%