2018 13th IAPR International Workshop on Document Analysis Systems (DAS) 2018
DOI: 10.1109/das.2018.15
|View full text |Cite
|
Sign up to set email alerts
|

Handwriting Recognition of Historical Documents with Few Labeled Data

Abstract: Historical documents present many challenges for offline handwriting recognition systems, among them, the segmentation and labeling steps. Carefully annotated textlines are needed to train an HTR system. In some scenarios, transcripts are only available at the paragraph level with no text-line information. In this work, we demonstrate how to train an HTR system with few labeled data. Specifically, we train a deep convolutional recurrent neural network (CRNN) system on only 10% of manually labeled text-line dat… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
20
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
5
4
1

Relationship

0
10

Authors

Journals

citations
Cited by 35 publications
(21 citation statements)
references
References 19 publications
0
20
0
Order By: Relevance
“…See all the available details in [26]. The second row corresponds to a system that used 13 CNN layers and 3 layers of BLSTM, along with a word 2-gram for decoding [31]. The results of the here proposed baseline system are also shown in Table 9.…”
Section: Summary Of Results Obtained With the Icdar-2017 Datasetmentioning
confidence: 99%
“…See all the available details in [26]. The second row corresponds to a system that used 13 CNN layers and 3 layers of BLSTM, along with a word 2-gram for decoding [31]. The results of the here proposed baseline system are also shown in Table 9.…”
Section: Summary Of Results Obtained With the Icdar-2017 Datasetmentioning
confidence: 99%
“…Similar to [32], in [36] they also apply some elastic distortions to the original images. In [4] the authors improve the performance by augmenting the training set with specially crafted multiscale data. They also propose a model-based normalization scheme that considers the variability in the writing scale at the recognition phase.…”
Section: Data Augmentationmentioning
confidence: 99%
“…Recently, as part of the ICFHR 2018 READ competition [8], most of the participants proposed an optical model composed of a Convolutional Neural Network (CNN) and Bidirectional Long Short Term Memory (BLSTM) layers as in [15] and [16]. One of them was using Multi-dimensional LSTM (MDLSTM) [17], which has provided good performance when trained on a generic large data set, but has shown difficulty in carrying the writer adaptation process with few samples.…”
Section: A Optical Model Adaptationmentioning
confidence: 99%