2021
DOI: 10.1109/access.2021.3082689
|View full text |Cite
|
Sign up to set email alerts
|

Boosting Offline Handwritten Text Recognition in Historical Documents With Few Labeled Lines

Abstract: In this paper we address the problem of offline handwritten text recognition (HTR) in historical documents when few labeled samples are available and some of them contain errors in the train set. Our three main contributions are: first, we analyze how to perform transfer learning (TL) from a massive database to a smaller historical database, analyzing which layers of the model need fine-tuning. Second, we analyze methods to efficiently combine TL and data augmentation (DA). Finally, we propose an algorithm to … Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
9
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 18 publications
(9 citation statements)
references
References 42 publications
0
9
0
Order By: Relevance
“…For real HTR tasks, it is not essential to track the case of characters. We compare our approach on a few labeled datasets (Konzil, Schiller, Ricordi, Patzig, Schwerin) with results from paper [11]. Results are presented in Tab.…”
Section: Resultsmentioning
confidence: 99%
See 4 more Smart Citations
“…For real HTR tasks, it is not essential to track the case of characters. We compare our approach on a few labeled datasets (Konzil, Schiller, Ricordi, Patzig, Schwerin) with results from paper [11]. Results are presented in Tab.…”
Section: Resultsmentioning
confidence: 99%
“…6. For our model, we took the best results from Table 4, and the best results from Table 1 in paper [11]. Our model achieves better CER for 3 of 5 datasets.…”
Section: Resultsmentioning
confidence: 99%
See 3 more Smart Citations