2023
DOI: 10.1007/s10032-023-00437-8
|View full text |Cite
|
Sign up to set email alerts
|

Historical document image analysis using controlled data for pre-training

Abstract: Using neural networks for semantic labeling has become a dominant technique for layout analysis of historical document images. However, to train or fine-tune appropriate models, large labeled datasets are needed. This paper addresses the case when only limited labeled data are available and promotes a novel approach using so-called controlled data to pre-train the networks. Two different strategies are proposed: The first addresses the real labeling task by using artificial data; the second uses real data to p… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

1
0

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 30 publications
0
1
0
Order By: Relevance
“…In related fields, several studies have been conducted to investigate the impact of ground truth quality on deep learning, for example in the context of object detection [2,13], text-line segmentation [3,22], and semantic segmentation [20,25] in natural images or historical document images. However, the problems encountered for HTR are specific and to the best of our knowledge, there are currently no comprehensive studies on the impact of ground-truth quality for deep learning-based HTR.…”
Section: Introductionmentioning
confidence: 99%
“…In related fields, several studies have been conducted to investigate the impact of ground truth quality on deep learning, for example in the context of object detection [2,13], text-line segmentation [3,22], and semantic segmentation [20,25] in natural images or historical document images. However, the problems encountered for HTR are specific and to the best of our knowledge, there are currently no comprehensive studies on the impact of ground-truth quality for deep learning-based HTR.…”
Section: Introductionmentioning
confidence: 99%