2020
DOI: 10.48550/arxiv.2011.14447
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Intrinsic Decomposition of Document Images In-the-Wild

Sagnik Das,
Hassan Ahmed Sial,
Ke Ma
et al.

Abstract: Automatic document content processing is affected by artifacts caused by the shape of the paper, non-uniform and diverse color of lighting conditions. Fully-supervised methods on real data are impossible due to the large amount of data needed. Hence, the current state of the art deep learning models are trained on fully or partially synthetic images. However, document shadow or shading removal results still suffer because: (a) prior methods rely on uniformity of local color statistics, which limit their applic… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 39 publications
(76 reference statements)
0
2
0
Order By: Relevance
“…However, this kind of approach consists in large deep learning architectures that need many labelled documents to train. Since labelled data is costly to produce and barely available, the generation of "realistic" synthetic documents (Das et al, 2020) to increase the amount of training data is worth exploring. All in all, the automatic extraction of information from images of population documents have shown to speed up the data entry process, although the performance of such techniques is not perfect, so a manual validation is still needed.…”
Section: Steps Of the Automatic Text Recognition Systemmentioning
confidence: 99%
“…However, this kind of approach consists in large deep learning architectures that need many labelled documents to train. Since labelled data is costly to produce and barely available, the generation of "realistic" synthetic documents (Das et al, 2020) to increase the amount of training data is worth exploring. All in all, the automatic extraction of information from images of population documents have shown to speed up the data entry process, although the performance of such techniques is not perfect, so a manual validation is still needed.…”
Section: Steps Of the Automatic Text Recognition Systemmentioning
confidence: 99%
“…Recently, deep learning has been introduced to document image rectification with promising performance as well as a significant reduction in computational cost. In deep learning based methods [13], [14], [15], [16], [17], [18], [19], document image rectification is approached by directly regressing a dense 2D vector field (warping flow) that samples the pixels from the distorted images to the rectified ones. However, these methods still suffer from two non-trivial issues.…”
Section: Introductionmentioning
confidence: 99%