2019
DOI: 10.1007/978-3-030-21074-8_15
|View full text |Cite
|
Sign up to set email alerts
|

Deep Reader: Information Extraction from Document Images via Relation Extraction and Natural Language

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
5
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
6

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(6 citation statements)
references
References 18 publications
0
5
0
Order By: Relevance
“…Discriminative approaches for document image cleanup include a CNN-based approach for deblurring [20], a U-net [19] based approach replacing the skip connections between the encoder and decoder blocks with alternating convolutional and recurrent layers for efficient feature extraction [17], a two-stage CNN-based approach where the first stage is to classify the type of deblurring and the second stage to remove it [10], and conditional GANs (cGANs) [25,23], which is a supervised image-to-image translation approach [9]. DE-GAN [23], particularly, is recently proposed based on cGANs with a modified loss function with promising results on binarization, deblurring, and watermark removal in documents.…”
Section: Image Denoising In Documentsmentioning
confidence: 99%
“…Discriminative approaches for document image cleanup include a CNN-based approach for deblurring [20], a U-net [19] based approach replacing the skip connections between the encoder and decoder blocks with alternating convolutional and recurrent layers for efficient feature extraction [17], a two-stage CNN-based approach where the first stage is to classify the type of deblurring and the second stage to remove it [10], and conditional GANs (cGANs) [25,23], which is a supervised image-to-image translation approach [9]. DE-GAN [23], particularly, is recently proposed based on cGANs with a modified loss function with promising results on binarization, deblurring, and watermark removal in documents.…”
Section: Image Denoising In Documentsmentioning
confidence: 99%
“…Robust reading 9 is a common 9 https://rrc.cvc.uab.es/ term under which several approaches are collected. A recent approach in this area is DeepReader (Vishwanath et al, 2018), which is a document understanding approach which seamlessly integrates lowlevel OCR with recognition of higher-level document structure and, to a certain extent, content. Document Visual Question Answering (Mathew et al, 2020), on the other hand, analyses scanned documents beyond mere OCR of text content, including manually applied highlighting, for answering questions about the documents' content.…”
Section: Related Workmentioning
confidence: 99%
“…For instance, text in advertising banners may be interpreted as valuable information. For this reason, a simple OCR detection followed by natural language processing techniques is a suboptimal for WIE (Vishwanath et al, 2018).…”
Section: Introductionmentioning
confidence: 99%