2019 International Conference on Document Analysis and Recognition (ICDAR) 2019
DOI: 10.1109/icdar.2019.00030
|View full text |Cite
|
Sign up to set email alerts
|

Deep Visual Template-Free Form Parsing

Abstract: Automatic, template-free extraction of information from form images is challenging due to the variety of form layouts. This is even more challenging for historical forms due to noise and degradation. A crucial part of the extraction process is associating input text with pre-printed labels. We present a learned, template-free solution to detecting preprinted text and input text/handwriting and predicting pairwise relationships between them. While previous approaches to this problem have been focused on clean i… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
44
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
5
2
2

Relationship

0
9

Authors

Journals

citations
Cited by 33 publications
(44 citation statements)
references
References 21 publications
0
44
0
Order By: Relevance
“…Another challenge for the information extraction techniques is to process and enhance the quality of the scanned documents, as the documents submitted by the client or supplier are generally scanned with a low-quality scanner or mobile devices. Multi-page unstructured documents consisting of tables with data spanning across different pages complicate retrieval of the correct target data from the document [54].…”
Section: ) Data Related Challengesmentioning
confidence: 99%
See 1 more Smart Citation
“…Another challenge for the information extraction techniques is to process and enhance the quality of the scanned documents, as the documents submitted by the client or supplier are generally scanned with a low-quality scanner or mobile devices. Multi-page unstructured documents consisting of tables with data spanning across different pages complicate retrieval of the correct target data from the document [54].…”
Section: ) Data Related Challengesmentioning
confidence: 99%
“…2) Domain specific datasets: Existing publicly available datasets are very task-specific; that is, they are related to the data extraction of the scientific articles or clinical information that is not generalized [93]. In handwritten datasets, various kinds of handwritings are present, even cursive text, making it challenging for the OCR to detect and extract the actual text, leading to less accurate results [54]. In such cases, the advanced OCR techniques are needed.…”
Section: ) Challenges/issues With Existing Datasetsmentioning
confidence: 99%
“…The study [23] presented a template-free form field extraction method on NAF historical handwritten filled form dataset, with a varied layout and noisy form images using Fully Convolutional Network (FCN). FCN is used along with a Heuristic Detector function for detecting the relationship between label-value pairs.…”
Section: Named Entity Recognition (Ner)mentioning
confidence: 99%
“…Katti et al (2018); explore to directly work on 2D document space using grid-like convolutional models to better preserve spatial context during learning, but the performance is restrictive to the resolution of the grids. Recently, Qian et al (2019); Davis et al (2019); Liu et al (2019) propose to represent documents using graphs, where nodes define word tokens and edges describe the spatial patterns of words. show state-of-the-art performance of Graph Convolutional Networks (GCNs) (Duvenaud et al, 2015) on document understanding.…”
Section: Introductionmentioning
confidence: 99%