2019 International Conference on Document Analysis and Recognition (ICDAR) 2019
DOI: 10.1109/icdar.2019.00060
|View full text |Cite
|
Sign up to set email alerts
|

Attend, Copy, Parse End-to-end Information Extraction from Documents

Abstract: Document information extraction tasks performed by humans create data consisting of a PDF or document image input, and extracted string outputs. This end-to-end data is naturally consumed and produced when performing the task because it is valuable in and of itself. It is naturally available, at no additional cost. Unfortunately, state-of-the-art word classification methods for information extraction cannot use this data, instead requiring word-level labels which are expensive to create and consequently not av… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
29
0

Year Published

2020
2020
2021
2021

Publication Types

Select...
5
2
2

Relationship

0
9

Authors

Journals

citations
Cited by 48 publications
(30 citation statements)
references
References 33 publications
0
29
0
Order By: Relevance
“…It implies that it is useful to train over more data [10], [2], [19], [11] . 4) BERT: Bidirectional Encoder Representations for Transformers (BERT) [92] , [10] , [2] , [19] , [86], is nowadays the latest word embedding approach, that is effectively used in numerous biomedical and other text mining tasks. BERT learns the text representation from both the directions to better understand the context and the relationship.…”
Section: ) Named Entity Recognition (Ner)mentioning
confidence: 99%
“…It implies that it is useful to train over more data [10], [2], [19], [11] . 4) BERT: Bidirectional Encoder Representations for Transformers (BERT) [92] , [10] , [2] , [19] , [86], is nowadays the latest word embedding approach, that is effectively used in numerous biomedical and other text mining tasks. BERT learns the text representation from both the directions to better understand the context and the relationship.…”
Section: ) Named Entity Recognition (Ner)mentioning
confidence: 99%
“…• Dataset quality [25], [26] Missing data values and few other errors lead to an insignificant extraction of data.…”
Section: Poor Qualitymentioning
confidence: 99%
“…Some convolutional layers are then applied to these models of document to obtain the token representations. In addition to better understanding the document layout, some authors [18,25] also include the pixel values of the document images in the input for capturing clues not conveyed by the text modality such as table ruling lines, logos and stamps.…”
Section: Related Work On Information Extraction (Ie)mentioning
confidence: 99%