2019 International Conference on Document Analysis and Recognition (ICDAR) 2019
DOI: 10.1109/icdar.2019.00244
|View full text |Cite
|
Sign up to set email alerts
|

ICDAR2019 Competition on Scanned Receipt OCR and Information Extraction

Abstract: Scanned receipts OCR and key information extraction (SROIE) represent the processeses of recognizing text from scanned receipts and extracting key texts from them and save the extracted tests to structured documents. SROIE plays critical roles for many document analysis applications and holds great commercial potentials, but very little research works and advances have been published in this area. In recognition of the technical challenges, importance and huge commercial potentials of SROIE, we organized the I… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
135
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 193 publications
(136 citation statements)
references
References 5 publications
1
135
0
Order By: Relevance
“…Six datasets are used as down-stream tasks. The FUNSD (Jaume et al, 2019), CORD (Park et al, 2019), SROIE (Huang et al, 2019) and Kleister-NDA (Graliński et al, 2020) datasets define entity extraction tasks that aim to extract the value of a set of pre-defined keys, which we formalize as a sequential labeling task. RVL-CDIP (Harley et al, 2015) is for document image classification.…”
Section: Datamentioning
confidence: 99%
“…Six datasets are used as down-stream tasks. The FUNSD (Jaume et al, 2019), CORD (Park et al, 2019), SROIE (Huang et al, 2019) and Kleister-NDA (Graliński et al, 2020) datasets define entity extraction tasks that aim to extract the value of a set of pre-defined keys, which we formalize as a sequential labeling task. RVL-CDIP (Harley et al, 2015) is for document image classification.…”
Section: Datamentioning
confidence: 99%
“…However, registration for such datasets is typically required. For example, ICDAR has such types of datasets, on which many researchers have proposed their work [59], [74].…”
Section: ) Challenges/issues With Existing Datasetsmentioning
confidence: 99%
“…Text extraction is the main stage in automating document image processing [24], [32], [50], [109]. The document images can be compressed or uncompressed, grayscale or color and the text in the images can be editable or non-editable [64], [59], [39].…”
Section: Rq4-ai Approaches Used For Unstructured Document Processingmentioning
confidence: 99%
See 2 more Smart Citations