2021
DOI: 10.1007/978-3-030-86331-9_45
|View full text |Cite
|
Sign up to set email alerts
|

Information Extraction from Invoices

Abstract: The present paper is focused on information extraction from key fields of invoices using two different methods based on sequence labeling. Invoices are semi-structured documents in which data can be located based on the context. Common information extraction systems are model-driven, using heuristics and lists of trigger words curated by domain experts. Their performances are generally high on documents they have been trained for but processing new templates often requires new manual annotations, which is tedi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
8
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
3
2
1

Relationship

0
10

Authors

Journals

citations
Cited by 12 publications
(8 citation statements)
references
References 15 publications
0
8
0
Order By: Relevance
“…Some typical examples [17] relied on Long Short-term Memory (LSTM) neural networks [18,19]. The method proposed in [20] compared two deep learning approaches, the first based on an NER strategy using a context of words around each label, and the second based on a set of features, similar to those of CloudScan [21]. This second method provided similar results with less data for training.…”
Section: Related Workmentioning
confidence: 99%
“…Some typical examples [17] relied on Long Short-term Memory (LSTM) neural networks [18,19]. The method proposed in [20] compared two deep learning approaches, the first based on an NER strategy using a context of words around each label, and the second based on a set of features, similar to those of CloudScan [21]. This second method provided similar results with less data for training.…”
Section: Related Workmentioning
confidence: 99%
“…Experiments by Hamdi et al [32] with invoice information extraction of document type and number, dates, amounts, and currency show that word classification approach (similar to [18]) outperforms sequence labelling methods [31] on most fields.…”
Section: Related Workmentioning
confidence: 99%
“…Next comes the information extraction phase, which entails identifying the various identifiers such as types, amounts, dates, and other crucial details from the invoices. To achieve this, natural language processing (NLP) techniques, such as named entity recognition (NER), are typically employed, which aids in recognizing and extracting specific information from the text [50], [52].…”
Section: Introductionmentioning
confidence: 99%