Our system is currently under heavy load due to increased usage. We're actively working on upgrades to improve performance. Thank you for your patience.
2021
DOI: 10.1109/access.2021.3096739
|View full text |Cite
|
Sign up to set email alerts
|

Multi-Layout Unstructured Invoice Documents Dataset: A Dataset for Template-Free Invoice Processing and Its Evaluation Using AI Approaches

Abstract: The daily transaction of an organization generates a vast amount of unstructured data such as invoices and purchase orders. Managing and analyzing unstructured data is a costly affair for the organization. Unstructured data has a wealth of hidden valuable information. Extracting such insights automatically from unstructured documents can significantly increase the productivity of an organization. Thus, there is a huge demand to develop a tool that can automate the extraction of key fields from unstructured doc… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6
3

Relationship

1
8

Authors

Journals

citations
Cited by 17 publications
(6 citation statements)
references
References 35 publications
0
5
0
Order By: Relevance
“…Different from the standard Natural Language Processing (NLP) tasks for text recognition from regular documents or paragraphs, it is not feasible to rely on Computer Vision (CV) and NLP alone to recognize text from unstructured documents. In the text recognition from unstructured documents, using image segmentation technology to extract text ignores semantic information, whereas the text in tables with complex structures cannot be effectively recognized with the standard NLP methods (46). In the present study, we successfully developed a deep learning system by integrating a series of methods (including image processing, table detection, text detection text recognition, and text structurization) to structurize text from UPBMR images.…”
Section: Discussionmentioning
confidence: 99%
“…Different from the standard Natural Language Processing (NLP) tasks for text recognition from regular documents or paragraphs, it is not feasible to rely on Computer Vision (CV) and NLP alone to recognize text from unstructured documents. In the text recognition from unstructured documents, using image segmentation technology to extract text ignores semantic information, whereas the text in tables with complex structures cannot be effectively recognized with the standard NLP methods (46). In the present study, we successfully developed a deep learning system by integrating a series of methods (including image processing, table detection, text detection text recognition, and text structurization) to structurize text from UPBMR images.…”
Section: Discussionmentioning
confidence: 99%
“…It makes use of semi-supervised neural networks for scene text recognition which is further optimized. The model was tested against benchmark detection datasets [9] and gave promising accuracy scores, [10] details the difficulty and challenges in developing a custom dataset and finding high quality varied datasets to annotate and use for training and testing, hence it was made the most out of publicly available datasets. The paper also details their efforts into automating invoice processing by using feature extraction but does not consider the tables present in invoices and the items present in the tables which also need to be processed and extracted if there is a need.…”
Section: Related Workmentioning
confidence: 99%
“…However, the use of the YOLO based detection module gave unsatisfactory results when compared to OpenCV based method, with the YOLO algorithm failing to detect many text regions. Considering the information extraction task as an image segmentation problem loses the text semantics, which can further complicate the processing of unstructured documents [10]. The existence of a dataset in order to train the network for segmentation is another problem in the case of training networks for extracting relevant information from structured documents.…”
Section: Related Workmentioning
confidence: 99%
“…Text regions which include key information are identified using machine learning and feature extraction methods such as Word2Vec, Glove and FastText. Besides, bidirectional long-short term memory neural model also utilized to finding key fields of invoice [30]. In another study, Graph based convolutional models that effective and robust in handling complex documents layout are used to extract and process key field information of any layout without ambiguity.…”
Section: Literature Reviewmentioning
confidence: 99%