The platform will undergo maintenance on Sep 14 at about 7:45 AM EST and will be unavailable for approximately 2 hours.
2021
DOI: 10.3390/app11125344
|View full text |Cite
|
Sign up to set email alerts
|

A Survey of Graphical Page Object Detection with Deep Neural Networks

Abstract: In any document, graphical elements like tables, figures, and formulas contain essential information. The processing and interpretation of such information require specialized algorithms. Off-the-shelf OCR components cannot process this information reliably. Therefore, an essential step in document analysis pipelines is to detect these graphical components. It leads to a high-level conceptual understanding of the documents that make the digitization of documents viable. Since the advent of deep learning, deep … Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
23
0
2

Year Published

2021
2021
2023
2023

Publication Types

Select...
6
2

Relationship

3
5

Authors

Journals

citations
Cited by 30 publications
(29 citation statements)
references
References 60 publications
0
23
0
2
Order By: Relevance
“…Manual investigation of cross-datasets evaluation yields the misinterpretation of other graphical page objects [ 2 ] with tables. However, with the obtained results, it is evident that our proposed CasTabDetectoRS produces state-of-the-art results on a specific dataset and generalizes well over the other datasets.…”
Section: Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…Manual investigation of cross-datasets evaluation yields the misinterpretation of other graphical page objects [ 2 ] with tables. However, with the obtained results, it is evident that our proposed CasTabDetectoRS produces state-of-the-art results on a specific dataset and generalizes well over the other datasets.…”
Section: Resultsmentioning
confidence: 99%
“…Analogous to the field of computer vision, the power of deepearning has made a remarkable impact in the field of table analysis in document images [ 2 , 8 ]. To the best of our knowledge, Hao et al [ 46 ] introduced the idea of implementing Convolutional Neural Network (CNN) to identify spatial features from document images.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Subsequently, the method distinguished formulas from other graphical page objects such as figures and tables by applying custom heuristics. Succeedingly, researchers have investigated the capabilities of Deep Neural Networks (DNNs) for the problem of formula identification in document images [27,35]. To the best of our knowledge, He et al [36] exploited Convolutional Neural Networks (CNNs) with spatial context to detect mathematical symbols in document images.…”
Section: Related Workmentioning
confidence: 99%
“…Along with the text, the digital documents contain various graphical page objects, such as tables, figures, and formulas [1]. While state-of-the-art OCR (Optical Character Recognition) [2][3][4] systems can process the raw text in document images, they are vulnerable to extracting information from graphical page objects [5].…”
Section: Introductionmentioning
confidence: 99%