2021
DOI: 10.1109/access.2021.3087865
|View full text |Cite
|
Sign up to set email alerts
|

Current Status and Performance Analysis of Table Recognition in Document Images With Deep Neural Networks

Abstract: The first phase of table recognition is to detect the tabular area in a document. Subsequently, the tabular structures are recognized in the second phase in order to extract information from the respective cells. Table detection and structural recognition are pivotal problems in the domain of table understanding. However, table analysis is a perplexing task due to the colossal amount of diversity and asymmetry in tables. Therefore, it is an active area of research in document image analysis. Recent advances in… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
34
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
4
4

Relationship

3
5

Authors

Journals

citations
Cited by 47 publications
(40 citation statements)
references
References 99 publications
1
34
0
Order By: Relevance
“…Earlier, most of the proposed methods either relied on custom heuristics or leveraged the external meta-data information to tackle the problem of table detection [22][23][24][25][26]. Later, researchers exploited statistical learning [27] followed by deep-learning-based approaches to alleviate the generalization capabilities of table detection systems [6][7][8][10][11][12][28][29][30][31][32]. This section presents a brief overview of some of these approaches.…”
Section: Related Workmentioning
confidence: 99%
“…Earlier, most of the proposed methods either relied on custom heuristics or leveraged the external meta-data information to tackle the problem of table detection [22][23][24][25][26]. Later, researchers exploited statistical learning [27] followed by deep-learning-based approaches to alleviate the generalization capabilities of table detection systems [6][7][8][10][11][12][28][29][30][31][32]. This section presents a brief overview of some of these approaches.…”
Section: Related Workmentioning
confidence: 99%
“…The IoU threshold values for cascaded bounding boxes are set to [0.5, 0.6, 0.7]. We employed three different anchor ratios of [0.5, 1.0, 2.0] with only one anchor scale of [8] since FPN [24] itself performs the multi-scale detection owing to its top-down architecture. We operated with a batch size of one to train our network.…”
Section: Model Configurationmentioning
confidence: 99%
“…Research in document analysis has been trying to develop precise information extraction systems for several years [1][2][3][4]. Although state-of-the-art optical character recognition (OCR) systems [5,6] recognize regular text with high accuracy, they are vulnerable to recognize information from page objects (tables, figures, mathematical formulas) in document images [7,8]. Figure 1 illustrates the problem in which an open-source OCR, Tesseract [4] (we use the LSTMbased version 4.1.1 available at https://github.com/tesseract-ocr/tesseract accessed on 5 July 2021), is applied to extract the content from a document image.…”
Section: Introductionmentioning
confidence: 99%
“…Apart from the text, documents contain graphical page objects, such as tables, figures, and formulas [ 1 , 2 ]. Albeit modern Optical Character Recognition (OCR) systems [ 3 , 4 , 5 ] can extract the information from scanned documents, they fail to interpret information from graphical page objects [ 6 , 7 , 8 , 9 ]. Figure 1 exhibits the problem of extracting tabular information from a document by applying open-source Tesseract OCR [ 10 ].…”
Section: Introductionmentioning
confidence: 99%
“…The problem of accurate table detection in document images is still an open problem in the research community [ 8 , 11 , 12 , 13 , 14 ]. The high amount of intra-class variance (arbitraryayouts of tables, varying presence of rulingines) andow amount of inter-class variance (figures, charts, and algorithms equipped with horizontal and verticalines thatookike tables) makes the task of classifying andocalizing tables in document images even more challenging.…”
Section: Introductionmentioning
confidence: 99%