2009 10th International Conference on Document Analysis and Recognition
DOI: 10.1109/icdar.2009.138

Improving the Table Boundary Detection in PDFs by Fixing the Sequence Error of the Sparse Lines

Abstract: With the rapid growth of PDF documents, recognizing document structure and components is useful for document storage, classification, and retrieval.

Cited by 27 publications (12 citation statements)
References 4 publications
“…Thus, it can be interesting to recover the text sequence order. Some algorithms based on the concept of sparse line that take into account the presence of columns and figures can be used [41].…”
Section: Structural Methods For Table Localization
Citation type: mentioning
confidence: 99%
“…Moreover, this method has been further enhanced in [21] to deal with multicolumn tables, by modeling text sequences in order to enable a two-phase algorithm performing within-column resorting and cross-column resorting.…”
Section: Related Work
Citation type: mentioning
confidence: 99%
“…This table structure easily causes false positive recognition on multi-column paragraphs. Therefore, instead of text lines or text elements, we use the homogeneous region for the same role as text line analysis or sparse line analysis [20]. In the first step we eliminate …”
Section: Non-ruling Line Table Detection
Citation type: mentioning
confidence: 99%
“…On each G t k the table candidates are analyzed again to confirm whether this is the table region or not, based on the arrangement of text, text lines (following row by row and column by column) and the sparse line analysis [20].…”
Section: Parallel Table
Citation type: mentioning
confidence: 99%