Handbook of Document Image Processing and Recognition 2014
DOI: 10.1007/978-0-85729-859-1_20
|View full text |Cite
|
Sign up to set email alerts
|

Recognition of Tables and Forms

Abstract: Tables and forms are a very common way to organize information in structured documents. Their recognition is fundamental for the recognition of the documents. Indeed, the physical organization of a table or a form gives a lot of information concerning the logical meaning of the content. This chapter presents the different tasks that are related to the recognition of tables and forms and the associated well-known methods and remaining B. Coüasnon ()

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
23
0

Year Published

2014
2014
2022
2022

Publication Types

Select...
5
1
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 52 publications
(23 citation statements)
references
References 46 publications
0
23
0
Order By: Relevance
“…Hu et al [35] proposed a table detection method relying on the correlation of white spaces and vertical connected component analysis. For the comprehensive summarization of these rule-based approaches, readers may refer to [13,[36][37][38][39]. Although these rule-based methods work well on documents with similar tabular layouts, they are laborious in terms of finding optimal heuristics.…”
Section: Rule-based Approachesmentioning
confidence: 99%
See 1 more Smart Citation
“…Hu et al [35] proposed a table detection method relying on the correlation of white spaces and vertical connected component analysis. For the comprehensive summarization of these rule-based approaches, readers may refer to [13,[36][37][38][39]. Although these rule-based methods work well on documents with similar tabular layouts, they are laborious in terms of finding optimal heuristics.…”
Section: Rule-based Approachesmentioning
confidence: 99%
“…2) High intra-class variance (within the single class such as tables with and without ruling lines). Due to these chal-lenges, it is highly complex to come up with custom heuristics that can assist in developing robust and generic table detection system [13].…”
Section: Introductionmentioning
confidence: 99%
“…Thus, a new table template matching that can deal with such variations has been developed. It matches the hierarchical structure of the table document and the defined template using an association graph (see Pelillo et al [1] and Ishitano [2]) by finding a maximum clique. Thus, the columns and the defined header in the template are detected.…”
Section: Template Processingmentioning
confidence: 99%
“…Among many other difficulties faced when designing a full Information Extraction workflow, we address in this paper the problem of table understanding: recognizing the structural organization of tables to extract data [1]. While in the considered documents (register books, see description Section 2), the vertical structures are relatively simple (nonhierarchical table composed of roughly 7-10 columns), the segmentation into rows turns out to be more challenging.…”
Section: Introductionmentioning
confidence: 99%
“…Line segment recognition has, however, been steadily improved during the last three decades as part of table interpretation [1,2,3,4,5], form processing [6,7,8,9], and engineering drawing analysis [10,11,12,13]. Historical form analysis [14,15] became popular even as most contemporary forms migrated to the web. The Hough transform for line location has remained one of the leading methods for line and arc extraction since its rediscovery by Duda and Hart in the early seventies [16].…”
Section: Prior Workmentioning
confidence: 99%