1999
DOI: 10.1007/3-540-48172-9_21
|View full text |Cite
|
Sign up to set email alerts
|

The T-Recs Table Recognition and Analysis System

Abstract: This paper presents a new approach to table structure recognition as well as to layout analysis. The discussed recognition process differs significantly from existing approaches as it realizes a bottom-up clustering of given word segments, whereas conventional table structure recognizers all rely on the detection of some separators such as delineation or significant white space to analyze a page from the top-down. The following analysis of the recognized layout elements is based on the construction of a tile s… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
38
0

Year Published

2006
2006
2021
2021

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 63 publications
(38 citation statements)
references
References 6 publications
(5 reference statements)
0
38
0
Order By: Relevance
“…OCR and rule based text analysis are helpful in recognition of specific text blocks, such as: captions, abstracts, document authors, etc. Tables without grid lines (plain text) are recognized by the T-Recs algorithm [18]. …”
Section: The Idea Of the Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…OCR and rule based text analysis are helpful in recognition of specific text blocks, such as: captions, abstracts, document authors, etc. Tables without grid lines (plain text) are recognized by the T-Recs algorithm [18]. …”
Section: The Idea Of the Methodsmentioning
confidence: 99%
“…Recognition of tables without grid lines These table structures are recognized using the T-Recs table recognition and analysis system [18]. T-Recs realises the bottomup clustering of word segments and does not apply any other top-down specific techniques (separator detection).…”
Section: An Object Is Recognized As a Table Ifmentioning
confidence: 99%
“…For example, table tags exist in HTML, but they are often used for formatting web page layout. Previous work focused on detecting tables from PDF, HTML and ASCII documents using Optical Character Recognition [13], machine learning algorithms such as C4.5 decision trees [17] or SVM [22,19], and heuristics [26].…”
Section: Introductionmentioning
confidence: 99%
“…Several studies have been presented on table recognition [1], [3], [6], [7], [8], [15], [16]. To reduce the complexity of the problem, some of them use tables composed by perfect horizontal and vertical line segments.…”
Section: Introductionmentioning
confidence: 99%