2016 23rd International Conference on Pattern Recognition (ICPR) 2016
DOI: 10.1109/icpr.2016.7900270
|View full text |Cite
|
Sign up to set email alerts
|

Table headers: An entrance to the data mine

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
3
2
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 11 publications
(5 citation statements)
references
References 20 publications
0
5
0
Order By: Relevance
“…TableDataExtractor returns this standardized table as the output, where each row corresponds to one data cell of the original table, and the columns represent categories to which the data entry belongs. TableDataExtractor is built upon the MIPS (Minimum Indexing Point Search) algorithm developed by Embley et al, 32 and extensions to it, developed by Nagy and Seth, 33 which have been further extended to enable a higher degree of automation, to include support for multiple tables within one table and to automatically resolve footnotes. The algorithms have also been adapted to function in an iterative fashion, which enables the standardization of data in the stub header of the table (Figure 4c).…”
Section: ■ Automated Parsingmentioning
confidence: 99%
“…TableDataExtractor returns this standardized table as the output, where each row corresponds to one data cell of the original table, and the columns represent categories to which the data entry belongs. TableDataExtractor is built upon the MIPS (Minimum Indexing Point Search) algorithm developed by Embley et al, 32 and extensions to it, developed by Nagy and Seth, 33 which have been further extended to enable a higher degree of automation, to include support for multiple tables within one table and to automatically resolve footnotes. The algorithms have also been adapted to function in an iterative fashion, which enables the standardization of data in the stub header of the table (Figure 4c).…”
Section: ■ Automated Parsingmentioning
confidence: 99%
“…Traditional layout analysis can be broadly classified into three methods: top-down methods [19], bottom-up methods [20], and hybrid methods [21]. With the continuous development of machine learning and neural network technology, emerging analysis methods have appeared in the field of layout analysis, mainly based on machine learning.…”
Section: Traditional Layout Analysis Methodsmentioning
confidence: 99%
“…Kise et al [3] used the connected component approach to obtain candidate boundaries of regions for layout segmentation using Voronoi diagrams. Nagy et al [4] cut after projection in the X-Y direction, which is only applicable to structured text with fixed text area and line spacing. This method is sensitive to boundary noise and demands high document quality.…”
Section: Related Workmentioning
confidence: 99%