2017
DOI: 10.1007/978-3-319-59536-8_33

Table Identification and Reconstruction in Spreadsheets

Abstract: Spreadsheets are one of the most successful content generation tools, used in almost every enterprise to perform data transformation, visualization, and analysis. The high degree of freedom provided by these tools results in very complex sheets, intermingling the actual data with formatting, formulas, layout artifacts, and textual metadata. To unlock the wealth of data contained in spreadsheets, a human analyst will often have to understand and transform the data manually. To overcome this cumbersome p…

Citations: cited by 21 publications (17 citation statements)
References: 12 publications (12 reference statements)
“…There is a considerable number of works tackling layout inference and information extraction in spreadsheets. Recent publications propose approaches involving to some extent machine learning techniques, such as [2], [3], [4], [5], and [6]. Also, we find rule-based approaches, like [7].…”
Section: Related Work (mentioning)
confidence: 80%
“…In a similar fashion to [4], we then use the inferred roles to create the so-called layout regions (see Figure 1c). These group together adjacent cells having the same layout role.…”
Section: Introduction (mentioning)
confidence: 99%
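The grouping step described in the statement above (adjacent cells with the same layout role forming a layout region) can be illustrated with a minimal sketch. It assumes that "adjacent" means sharing an edge (4-connectivity) and that roles arrive as a grid of labels; the function name `layout_regions`, the role labels, and the flood-fill strategy are all hypothetical choices for illustration, not the method of the cited paper.

```python
from collections import deque


def layout_regions(roles):
    """Group edge-adjacent cells that share a role into regions.

    roles: list of rows, each a list of role labels (None marks an empty cell).
    Returns a list of (role, sorted list of (row, col)) tuples in scan order.
    Assumption: adjacency is 4-connectivity; the source does not specify this.
    """
    seen = set()
    regions = []
    for r, row in enumerate(roles):
        for c, role in enumerate(row):
            if role is None or (r, c) in seen:
                continue
            # Breadth-first flood fill from the first unvisited cell of a region.
            seen.add((r, c))
            queue = deque([(r, c)])
            cells = []
            while queue:
                cr, cc = queue.popleft()
                cells.append((cr, cc))
                for nr, nc in ((cr - 1, cc), (cr + 1, cc), (cr, cc - 1), (cr, cc + 1)):
                    in_bounds = 0 <= nr < len(roles) and 0 <= nc < len(roles[nr])
                    if in_bounds and (nr, nc) not in seen and roles[nr][nc] == role:
                        seen.add((nr, nc))
                        queue.append((nr, nc))
            regions.append((role, sorted(cells)))
    return regions


# Hypothetical role labels: "H" for header cells, "D" for data cells.
grid = [
    ["H", "H"],
    ["D", "D"],
    [None, "D"],
]
regions = layout_regions(grid)
```

Here `regions` holds one header region covering the first row and one data region covering the three data cells. A real pipeline might instead require rectangular regions or use a different adjacency rule.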
“…We see recognition and information extraction in spreadsheets as a series of steps, which collectively form our processing pipeline, illustrated in Figure 1. Although we cover various aspects of automatic spreadsheet processing, our research focuses mainly on two crucial tasks: layout inference [13, 15] and table identification [10, 11, 12, 14]. Subsequently, we adapt approaches from related work, to extract the information from the detected tables.…”
Section: Processing Pipeline (mentioning)
confidence: 99%
“…We have proposed several approaches for table recognition in spreadsheets [10, 12, 14]. Initially we employed heuristic- and rule-based methods.…”
Section: Table Recognition (mentioning)
confidence: 99%