2019
DOI: 10.1007/978-3-030-30275-7_6
|View full text |Cite
|
Sign up to set email alerts
|

TabbyXL: Rule-Based Spreadsheet Data Extraction and Transformation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
3
1
1

Relationship

0
5

Authors

Journals

citations
Cited by 7 publications
(3 citation statements)
references
References 36 publications
0
3
0
Order By: Relevance
“…After the table localization, segmentation, and functional and structural analysis, several methods design algorithms for expressing the inferred structural relationships as relational models (JSON or XML formats) or RDF formats. TranSheet (Hung, Benatallah, & Saint‐Paul, 2011) and TabbyXL (Shigarov, Khristyuk, Mikhailov, & Paramonov, 2019) introduce languages for the specification of the transformations needed to simultaneously define the table functional and structural relationships and then convert them into a relational format. Moreover, Senbazuru (Chen et al, 2013) and HaExcel (Cunha et al, 2014) automatically extrapolate the table functional and structural relationships by applying some of the algorithms described in Section 3.2 and then exploit rule‐based algorithms to define and combine the relational tuples representing the structural relationships.…”
Section: Extracting and Transforming Tablesmentioning
confidence: 99%
See 1 more Smart Citation
“…After the table localization, segmentation, and functional and structural analysis, several methods design algorithms for expressing the inferred structural relationships as relational models (JSON or XML formats) or RDF formats. TranSheet (Hung, Benatallah, & Saint‐Paul, 2011) and TabbyXL (Shigarov, Khristyuk, Mikhailov, & Paramonov, 2019) introduce languages for the specification of the transformations needed to simultaneously define the table functional and structural relationships and then convert them into a relational format. Moreover, Senbazuru (Chen et al, 2013) and HaExcel (Cunha et al, 2014) automatically extrapolate the table functional and structural relationships by applying some of the algorithms described in Section 3.2 and then exploit rule‐based algorithms to define and combine the relational tuples representing the structural relationships.…”
Section: Extracting and Transforming Tablesmentioning
confidence: 99%
“…Such labeling procedure allows defining hierarchical structural relationships, which are then transformed into a relational format. With TabbyXL (Shigarov et al, 2019), users may define functional and structural relationships by using either the standard DROOLS language (RedHat, 2020), or CRL (Shigarov & Mikhailov, 2017), a specifically developed rule-based language for implementing programs aimed at (a) cell cleansing issues (merging, splitting, updating data), (b) role analysis, to recover entities and labels as functional data items presented in tables (marking all the cells having the same role or located in the same functional region with a fixed value), (c) structural analysis (entity-label association), and (d) interpretation (labeling with a category).…”
Section: Basic Extraction and Transformation Toolsmentioning
confidence: 99%
“…[5]), data transformation (e.g. [6]), programming by example (e.g. [4]), and semantic characterization of the information (e.g.…”
Section: Introductionmentioning
confidence: 99%