2010 IEEE 26th International Conference on Data Engineering (ICDE 2010) 2010
DOI: 10.1109/icde.2010.5447816
|View full text |Cite
|
Sign up to set email alerts
|

Optimizing ETL workflows for fault-tolerance

Abstract: Abstract-Extract-Transform-Load (ETL) processes play an important role in data warehousing. Typically, design work on ETL has focused on performance as the sole metric to make sure that the ETL process finishes within an allocated time window. However, other quality metrics are also important and need to be considered during ETL design. In this paper, we address ETL design for performance plus fault-tolerance and freshness. There are many reasons why an ETL process can fail and a good design needs to guarantee… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
57
0

Year Published

2011
2011
2021
2021

Publication Types

Select...
6
1

Relationship

1
6

Authors

Journals

citations
Cited by 52 publications
(58 citation statements)
references
References 9 publications
0
57
0
Order By: Relevance
“…A work of [31] extends that of [12] w.r.t. generating an optimal ETL workflow in terms of performance, fault tolerance, and freshness, as described in [23].…”
Section: State-space Approach For Optimizing An Etl Workflowmentioning
confidence: 99%
See 3 more Smart Citations
“…A work of [31] extends that of [12] w.r.t. generating an optimal ETL workflow in terms of performance, fault tolerance, and freshness, as described in [23].…”
Section: State-space Approach For Optimizing An Etl Workflowmentioning
confidence: 99%
“…If an ETL workflow is large and complex, the generation of an optimal workflow may take longer than the actual time of execution of the ETL workflow itself. Moreover, [12,31] are limited to a few transition techniques from one state to another and also do not give an account to translate an optimized logical model to its semantically equivalent physical implementation.…”
Section: Consmentioning
confidence: 99%
See 2 more Smart Citations
“…For example, a strict requirement regarding recoverability may suggest to consider adding recovery points at points of the flow that are generally known for being expensive (e.g., after the extraction phase or after an expensive blocking operator [17]). Of course the final decision on which are the good places to add recovery points is to be taken by an optimizer at the logical level [17].…”
Section: Operation Identificationmentioning
confidence: 99%