2020
DOI: 10.1007/978-3-662-62271-1_2

Efficient Execution of Scientific Workflows in the Cloud Through Adaptive Caching

Abstract: Many scientific experiments are now carried out using scientific workflows, which are becoming increasingly data-intensive and complex. We consider the efficient execution of such workflows in the cloud. Since workflow users commonly reuse other workflows or data generated by other workflows, a promising approach for efficient workflow execution is to cache intermediate data and exploit it to avoid task re-execution. In this paper, we propose an adaptive caching solution for data-intensive workflows…

Cited by 2 publications (3 citation statements)
References 31 publications

“…For example, in [16], the authors simply rely on XCache [19] and its internal implementation of the LRU policy. To the best of our knowledge, only paper [20] explores the usage of a caching technique significantly different from LRU for scientific workloads. The authors propose an adaptive caching solution that is only suitable for tasks with high re-execution rates, which are not present in the WLCG.…”
Section: Pos(isgc2022)009 (mentioning)
confidence: 99%
“…In the function cache_file, the second argument indicates the fraction of the file that needs to be cached. The final step is to add a fraction of the first rectangle that did not fit (lines 18-20). Note how we are implicitly solving the fractional knapsack problem, and then we either match or exceed the optimal solution of the original knapsack problem [27, Sec.…”
Section: Olga Chuchuk (mentioning)
confidence: 99%
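
The greedy step described in the statement above can be illustrated with a minimal Python sketch of the fractional knapsack approach. This is not the citing authors' code: the function name fill_cache, the (name, size, value) tuple layout, and the ranking by value per unit of size are assumptions made for illustration. Only the name cache_file and the meaning of its second argument (the fraction of the file to cache) come from the quoted text.

```python
from typing import Callable, List, Tuple


def fill_cache(
    files: List[Tuple[str, int, float]],
    capacity: int,
    cache_file: Callable[[str, float], None],
) -> List[Tuple[str, float]]:
    """Greedy fractional knapsack over (name, size, value) tuples.

    Hypothetical helper: files are ranked by value per unit of size,
    cached whole while they fit, and the first file that does not fit
    is cached partially to use up the remaining space.
    """
    ranked = sorted(files, key=lambda f: f[2] / f[1], reverse=True)
    remaining = capacity
    plan: List[Tuple[str, float]] = []
    for name, size, _value in ranked:
        if remaining <= 0:
            break
        if size <= remaining:
            # The whole file fits.
            cache_file(name, 1.0)
            plan.append((name, 1.0))
            remaining -= size
        else:
            # First file that does not fit: cache only a fraction, then stop.
            fraction = remaining / size
            cache_file(name, fraction)
            plan.append((name, fraction))
            break
    return plan


calls: List[Tuple[str, float]] = []
plan = fill_cache(
    [("a", 10, 60.0), ("b", 20, 100.0), ("c", 30, 120.0)],
    capacity=50,
    cache_file=lambda name, fraction: calls.append((name, fraction)),
)
```

With a capacity of 50, files a and b are cached whole and two thirds of c fills the remaining space. Because fractions are allowed, the total value of this plan is at least that of the best all-or-nothing selection, which is the point the quoted statement makes.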
“…Workflow simplification is performed by the workflow manager before execution, transforming the workflow into an executable workflow and considering the metadata, input, and cache data location. It is based on the workflow simplification method presented in [23].…”
Section: Workflow Simplification (mentioning)
confidence: 99%
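
One way to picture workflow simplification against a cache is the Python sketch below. It is an illustration under stated assumptions, not the method of [23]: the name simplify_workflow, the task representation (each task maps to its input data items and a single output item), and the backward traversal from the requested outputs are all hypothetical, and the sketch ignores metadata and cache data location, which the quoted statement says the real method considers.

```python
from typing import Dict, Iterable, List, Set, Tuple


def simplify_workflow(
    tasks: Dict[str, Tuple[List[str], str]],
    targets: Iterable[str],
    cached: Set[str],
) -> Set[str]:
    """Return the names of tasks that still need to run.

    tasks maps a task name to (input data items, output data item).
    A task is kept only if some requested item depends on its output
    and that output is not already available in the cache.
    """
    producer = {output: name for name, (_inputs, output) in tasks.items()}
    needed: Set[str] = set()
    seen: Set[str] = set()
    stack = list(targets)
    while stack:
        data = stack.pop()
        if data in seen:
            continue
        seen.add(data)
        if data in cached or data not in producer:
            # Cached intermediate data or a raw input: nothing to execute.
            continue
        task = producer[data]
        needed.add(task)
        stack.extend(tasks[task][0])
    return needed


workflow = {
    "t1": (["raw"], "d1"),
    "t2": (["d1"], "d2"),
    "t3": (["d2"], "d3"),
}
to_run = simplify_workflow(workflow, targets=["d3"], cached={"d2"})
```

Here d2 is already cached, so only t3 remains in the executable workflow; with an empty cache all three tasks would be kept.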