2015
DOI: 10.1016/j.future.2014.10.008
|View full text |Cite
|
Sign up to set email alerts
|

Pegasus, a workflow management system for science automation

Abstract: Modern science often requires the execution of large-scale, multi-stage simulation and data analysis pipelines to enable the study of complex systems. The amount of computation and data involved in these pipelines requires scalable workflow management systems that are able to reliably and efficiently coordinate and automate data movement and task execution on distributed computational resources: campus clusters, national cyberinfrastructures, and commercial and academic clouds. This paper describes the design,… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
402
0
15

Year Published

2016
2016
2022
2022

Publication Types

Select...
7
1

Relationship

1
7

Authors

Journals

citations
Cited by 680 publications
(443 citation statements)
references
References 59 publications
(72 reference statements)
0
402
0
15
Order By: Relevance
“…• Pegasus [3] supports workflow executions in distributed environments, such as campus clusters, grids, clouds, and supercomputers. Pegasus maps an application onto available resources, optimizing the execution in terms of performance and data management.…”
Section: Classification Of Workflow Management Systemsmentioning
confidence: 99%
See 2 more Smart Citations
“…• Pegasus [3] supports workflow executions in distributed environments, such as campus clusters, grids, clouds, and supercomputers. Pegasus maps an application onto available resources, optimizing the execution in terms of performance and data management.…”
Section: Classification Of Workflow Management Systemsmentioning
confidence: 99%
“…In recent years, numerous workflow management systems (WMSs) have been developed to manage the execution of diverse workflows on heterogeneous computing resources [3,4,5,6,7,8,9]. As user communities adopt and evolve WMSs to fit their own needs, many of the features and capabilities that were once common to most WMSs have become too distinct to share across systems.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…For instance, in production Workflow Management Systems (WMSs) [1,2,3,4,5,6], the default behavior is that all output data is saved to files and all input data is read from files, which is exactly the CkptAll strategy. While this strategy leads to fast restarts in case of failures, its downside is that it maximizes checkpointing overhead.…”
Section: Introductionmentioning
confidence: 99%
“…When these simulations dataflows are managed by a parallel Scientific Workflow Management System (SWMS) [1], they benefit from provenance [2] and data parallelism among different programs that compose the workflow. Systems like Swift/T [3] and Pegasus [4] are highly scalable SWMS and have shown impressive performance results for many different scientific application domains [3,5].…”
mentioning
confidence: 99%