Proceedings of the 4th Workshop on Workflows in Support of Large-Scale Science 2009
DOI: 10.1145/1645164.1645171
|View full text |Cite
|
Sign up to set email alerts
|

A data-driven workflow language for grids based on array programming principles

Abstract: International audienceDifferent scientific workflow languages have been developed to help programmers in designing complex data analysis pro- cedures. However, little effort has been invested in com- paring and finding a common root for existing approaches. This work is motivated by the search for a scientific workflow language which coherently integrates different aspects of dis- tributed computing. The language proposed is data-driven for easing the expression of parallel flows. It leverages array programmin… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
45
0

Year Published

2010
2010
2017
2017

Publication Types

Select...
5
2

Relationship

1
6

Authors

Journals

citations
Cited by 51 publications
(45 citation statements)
references
References 19 publications
0
45
0
Order By: Relevance
“…SciFloware is data-driven [37] in the sense that its internal workflow language clearly separates the definition of data to be processed from the graph of activities to be applied to the data. This separation is particularly suited to scientific workflows where the same experiment has to be reused for analyzing different data sets without any change.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…SciFloware is data-driven [37] in the sense that its internal workflow language clearly separates the definition of data to be processed from the graph of activities to be applied to the data. This separation is particularly suited to scientific workflows where the same experiment has to be reused for analyzing different data sets without any change.…”
Section: Discussionmentioning
confidence: 99%
“…This separation is particularly suited to scientific workflows where the same experiment has to be reused for analyzing different data sets without any change. Optimization in SciFloware focuses on two main aspects: It uses asynchronous messages to execute workflows on a distributed infrastructure such as Grid or Cloud [37,38]. Furthermore, while it uses generally coarse grain parallelism, it is able to exploit the fact that some workflow actors may be algebraic operators (i.e., some actors are not black-boxes) to optimize the workflow execution.…”
Section: Discussionmentioning
confidence: 99%
“…Many of them support counting loops without dependencies in grids and clusters, e.g. [1,7,18,21,30], some of which also support counting loops with dependencies [18,21,30]. Counting loops without dependencies can be represented as multiple DAG and counting loops with dependencies can also be linearized as a DAG [18].…”
Section: Related Workmentioning
confidence: 99%
“…Counting loops without dependencies can be represented as multiple DAG and counting loops with dependencies can also be linearized as a DAG [18]. For conditional loops, there are dataflow languages [21,30] that supports it out-of-the-box. The main issue of current iterative workflow approaches is that the workflow execution is tightly coupled to the workflow specification model.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation