2015 38th International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO) 2015
DOI: 10.1109/mipro.2015.7160272

Four level provenance support to achieve portable reproducibility of scientific workflows

Abstract: In the scientific community, one of the most vital challenges is the reproducibility of workflow execution. To reproduce the results of an experiment, provenance information must be collected on the one hand, and the dependencies of the execution must be eliminated on the other. Concerning the workflow execution environment, we have differentiated four levels of provenance: infrastructural, environmental, workflow and data provenance. During the re-execution at all levels the components ca…

Cited by 13 publications (10 citation statements)
References 9 publications
“…Those include documentation of all relevant analysis artefacts. In their paper, Bánáti et al [3] classified several dependencies - that have a direct impact on the reproducibility of experiments - into three categories: infrastructural dependency, data dependency and job execution dependency. According to their work, reproducibility of computational studies requires to fully document the computational environments, and to ensure that all experimental resources remain accessible.…”
Section: Reproducibility (mentioning)
confidence: 99%
“…With help of the given results together with the information gained from the user the system can create a so called Dependency Dataset, which will store all the jobs which depend on any external circumstances and may not be reproducible. In our previous paper [2] we showed, that the rate of reproducibility of a scientific workflow can be computed with the help of which the reproducible parts of workflow can be determined. From this dataset, after viewing the results the user - before finally submits his workflow - can think over the model, he can modify it and can eliminate certain dependencies or he can decide to apply extra provenance or virtualization tools to preserve the workflow.…”
Section: Dependency Dataset (mentioning)
confidence: 99%
“…In our previous work [2] we determined the four level of the provenance, and the different utilizations of the captured data in the different levels. Capturing provenance data during the running time of the workflow is crucial to create reproducible workflows.…”
Section: Introduction (mentioning)
confidence: 99%
“…The former can be perceived as the necessary and the latter one as the satisfactory requirements of the reproducibility. The dependencies of the execution mean those resources which require external (out of the scientific workflow management system, SWfMS) services or resources such as third party services, special hardwares/softwares or random value generator [2]. Elimination of these dependencies in most cases is not possible, so they have to be handled in some other way: different methods should be set up to make the workflows reproducible.…”
Section: Introduction (mentioning)
confidence: 99%