2015
DOI: 10.1007/978-3-319-16462-5_6
|View full text |Cite
|
Sign up to set email alerts
|

noWorkflow: Capturing and Analyzing Provenance of Scripts

Abstract: We propose noWorkflow, a tool that transparently captures provenance of scripts and enables reproducibility. Unlike existing approaches, noWorkflow is non-intrusive and does not require users to change the way they work -users need not wrap their experiments in scientific workflow systems, install version control systems, or instrument their scripts. The tool leverages Software Engineering techniques, such as abstract syntax tree analysis, reflection, and profiling, to collect different types of provenance, in… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
99
0
4

Year Published

2015
2015
2023
2023

Publication Types

Select...
6
4

Relationship

1
9

Authors

Journals

citations
Cited by 87 publications
(103 citation statements)
references
References 14 publications
0
99
0
4
Order By: Relevance
“…The suggestions are mined from retrospective provenance data 1 [1,31]. (Leake and Kendall-Morwick [13] refer to retrospective provenance as "execution traces, " terminology also adopted in other works [32]). The reason for this is that execution traces are simpler than the workflow structure, since they are basically sequential.…”
Section: Approaches Based On Case-based Reasoningmentioning
confidence: 99%
“…The suggestions are mined from retrospective provenance data 1 [1,31]. (Leake and Kendall-Morwick [13] refer to retrospective provenance as "execution traces, " terminology also adopted in other works [32]). The reason for this is that execution traces are simpler than the workflow structure, since they are basically sequential.…”
Section: Approaches Based On Case-based Reasoningmentioning
confidence: 99%
“…In the architecture discussion, three papers were presented: noWorkflow [80] is a tool that transparently captures provenance of scripts and enables reproducibility, LabelFlow [46] uses domain-specific labels with workflow provenance as a platform for data artifacts labelling; and [84] proposes software provenance to be included as part of software packages.…”
Section: Provenancementioning
confidence: 99%
“…One motivation is obtaining a provenance abstraction capturing data's flow or the process's dependencies. In noWorkflow [24] authors analyse Python scripts to extract function-call hierarchies, which they use to create a view over provenance traces collected by run-time instrumentation of the functions' reads and writes to the file system. In [25] authors employ a taint tracking framework, which instruments programme executions and records which computations are affected by tainted data sources.…”
Section: Related Workmentioning
confidence: 99%