Proceedings of the 16th ACM SIGSOFT International Symposium on Foundations of Software Engineering 2008
DOI: 10.1145/1453101.1453147
|View full text |Cite
|
Sign up to set email alerts
|

Experience in using a process language to define scientific workflow and generate dataset provenance

Abstract: This paper describes our experiences in exploring the applicability of software engineering approaches to scientific data management problems. Specifically, this paper describes how process definition languages can be used to expedite production of scientific datasets as well as to generate documentation of their provenance. Our approach uses a process definition language that incorporates powerful semantics to encode scientific processes in the form of a Process Definition Graph (PDG). The paper describes how… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
18
0

Year Published

2009
2009
2012
2012

Publication Types

Select...
4
2
1

Relationship

3
4

Authors

Journals

citations
Cited by 23 publications
(18 citation statements)
references
References 36 publications
0
18
0
Order By: Relevance
“…Initially SmartCheck will use Data Derivation Graphs (DDGs) [13], [14] to manage such process information. A DDG is a DAG-like structure that records the steps performed, what entity performed them, and (usually pointers to) the artifacts used as inputs and outputs.…”
Section: Context Awarenessmentioning
confidence: 99%
“…Initially SmartCheck will use Data Derivation Graphs (DDGs) [13], [14] to manage such process information. A DDG is a DAG-like structure that records the steps performed, what entity performed them, and (usually pointers to) the artifacts used as inputs and outputs.…”
Section: Context Awarenessmentioning
confidence: 99%
“…These graphs are defined and queried using a Virtual Data Language. Osterweil et al [17] similarly apply graphical notions of provenance to hydrological studies using derivation functions that include both tools and human operations. In digital forensics, provenance techniques can similarly be applied to report on transformations from an acquired raw image (i.e., a data source) to evidence (i.e., a data product); Process provenance metadata is not a standard part of investigations though it would address many concerns about the reproducibility of examinations.…”
Section: Related Workmentioning
confidence: 99%
“…In this section we present several concrete examples of rework that we have encountered, and which seem amenable to solution using the pattern presented later in this paper. Though we have found rework in dispute resolution [2], scientific data processing [3,4], health care delivery [5], and other domains, the examples here are drawn from software engineering.…”
Section: Motivating Examples Of Reworkmentioning
confidence: 99%
“…Since rework often involves the modification of existing entities, is is usual to pass these entities by reference. We note, however, that the derivation history [9,3,4] of these entities is often particularly important in the creation of a rework plan as that history is often necessary in order to guide the re-invocation away from repetition of choices that have previously been shown to have led to later problems.…”
Section: Managing the Contextmentioning
confidence: 99%
See 1 more Smart Citation