ProvManager: a provenance management system for scientific workflows

Marinho, Anderson; Murta, Leonardo; Werner, Cláudia Maria Lima; Braganholo, Vanessa; Cruz, Sérgio Manuel Serra da; Ogasawara, Eduardo; Mattoso, Marta

doi:10.1002/cpe.1870

Cited by 30 publications

(25 citation statements)

References 19 publications

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…The ProvManager [3] approach, as previously discussed, works by adapting workflow activities. It minimizes the overhead of activities instrumentation via an automatic adaptation process.…”

Section: Related Workmentioning

confidence: 99%

“…This provenance gathering mechanism uses Git [9] VCS to capture and store retrospective provenance, including ImP. The ProvMonitor implementation is integrated with ProvManager [3], an activity-based provenance gathering mechanism that works through automatic workflow instrumentation. This way, each activity becomes responsible for gathering its own provenance.…”

Section: The Provmonitor Approachmentioning

confidence: 99%

See 1 more Smart Citation

Implicit provenance gathering through configuration management

Neves¹,

Braganholo²,

Murta³

2013

2013 5th International Workshop on Software Engineering for Computational Science and Engineering (SE-CSE)

Self Cite

View full text Add to dashboard Cite

Scientific experiments based on computer simulations usually consume and produce huge amounts of data. Data provenance is used to help scientists answer queries related to how experiment data were generated or changed. However, during the experiment execution, data not explicitly referenced by the experiment specification may lead to an implicit data flow missed by the existing provenance gathering infrastructures. This paper introduces a novel approach to gather and store implicit data flow provenance through configuration management. Our approach opens some new opportunities in terms of provenance analysis, such as identifying implicit data flows, identifying data transformations along an experiment trial, comparing data evolution in different trials of the same experiment, and identifying side effects on data evolution caused by implicit data flows.

show abstract

“…The ProvManager [3] approach, as previously discussed, works by adapting workflow activities. It minimizes the overhead of activities instrumentation via an automatic adaptation process.…”

Section: Related Workmentioning

confidence: 99%

Section: The Provmonitor Approachmentioning

confidence: 99%

Implicit provenance gathering through configuration management

Neves¹,

Braganholo²,

Murta³

2013

2013 5th International Workshop on Software Engineering for Computational Science and Engineering (SE-CSE)

Self Cite

View full text Add to dashboard Cite

show abstract

“…These two solutions are nevertheless limited to these two systems and their adaptability to a broader range of SWfMSs would depend on a complete reformulation of their architectures. Other pieces of work [19], [20] propose approaches to the interoperation of different SWfMSs based on provenance metadata stored according to the OPM [21] and PROV [22] standards. The achieved interoperation in these solutions is retrospective though, in the sense they capture metadata about past workflow executions from different SWfMSs and allow for joint analyses over them; therefore, they do not give support for scientists to reuse workflow specifications across these different SWfMSs.…”

Section: B the Studied Systemsmentioning

confidence: 99%

Scientific Workflow Interchanging through Patterns: Reversals and Lessons Learned

Bastos

Braga

Gomes³

2015

2015 IEEE 11th International Conference on E-Science

View full text Add to dashboard Cite

Scientific workflows are used for dealing with complex problems in different e-science domains. These workflows are modeled and executed using Scientific Workflow Management Systems (SWfMSs). Generally, SWfMSs provide their own Workflow Specification Language (WfSL), and this is a challenge considering the possibility of interchanging workflow specifications between different SWfMSs. Nevertheless, the reuse of workflows gains growing importance as it helps with fostering the collaboration and cross-fertilization across different research groups. This paper presents a research proposal, including its mishaps and assimilations, on the use of workflow patterns combined with software architecture concepts to capture the key semantics expressed in scientific workflows specified in different WfSLs and to allow the interchanging of these specifications between different SWfMSs. This paper also shows how our findings based on real world specifications led us to reformulate our initial proposal and discuss the new results.

show abstract

“…The observed strategy requires operating systems to continuously collect provenance about running processes, their inputs and outputs [19]. The disclosed strategy requires adapted applications to collect provenance as designed by software architects [20], [21]. Users sometimes need to manually declare provenance when it cannot be captured by the system or application [11], [18].…”

Section: Basics Of Pasmentioning

confidence: 99%

“…A query engine is usually specific to a storage model. Hence, users have to write queries in languages specific to the storage model, such as SQL [22], Prolog [21], or SPARQL [23]. However, these general languages were not designed specifically for provenance.…”

Section: Basics Of Pasmentioning

confidence: 99%

A Provenance-Aware Access Control Framework with Typed Provenance

Sun

Park

Nguyen

et al. 2016

IEEE Trans. Dependable and Secure Comput.

View full text Add to dashboard Cite

Provenance is a directed graph that captures historical information about data items in Provenance-Aware Systems (PAS). A variety of access control models and policy languages specific to PAS have been recently discussed in literature. However, it is still not clear how to efficiently specify provenance-aware access control policies and how to effectively enforce these policies with respect to complex provenance graph that can only be captured at run-time. To this end, we design and implement a provenance-aware access control framework with a layered architecture that features an abstract layer, including a Typed Provenance Model (TPM) and a set of TPM interpreters. TPM includes a set of abstract provenance types enabling efficient specification of provenance-aware policies. New provenance types can be composed of extant ones for specifying new policies. TPM interpreters can be integrated to enable the policy enforcement with respect to provenance graphs in different physical representations. By treating provenance types as special attributes, the proposed framework enables an adoption of provenance-aware access control in existing attribute-based access control frameworks, such as XACML-compliant ones. We implement the proposed framework by extending SUN's XACML implementation and show that it facilitates the specification of provenance-aware policies in XACML with minor extensions. We also analyze the performance of the proposed framework.

show abstract

ProvManager: a provenance management system for scientific workflows

Cited by 30 publications

References 19 publications

Implicit provenance gathering through configuration management

Implicit provenance gathering through configuration management

Scientific Workflow Interchanging through Patterns: Reversals and Lessons Learned

A Provenance-Aware Access Control Framework with Typed Provenance

Contact Info

Product

Resources

About