2018 IEEE 14th International Conference on E-Science (E-Science) 2018
DOI: 10.1109/escience.2018.00039
|View full text |Cite
|
Sign up to set email alerts
|

Big Provenance Stream Processing for Data Intensive Computations

Abstract: This dissertation is a result of an effort over many years. There are so many people who helped me in various ways during this endeavor. Without their generous support and encouragement, this work would not have been possible. First of all, I am so grateful to my Ph.D. advisor, Prof. Beth Plale for her invaluable support, guidance, and encouragement throughout my Ph.D. Her research experience over many years across multiple areas of Computer Science helped me in many ways to solve hard research problems and to… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
13
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 9 publications
(13 citation statements)
references
References 60 publications
0
13
0
Order By: Relevance
“…Runtime multiworkflow provenance data capture. Komadu [8,12] is the only solution we found in this group. Different from the previous groups, Komadu aims at generating integrated provenance data as a multiworkflow runs.…”
Section: Related Workmentioning
confidence: 75%
See 2 more Smart Citations
“…Runtime multiworkflow provenance data capture. Komadu [8,12] is the only solution we found in this group. Different from the previous groups, Komadu aims at generating integrated provenance data as a multiworkflow runs.…”
Section: Related Workmentioning
confidence: 75%
“…To capture the data relationships between data transformations and workflows, which are given by the consumed and generated data values, the service uses unique identifiers to every data value that flows through it. Using unique identifiers for maintaining relationships of captured data is used in several provenance systems [8,12]. Thus, every data value receives a unique identifier in the PLView.…”
Section: Provcapturermentioning
confidence: 99%
See 1 more Smart Citation
“…It is unrealistic to expect that all phases, their execution, and the processed data will be managed by one single system. Alternatively, provenance tracking systems [12][13][14][15][16] can be coupled to a CSE workflow, providing provenance support while not significantly changing the way CSE users develop their applications. However, these solutions fail to track the interconnections between workflows and fail to track data processed in multiple heterogeneous stores.…”
Section: Related Workmentioning
confidence: 99%
“…11prov.out( batch , loss value)12 prov.out( epoch , conf usion matrix, model hyperprms, model perf , model ref )…”
mentioning
confidence: 99%