Proceedings of the 11th ACM Symposium on Cloud Computing 2020
DOI: 10.1145/3419111.3421292
|View full text |Cite
|
Sign up to set email alerts
|

Influence-based provenance for dataflow applications with taint propagation

Abstract: Debugging big data analytics often requires a root cause analysis to pinpoint the precise culprit records in an input dataset responsible for incorrect or anomalous output. Existing debugging or data provenance approaches do not track fine-grained control and data flows in user-defined application code; thus, the returned culprit data is often too large for manual inspection and expensive post-mortem analysis is required. We design FLOWDEBUG to identify a highly precise set of input records based on two key in… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
5

Relationship

0
5

Authors

Journals

citations
Cited by 9 publications
references
References 35 publications
0
0
0
Order By: Relevance