Proceedings of the 38th International Conference on Software Engineering 2016
DOI: 10.1145/2884781.2884813
|View full text |Cite
|
Sign up to set email alerts
|

BigDebug

Abstract: Developers use cloud computing platforms to process a large quantity of data in parallel when developing big data analytics. Debugging the massive parallel computations that run in today’s data-centers is time consuming and error-prone. To address this challenge, we design a set of interactive, real-time debugging primitives for big data processing in Apache Spark, the next generation data-intensive scalable cloud computing platform. This requires re-thinking the notion of step-through debugging in a tradition… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
12
0

Year Published

2017
2017
2023
2023

Publication Types

Select...
3
2
2

Relationship

1
6

Authors

Journals

citations
Cited by 58 publications
(12 citation statements)
references
References 27 publications
0
12
0
Order By: Relevance
“…While the rst is arguably not desirable for reactive systems, the noti cation property might be a good addition to RxFiddle. BIGDEBUG [26], a debugging solution for systems like Spark [52], introduces simulated breakpoints for this purpose. When a simulated breakpoint is reached, the execution resumes immediately and the required lineage information of the breakpoint is collected in a new independent process.…”
Section: Marblementioning
confidence: 99%
“…While the rst is arguably not desirable for reactive systems, the noti cation property might be a good addition to RxFiddle. BIGDEBUG [26], a debugging solution for systems like Spark [52], introduces simulated breakpoints for this purpose. When a simulated breakpoint is reached, the execution resumes immediately and the required lineage information of the breakpoint is collected in a new independent process.…”
Section: Marblementioning
confidence: 99%
“…In order to deal with non-deterministic inputs, a partial order of variable accesses or events has to be stored in the trace to be able to reproduce the program concurrent behaviour. To make replay debugging scalable, several debuggers combine trace recording with checkpoint-based debugging, in which snapshots of the application are taken at certain periods of time to limit the size of the trace that needs to be stored [1,16].…”
Section: Offline Debuggingmentioning
confidence: 99%
“…As such, it may not be convenient to let a temperature monitoring application crash, and retrieve a log or a trace of the execution afterward. In some cases, stopping the execution of the program may also lead to information loss [16].…”
Section: Offline Debuggingmentioning
confidence: 99%
See 2 more Smart Citations