2011 31st International Conference on Distributed Computing Systems 2011
DOI: 10.1109/icdcs.2011.29
|View full text |Cite
|
Sign up to set email alerts
|

Low-Overhead Fault Tolerance for High-Throughput Data Processing Systems

Abstract: The MapReduce programming paradigm proved to be a useful approach for building highly scalable data processing systems. One important reason for its success is simplicity, including the fault tolerance mechanisms. However, this simplicity comes at a price: efficiency. MapReduce's fault tolerance scheme stores too much intermediate information on disk. This inefficiency negatively affects job completion time. Furthermore, this inefficiency in particular forbids the application of MapReduce in near real-time sce… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
15
0

Year Published

2011
2011
2019
2019

Publication Types

Select...
4
3
1

Relationship

2
6

Authors

Journals

citations
Cited by 30 publications
(15 citation statements)
references
References 14 publications
(16 reference statements)
0
15
0
Order By: Relevance
“…This kind of stratification has been investigated in [19]. Recent works on streaming MapReduce [21,20,11] also focus on the possibility to model any computation as a sequence of alternated map and reduce stages, supporting the idea that a large class of computations can be structured as a sequence of consecutive stages where events always flow from previous to subsequent stages.…”
Section: Workload Characteristicsmentioning
confidence: 99%
See 1 more Smart Citation
“…This kind of stratification has been investigated in [19]. Recent works on streaming MapReduce [21,20,11] also focus on the possibility to model any computation as a sequence of alternated map and reduce stages, supporting the idea that a large class of computations can be structured as a sequence of consecutive stages where events always flow from previous to subsequent stages.…”
Section: Workload Characteristicsmentioning
confidence: 99%
“…Other works [21,20,11] try to bridge the gap between continuous queries and MapReduce paradigm by proposing a stream based version of the MapReduce approach [3] where events uninterruptedly flow among the map and reduce stages of a certain computation without incurring in the delays typical of batch oriented solutions.…”
Section: Related Workmentioning
confidence: 99%
“…E-STREAMHUB supports full elasticity, i.e., scale out and in, while optimizing the overall system utilization, and without requiring specific application support. Its underlying runtime engine can support both passive [26] and active [25] replication.…”
Section: B Elastic Complex Event Processingmentioning
confidence: 99%
“…Each slice can be supported by multiple cores operating in parallel when the processing is stateless or requires only an R lock. STREAMMINE3G supports dependability through passive [26] or active [25] slice replication. This requires no modification to the application but the evaluation of these mechanisms is out of the scope of the present paper.…”
Section: The Streamhub Scalable Pub/sub Enginementioning
confidence: 99%
“…If nodes rely on state, this state needs to be keep safe for when failures occur. In a previous work, we have detailed an approach for passive replication for ESP systems that apply the MapReduce approach [13]. However, as detailed in Section II, traditional passive replication has a low resource overhead, but requires a long recovery phase after a failure as both the checkpoint needs to be restored and events from the log need to be replayed and reprocessed.…”
Section: E Node Failuresmentioning
confidence: 99%