2010 IEEE 30th International Conference on Distributed Computing Systems 2010
DOI: 10.1109/icdcs.2010.81
|View full text |Cite
|
Sign up to set email alerts
|

A Hybrid Approach to High Availability in Stream Processing Systems

Abstract: Abstract-Stream processing is widely used by today's applications such as financial data analysis and disaster response. In distributed stream processing systems, machine fail-stop events are handled by either active standby or passive standby. However, existing HA schemes have not sufficiently addressed the situation when a machine becomes temporarily unavailable due to data rate spikes, intensive analysis or job sharing, which happens frequently but lasts for short time. It is not clear how well active and p… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
23
0
2

Year Published

2011
2011
2023
2023

Publication Types

Select...
4
3
2

Relationship

0
9

Authors

Journals

citations
Cited by 53 publications
(25 citation statements)
references
References 17 publications
0
23
0
2
Order By: Relevance
“…However, this requires an average utilisation of less than 50%-active replication is suspended when peak workloads demand all resources. While we assume long-lived failures, Zhang et al [33] focus on transient failures. They combine active replication in the face of failures for fast recovery with passive replication during normal operation.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…However, this requires an average utilisation of less than 50%-active replication is suspended when peak workloads demand all resources. While we assume long-lived failures, Zhang et al [33] focus on transient failures. They combine active replication in the face of failures for fast recovery with passive replication during normal operation.…”
Section: Related Workmentioning
confidence: 99%
“…While mechanisms for scale out [27,26] and fault tolerance [30,28,33] in stream processing have received considerable attention in the past, it remains an open question how SPSs can scale out while remaining fault tolerant when queries contain stateful operators. Especially with recently popular stream processing models [23,29] that treat operators as black boxes in a data flow graph, users rely on operators that have large amounts of state, which potentially depends on the complete history of previously processed tuples [5].…”
Section: Introductionmentioning
confidence: 99%
“…The MapReduce approach requires the operator state to be partitionable, but enables a single operator to scale to a large number of nodes. An approach related, but somehow opposed, to ours is proposed by Zhang et al [18]. In their work passive replication is used when the system is in steady state.…”
Section: E Node Failuresmentioning
confidence: 99%
“…Therefore, the academic community has put forward a number of fault-tolerant programs based on the SPSs [8-14, 19, 20], most of which are achieved by backing up operator state or the calculation tuples. For instance, Zhang Z et al [19] has reduced both the overhead and latency of the system by mixing the active standby and passive standby, utilizing the ideas of the optimal fault tolerance. Upadhyaya P et al [20] has implemented different fault-tolerant algorithms for different operators so that the recovery time and fault tolerance efficiency can achieve optimal.…”
Section: Introductionmentioning
confidence: 99%