2008
DOI: 10.1007/s00446-008-0068-y
|View full text |Cite
|
Sign up to set email alerts
|

On implementing omega in systems with weak reliability and synchrony assumptions

Abstract: We study the feasibility and cost of implementing Ω-a fundamental failure detector at the core of many algorithms-in systems with weak reliability and synchrony assumptions. Intuitively, Ω allows processes to eventually elect a common leader. We first give an algorithm that implements Ω in a weak system S where (a) except for some unknown timely process s, all processes may be arbitrarily slow or may crash, and (b) only the output links of s are eventually timely (all other links can be arbitrarily slow and lo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
51
0

Year Published

2010
2010
2021
2021

Publication Types

Select...
7
1

Relationship

1
7

Authors

Journals

citations
Cited by 42 publications
(51 citation statements)
references
References 37 publications
0
51
0
Order By: Relevance
“…If process j receives a quorum of vote messages (Line 54) indicating the same leader in the same epoch, then process j starts also believing in its leadership and sets its leader and epoch variables, thus Whenever the election does not converge, e.g., too many messages were lost and no quorum sent proposals to process i, a second timeout (retry-timer) is triggered and process i repeats its broadcasts (Line 1), exponentially backing off on each retry. Note that the timeliness implied by waiting for a quorum is not the minimal to enable atomic broadcasts [2,15]. Nevertheless, we have not yet observed the need for weaker timeliness assumptions.…”
Section: Leader Selection Algorithmmentioning
confidence: 83%
See 1 more Smart Citation
“…If process j receives a quorum of vote messages (Line 54) indicating the same leader in the same epoch, then process j starts also believing in its leadership and sets its leader and epoch variables, thus Whenever the election does not converge, e.g., too many messages were lost and no quorum sent proposals to process i, a second timeout (retry-timer) is triggered and process i repeats its broadcasts (Line 1), exponentially backing off on each retry. Note that the timeliness implied by waiting for a quorum is not the minimal to enable atomic broadcasts [2,15]. Nevertheless, we have not yet observed the need for weaker timeliness assumptions.…”
Section: Leader Selection Algorithmmentioning
confidence: 83%
“…To solve consensus, it is necessary and sufficient to elect a leader [5,7]. The problem of leader election has been broadly investigated from many viewpoints, from minimizing the synchrony and reliability requirements imposed on links [2,15], to optimizing the Quality of Service of failure detection [8], to electing the leader that can minimize the latency of solving consensus [17]. The ZooKeeper atomic broadcast protocol (Zab) implements a variant of the atomic broadcast primitive called primary order atomic broadcast [13].…”
Section: Related Workmentioning
confidence: 99%
“…It is shown in [10] that a system with unknown and eventual bounds on relative process speeds and message delay is sufficient to implement ♦P, and thus to implement ♦S. Subsequently, it is shown in [4] that ♦S can be implemented in a system model where all processes execute in lock-step synchrony and there exists some correct process whose links are eventually timely; that is, eventually there is an upper bound on the message delay on these links. In later work, focus shifts to the weakest system model to implement ♦S and an equivalent failure detector Ω [11] (which, eventually and permanently, outputs the id of the same correct process at all correct processes) in environments where up to f processes may crash.…”
Section: Related Workmentioning
confidence: 99%
“…However, we show that a non-communication-efficient PSSCF solution can be implemented in such systems. Finally, we conclude with the basic system where all links can be asynchronous and lossy (S 0 ): the leader election has neither SSSCF nor PSSCF solution in S 0 [14,9].…”
Section: Introductionmentioning
confidence: 97%
“…We show that a CE-PSSCF leader election can be done in some weak systems where the SSSCF leader election cannot be done: any system having a timely source that is a process whose all ouput links are timely [14](S 2 ). Using a previous result of Aguilera et al [14], we then recall that communication-efficiency cannot be done if we consider systems having at least one timely source but no fair hub (a hub is a process whose all links are fair lossy) (S 1 ). However, we show that a non-communication-efficient PSSCF solution can be implemented in such systems.…”
Section: Introductionmentioning
confidence: 99%