State-machine replication for planet-scale systems

Enes, Vitor; Baquero, Carlos; Rezende, Tuanir França; Gotsman, Alexey; Perrin, Matthieu; Sutra, Pierre

doi:10.1145/3342195.3387543

Cited by 29 publications (52 citation statements)

References 22 publications

“…The protocol saturates at around 4K clients per site, when the outgoing network bandwidth at the leader reaches 95% usage. The fact that the leader can be a bottleneck in leader-based protocol has been reported by several prior works [13,23,24,31,43].…”

Section: Full Replication Deployment (mentioning)

confidence: 98%

Efficient replication via timestamp stability

Enes, Baquero, Gotsman, et al. 2021

Proceedings of the Sixteenth European Conference on Computer Systems

Self Cite

Modern web applications replicate their data across the globe and require strong consistency guarantees for their most critical data. These guarantees are usually provided via state-machine replication (SMR). Recent advances in SMR have focused on leaderless protocols, which improve the availability and performance of traditional Paxos-based solutions. We propose Tempo, a leaderless SMR protocol that, in comparison to prior solutions, achieves superior throughput and offers predictable performance even in contended workloads. To achieve these benefits, Tempo timestamps each application command and executes it only after the timestamp becomes stable, i.e., all commands with a lower timestamp are known. Both the timestamping and stability detection mechanisms are fully decentralized, thus obviating the need for a leader replica. Our protocol furthermore generalizes to partial replication settings, enabling scalability in highly parallel workloads. We evaluate the protocol in both real and simulated geo-distributed environments and demonstrate that it outperforms state-of-the-art alternatives.

CCS Concepts: • Theory of computation → Distributed algorithms.
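The stability rule described in the abstract (execute a command only once every command with a lower timestamp is known) can be illustrated with a toy sketch. This is not Tempo's actual protocol: it assumes, purely for illustration, that each replica publishes a watermark promising it will never assign a timestamp at or below that value to a new command, and it conservatively treats a timestamp as stable only when it is at or below every replica's watermark. The class `StableExecutor` and its method names are hypothetical.

```python
import heapq


class StableExecutor:
    """Toy model of timestamp-stability execution (illustrative only)."""

    def __init__(self, replica_ids):
        # Assumption: watermark[r] = r will never issue a timestamp <= this value.
        self.watermarks = {r: 0 for r in replica_ids}
        # Min-heap of committed commands: (timestamp, command_id, command).
        # command_id is assumed unique and breaks ties between equal timestamps.
        self.pending = []
        self.executed = []

    def commit(self, timestamp, command_id, command):
        """Record a committed command; it is not executed yet."""
        heapq.heappush(self.pending, (timestamp, command_id, command))

    def advance_watermark(self, replica_id, watermark):
        """Watermarks only move forward."""
        current = self.watermarks[replica_id]
        self.watermarks[replica_id] = max(current, watermark)

    def stable_timestamp(self):
        """Highest timestamp below which no new command can appear."""
        return min(self.watermarks.values())

    def execute_stable(self):
        """Execute, in timestamp order, every command at or below the stable point."""
        stable = self.stable_timestamp()
        ran = []
        while self.pending and self.pending[0][0] <= stable:
            _, _, command = heapq.heappop(self.pending)
            self.executed.append(command)
            ran.append(command)
        return ran


ex = StableExecutor(["a", "b", "c"])
ex.commit(2, "b1", "put y")
ex.commit(1, "a1", "put x")
ex.commit(5, "c1", "put z")
ex.advance_watermark("a", 3)
ex.advance_watermark("b", 4)
first = ex.execute_stable()   # replica "c" has promised nothing yet
ex.advance_watermark("c", 2)
second = ex.execute_stable()  # timestamps 1 and 2 are now stable
```

In this sketch no replica acts as a coordinator: every replica contributes a watermark, and any replica can compute the stable point locally, which is the sense in which the abstract calls the mechanism decentralized.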

“…Unfortunately, all existing leaderless SMR protocols suffer from drawbacks in the way they order commands. Some protocols [1,5,13,31] maintain explicit dependencies between commands: a replica may execute a command only after all its dependencies get executed. These dependencies may form arbitrary long chains.…”

Section: Introduction (mentioning)

confidence: 99%
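The dependency-based ordering mentioned in this citation statement can be sketched as follows. This is a simplified illustration, not the execution algorithm of any specific protocol: it assumes dependencies are acyclic (real protocols also have to handle cycles), and the function name `execute_in_dependency_order` and the command names are hypothetical.

```python
def execute_in_dependency_order(deps):
    """Execute commands only after all of their dependencies have executed.

    deps maps each known command to the set of commands it depends on.
    Returns (executed, blocked): the execution order, and the commands that
    cannot run because some dependency is unknown or itself blocked.
    """
    executed = []
    done = set()
    progress = True
    while progress:
        progress = False
        for cmd in sorted(deps):
            # A command is runnable once every dependency has executed.
            if cmd not in done and deps[cmd] <= done:
                executed.append(cmd)
                done.add(cmd)
                progress = True
    blocked = sorted(set(deps) - done)
    return executed, blocked


# c3 waits on c2, which waits on c1: a chain of length three.
# c4 depends on c9, which this replica has not learned about yet.
deps = {"c1": set(), "c2": {"c1"}, "c3": {"c2"}, "c4": {"c9"}}
order, blocked = execute_in_dependency_order(deps)
```

The chain `c1`, `c2`, `c3` shows why long dependency chains delay execution: the last command cannot run until every earlier link has run, and a single missing dependency (`c9`) blocks everything behind it.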

Efficient replication via timestamp stability

Enes, Baquero, Gotsman, et al. 2021

Proceedings of the Sixteenth European Conference on Computer Systems

Self Cite

“…Given the pervasiveness of replicated state machines in distributed systems and applications, improving the replication throughput has been an important problem in the past decade. Some solutions [9,23] call for boosting throughput and reducing latency at the same time, while others [7,35] argue for trading off some latency in exchange for better throughput. Both camps, however, take a similar high-level approach for improving their performance.…”

Section: Current Approaches To Scaling State Machine Replication (mentioning)

confidence: 99%

“…Usually, the bottleneck is at the leader, as it is used to communicate with the clients and coordinate the replication [1]. Systems like EPaxos [23,31] and Atlas [9] avoid having a one-node bottleneck by not having a single leader that centers all communication around it. Instead, these systems expand on Fast Paxos [17] ideas and try to use fast quorums to commit/replicate operations in one round-trip network latency from any node in the system.…”

Section: Current Approaches To Scaling State Machine Replication (mentioning)

confidence: 99%

“…For example, many traditional protocols [2,17,24,32] rely on a dedicated leader to prescribe the operations and their order to the follower nodes. EPaxos [23] conjectures that a single leader is a bottleneck for both the throughput and latency and removes it by allowing any node to become an opportunistic leader; other recent systems [9,31] further optimize EPaxos. Pig-Paxos [7] moves a significant portion of communication from the leader onto the followers.…”

Section: Introduction (mentioning)

confidence: 99%

Scalable but wasteful

Matte, Charapko, Aghayev 2021

Proceedings of the 13th ACM Workshop on Hot Topics in Storage and File Systems

Consensus protocols are at the core of strongly consistent replication deployed in cloud-based storage systems. There have been many proposals to optimize these protocols, most of which work by identifying and shifting load from bottlenecked nodes to underutilized nodes. We show that while these optimizations increase throughput, they sacrifice resource efficiency, which is paramount in a cloud setting. We propose a new metric to measure the efficiency of these protocols and show that using this metric, for example, the optimized EPaxos protocol is less efficient than the unoptimized Multi-Paxos protocol. We then demonstrate that Multi-Paxos can achieve 2× higher throughput than EPaxos in a fixed-budget resource setting that is typical of the cloud. Our work underlines the need for considering resource efficiency when optimizing consensus protocols, given that they are increasingly deployed in the cloud.
