Proceedings of the ACM SIGOPS 22nd Symposium on Operating Systems Principles 2009
DOI: 10.1145/1629575.1629602
|View full text |Cite
|
Sign up to set email alerts
|

Upright cluster services

Abstract: The UpRight library seeks to make Byzantine fault tolerance (BFT) a simple and viable alternative to crash fault tolerance for a range of cluster services. We demonstrate UpRight by producing BFT versions of the Zookeeper lock service and the Hadoop Distributed File System (HDFS). Our design choices in UpRight favor simplifying adoption by existing applications; performance is a secondary concern. Despite these priorities, our BFT Zookeeper and BFT HDFS implementations have performance comparable with the orig… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
163
0
1

Year Published

2012
2012
2022
2022

Publication Types

Select...
4
2
2

Relationship

0
8

Authors

Journals

citations
Cited by 150 publications
(165 citation statements)
references
References 30 publications
1
163
0
1
Order By: Relevance
“…Byzantine fault tolerance (BFT) (Castro and Liskov, 2002;Clement et al, 2009a) is a promising technology that could help an application achieve high availability and trustworthiness. A Byzantine fault (Lamport et al, 1982) refers to an arbitrary fault, which could be a crash or malicious fault.…”
Section: Request-2mentioning
confidence: 99%
See 1 more Smart Citation
“…Byzantine fault tolerance (BFT) (Castro and Liskov, 2002;Clement et al, 2009a) is a promising technology that could help an application achieve high availability and trustworthiness. A Byzantine fault (Lamport et al, 1982) refers to an arbitrary fault, which could be a crash or malicious fault.…”
Section: Request-2mentioning
confidence: 99%
“…There are a large body of work on modern BFT algorithms, such as (Castro and Liskov, 2002;Clement et al, 2009aClement et al, , 2009bSingh et al, 2009). These algorithms are designed to protect generic stateful servers against Byzantine faults in a client-server environment.…”
Section: Related Workmentioning
confidence: 99%
“…Starting from PBFT [12], various proposals [27,28,14,29,15] aim to reduce latency and increase throughput. Aardvark [30] and Zyzzyvark [31] focus on sustainable performance rather than peak performance. Other proposals focus on reducing the number of full replicas [13,32,33].…”
Section: Comparison With Prior Workmentioning
confidence: 99%
“…An important project related to Hadoop's omission failures is presented in [37]. In this work, authors have tried to build separate fault tolerance thresholds in the UpRight library for omission and commission failures, because omission failures are likely to be more common than commission failures.…”
Section: Resource Aware Speculative Scheduling (Ras)mentioning
confidence: 99%
“…The work discussing the omission failures in [37], is actually a wider review that includes the byzantine failures in general. The main properties upon which the UpRight library is based are:…”
Section: Arbitrary (Byzantine) Failurementioning
confidence: 99%