A common way for a distributed system to tolerate crashes is to explicitly detect them and then recover from them. Interestingly, detection can take much longer than recovery, as a result of many advances in recovery techniques, making failure detection the dominant factor in these systems' unavailability when a crash occurs. This paper presents the design, implementation, and evaluation of Falcon, a failure detector with several features. First, Falcon's common-case detection time is sub-second, which keeps unavailability low. Second, Falcon is reliable: it never reports a process as down when it is actually up. Third, Falcon sometimes kills to achieve reliable detection, but it aims to kill the smallest needed component. Falcon achieves these features by coordinating a network of spies, each monitoring a layer of the system. Falcon's main cost is a small amount of platform-specific logic. Falcon is thus the first failure detector that is fast, reliable, and viable. As such, it could change the way that a class of distributed systems is built.
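To make the layered-spy idea concrete, here is a minimal sketch in Python; the names (`Spy`, `probe`, `kill_inner`, `detect`) and the exact control flow are my own illustration under stated assumptions, not Falcon's code. Spies are ordered from the innermost layer (e.g., the application) outward (e.g., the network switch); a responsive spy's verdict is trusted, and a silent spy's layer is killed by the next spy out, so DOWN can be reported without risking a false positive.

```python
# A minimal sketch of the layered-spy idea (hypothetical names, not Falcon's code).
from dataclasses import dataclass
from enum import Enum
from typing import Callable, List, Optional

class Verdict(Enum):
    UP = "up"
    DOWN = "down"

@dataclass
class Spy:
    layer: str                              # e.g., "app", "OS", "VMM", "switch"
    probe: Callable[[], Optional[Verdict]]  # None means the spy itself is silent
    kill_inner: Callable[[], None]          # kills the layer just inside this one

def detect(spies: List[Spy]) -> Verdict:
    """Consult spies from the innermost layer outward. A responsive spy's
    verdict is trusted. If a spy is silent, the next spy out kills the
    silent layer (the smallest suspect component), which makes reporting
    DOWN safe: the monitored target cannot still be up."""
    for i, spy in enumerate(spies):
        verdict = spy.probe()
        if verdict is not None:
            return verdict
        if i + 1 < len(spies):
            spies[i + 1].kill_inner()   # kill at the smallest scope needed
            return Verdict.DOWN
    return Verdict.DOWN  # all layers silent: fall back to an end-to-end verdict
```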
Network and process failures cause complexity in distributed applications. When a remote process does not respond, the application cannot tell whether the process or the network has failed, or whether they are just slow. Without this information, applications can lose availability or correctness. To address this problem, we propose Albatross, a service that quickly reports to applications the current status of a remote process: whether it is working and reachable, or not. Albatross is targeted at data centers equipped with software-defined networks (SDNs), allowing it to discover and enforce network partitions: Albatross borrows the old observation that it can be better to cause a problem than to live with uncertainty, and applies this idea to networks. When enforcing partitions, Albatross avoids disruption by disconnecting only individual processes (not entire hosts), and by allowing them to reconnect if the application chooses. We show that, under Albatross, distributed applications can bypass the complexity caused by network failures and become more available.
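A minimal sketch of process-granularity partition enforcement, assuming a hypothetical SDN controller API (`FakeSdnController` and its rule methods are stand-ins I invented, not Albatross's interface): the enforcer installs a drop rule that matches only the suspect process's flows, identified by host IP and port, and removes it if the application readmits the process.

```python
# A minimal sketch of SDN-based, per-process partition enforcement
# (hypothetical API, not Albatross's code).
from dataclasses import dataclass

@dataclass(frozen=True)
class ProcessId:
    host_ip: str
    port: int   # the port distinguishes this process from others on the host

class FakeSdnController:
    """Stand-in for a real SDN controller's flow-rule API."""
    def __init__(self):
        self.drop_rules = set()

    def add_drop_rule(self, match):
        self.drop_rules.add(match)

    def remove_drop_rule(self, match):
        self.drop_rules.discard(match)

class PartitionEnforcer:
    def __init__(self, ctl: FakeSdnController):
        self.ctl = ctl

    def cut_off(self, p: ProcessId) -> None:
        # Drop only this process's flows; other processes on the same host
        # keep their connectivity, which limits collateral disruption.
        self.ctl.add_drop_rule((p.host_ip, p.port))

    def readmit(self, p: ProcessId) -> None:
        # The application may choose to let the process reconnect later.
        self.ctl.remove_drop_rule((p.host_ip, p.port))
```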
You put a program on a concurrent server, but you don't trust the server; later, you get a trace of the actual requests that the server received from its clients and the responses that it delivered. You separately get logs from the server; these are untrusted. How can you use the logs to efficiently verify that the responses were derived from running the program on the requests? This is the Efficient Server Audit Problem, which abstracts real-world scenarios, including running a web application on an untrusted provider. We give a solution based on several new techniques, including simultaneous replay and efficient verification of concurrent executions. We implement the solution for PHP web applications. For several applications, our verifier achieves a 5.6–10.9× speedup versus simply re-executing, with <10% overhead for the server.
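The core audit check can be sketched as follows; `audit`, `program`, and the data shapes are my own simplification, and this version replays requests one at a time, omitting the simultaneous-replay acceleration that the paper's verifier relies on. The idea: replay is deterministic given the logged nondeterministic inputs, so any divergence between replayed and traced responses means the untrusted logs cannot explain the trusted trace.

```python
# A minimal sketch of the audit check (simplified; not the paper's verifier).
def audit(program, trace, logs) -> bool:
    """trace: (request, response) pairs captured at the clients (trusted).
    logs: per-request records of nondeterministic inputs, e.g. the values
    read from shared state (untrusted). Replay is deterministic given a log,
    so any divergence means the logs cannot explain the trace."""
    for (request, expected_response), log in zip(trace, logs):
        if program(request, log) != expected_response:
            return False  # server misbehaved, or the logs are inconsistent
    # A full verifier must also check that the logs are internally
    # consistent (e.g., reads match earlier writes); omitted here.
    return True
```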