2015
DOI: 10.1145/2796314.2745873
|View full text |Cite
|
Sign up to set email alerts
|

Reducing Latency via Redundant Requests

Abstract: Recent computer systems research has proposed using redundant requests to reduce latency. The idea is to run a request on multiple servers and wait for the first completion (discarding all remaining copies of the request). However there is no exact analysis of systems with redundancy. This paper presents the first exact analysis of systems with redundancy. We allow for any number of classes of redundant requests, any number of classes of non-redundant requests, any degree of redundancy, and any numbe… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
43
0

Year Published

2016
2016
2023
2023

Publication Types

Select...
4
2
2

Relationship

0
8

Authors

Journals

citations
Cited by 80 publications
(47 citation statements)
references
References 27 publications
0
43
0
Order By: Relevance
“…In [6], the authors provided delay bounds under the MDS code. Papers (e.g., [4], [7]- [13]) studied the delay performance of redundant requests in various settings. To the best of our knowledge, none of these works are able to characterize or analytically bound the performance improvement under coding compared with replication.…”
Section: A Related Workmentioning
confidence: 99%
“…In [6], the authors provided delay bounds under the MDS code. Papers (e.g., [4], [7]- [13]) studied the delay performance of redundant requests in various settings. To the best of our knowledge, none of these works are able to characterize or analytically bound the performance improvement under coding compared with replication.…”
Section: A Related Workmentioning
confidence: 99%
“…In fact, canceling plays a key role in obtaining the scarce analytical solutions available for request replication [10], [11]. For instance, canceling enables [11] to reduce the problem to a single M/G/1 queue in the early-canceling case or to an approximate M/G/C queue in the late-canceling one.…”
Section: Related Workmentioning
confidence: 99%
“…For instance, canceling enables [11] to reduce the problem to a single M/G/1 queue in the early-canceling case or to an approximate M/G/C queue in the late-canceling one. Similarly, canceling is key in obtaining the solution to the Markov chain model in [10]. The absence of canceling allows servers to evolve asynchronously, requiring a more complex analysis.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…. $15.00 DOI: 10.1145/nnnnnnn.nnnnnnn simultaneously download from di erent replicas, or each user can issue simultaneous requests to multiple replicas and wait for the rst download to nish [6,7,15]. Data access systems arising in erasure coded storage have received a lot of a ention for all-data (see for example [12,14,19,24] and references therein) and some for hot-data download [16,17], and usually involve single or multiple inter-dependent fork-join queueing systems.…”
Section: Introductionmentioning
confidence: 99%