2017
DOI: 10.1145/3055281
|View full text |Cite
|
Sign up to set email alerts
|

Efficient Redundancy Techniques for Latency Reduction in Cloud Systems

Abstract: In computing-as-a-service frameworks, the computing cost is proportional to money spent on renting machines to run a job on the cloud 2 . PREVIOUS WORK AND MAIN CONTRIBUTIONS Related Previous WorkSystems Work: The use of redundancy to reduce latency is not new. One of the earliest instances is the use of multiple routing paths [Maxemchuk 1975] to send packets in networks; see [Kabatiansky et al. 2005, Chapter 7] for a detailed survey of other related work. A similar idea has been studied [Vulimiri et al. 2013… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
107
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
3
3
3

Relationship

3
6

Authors

Journals

citations
Cited by 122 publications
(110 citation statements)
references
References 36 publications
1
107
0
Order By: Relevance
“…For the MDS and replication strategies, we reduce the queueing system to a fork-join queueing system with redundancy, and then use previous results [32,35] to obtain bounds on the mean response time. The results are presented in Appendix D.…”
Section: Queueing Analysismentioning
confidence: 99%
“…For the MDS and replication strategies, we reduce the queueing system to a fork-join queueing system with redundancy, and then use previous results [32,35] to obtain bounds on the mean response time. The results are presented in Appendix D.…”
Section: Queueing Analysismentioning
confidence: 99%
“…Queuing systems with redundancy, on the other hand, are studied in literature, e.g. [3], [11], [21], [25]. With redundancy, two scenarios for the cancellation of redundant copies of a jobs have been studied; cancellation after the first copy starts service, [3], [11], and cancellation after the first copy finishes service, [21], [25].…”
Section: Problem Statementmentioning
confidence: 99%
“…Two scenarios have been proposed for treating redundancy in distributed systems. In the first scenario, [3] and [11], copies of an arriving job are submitted to multiple servers and the redundant copies get cancelled once the first copy starts service. The first copy of a job starting the service is the one which faces the least-work-left queue among all the copies.…”
Section: Introductionmentioning
confidence: 99%
“…If, at demand λ, there exists a splitting strategy under which no storage system node receives requests at a rate in excess of its service rate, then λ is said to be in the achievable service rate region of the storage system. More formally, the storage system's achievable service rate region S is the set of all λ ∈ R K ≥0 such that there exists a splitting strategy with (4) For any λ = (λ1, . .…”
Section: Preliminariesmentioning
confidence: 99%