2017 IEEE International Conference on Autonomic Computing (ICAC) 2017
DOI: 10.1109/icac.2017.17
|View full text |Cite
|
Sign up to set email alerts
|

DIAL: Reducing Tail Latencies for Cloud Applications via Dynamic Interference-aware Load Balancing

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
11
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 35 publications
(11 citation statements)
references
References 26 publications
0
11
0
Order By: Relevance
“…V. RELATED WORK Building distributed systems that offer guarantees on their timely execution while the system is subject to uncertainty and changes is a challenging task. Bounding latencies is of utmost importance, but this is quite difficult in the presence of changes [5,6,14,24,25,42]. Changes are unpredictable, they can be dramatic, and they can include malfunctioning [23], slow down [12], failures [18], and much more.…”
Section: B Experimental Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…V. RELATED WORK Building distributed systems that offer guarantees on their timely execution while the system is subject to uncertainty and changes is a challenging task. Bounding latencies is of utmost importance, but this is quite difficult in the presence of changes [5,6,14,24,25,42]. Changes are unpredictable, they can be dramatic, and they can include malfunctioning [23], slow down [12], failures [18], and much more.…”
Section: B Experimental Resultsmentioning
confidence: 99%
“…To determine the size of data centers, and properly dimension the resources to be allocated in each geographic location, most data center owners use predictions of the computational needs [29,36]. The computational resource within a data center is then used to serve requests coming from multiple clients, providing the illusion of infinite capacity and, as a result, the possibility of bounding the latency [5,6,14,24,25,42]. To do so the architecture uses multiple instances of the same application, here called replicas, and predictions and estimations of traffic and needed computational capacity.…”
Section: Introductionmentioning
confidence: 99%
“…In DIAL [19], interference detection is accomplished using decision tree-based classifier to find the dominant source of resource contention. To quantify the resource interference impact on a webserver application's tail response, a queuing model is utilized to determine the application's response time under contention.…”
Section: R a F Tmentioning
confidence: 99%
“…The load-balancing approach we suggest is designed specifically for DSAG, but is inspired by the large number of previous works on the topic; see, e.g., [35], [36], [37], [38], and references therein. These suggest approaches to balance either i) the complexity of the subtasks that make up a particular large computation (e.g., [35], [36]), or ii) incoming requests between instances of a distributed application, such as a web server (e.g., [37], [38]).…”
Section: Introductionmentioning
confidence: 99%
“…The load-balancing approach we suggest is designed specifically for DSAG, but is inspired by the large number of previous works on the topic; see, e.g., [35], [36], [37], [38], and references therein. These suggest approaches to balance either i) the complexity of the subtasks that make up a particular large computation (e.g., [35], [36]), or ii) incoming requests between instances of a distributed application, such as a web server (e.g., [37], [38]). The approach we suggest, like those of [37], [38], but unlike [35], [36], accounts for latency differences between servers and over time-as is the case in the cloud-but balances the complexity of subtasks, as in [35], [36].…”
Section: Introductionmentioning
confidence: 99%