Proceedings of the Twentieth ACM International Symposium on Mobile Ad Hoc Networking and Computing 2019
DOI: 10.1145/3323679.3326528
|View full text |Cite
|
Sign up to set email alerts
|

Timely-Throughput Optimal Coded Computing over Cloud Networks

Abstract: In modern distributed computing systems, unpredictable and unreliable infrastructures result in high variability of computing resources. Meanwhile, there is significantly increasing demand for timely and event-driven services with deadline constraints. Motivated by measurements over Amazon EC2 clusters, we consider a two-state Markov model for variability of computing speed in cloud networks. In this model, each worker can be either in a good state or a bad state in terms of the computation speed, and the tran… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
23
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
5
4

Relationship

1
8

Authors

Journals

citations
Cited by 41 publications
(23 citation statements)
references
References 34 publications
0
23
0
Order By: Relevance
“…Coded computing is a recent technique that enables optimal tradeoffs between computation load, communication load, and computation latency due to stragglers in distributed computing (see, e.g., [5], [6], [37]- [39]). Therefore, designing joint task scheduling and coded computing in order to leverage tradeoffs between computation, communication, and latency could be an interesting problem (e.g., [40]). µ (k,j) p (k,j) , = r (k,j) − r (k+1,j) (104) = µ (k,j) p (k,j) − µ (k+1,j) p (k+1,j) .…”
Section: Discussionmentioning
confidence: 99%
“…Coded computing is a recent technique that enables optimal tradeoffs between computation load, communication load, and computation latency due to stragglers in distributed computing (see, e.g., [5], [6], [37]- [39]). Therefore, designing joint task scheduling and coded computing in order to leverage tradeoffs between computation, communication, and latency could be an interesting problem (e.g., [40]). µ (k,j) p (k,j) , = r (k,j) − r (k+1,j) (104) = µ (k,j) p (k,j) − µ (k+1,j) p (k+1,j) .…”
Section: Discussionmentioning
confidence: 99%
“…according to Eq. (7). Using proof of contradiction, we can then derive that the infimum and supremum of λ i are attained when p i → ∞ and p i = 1, respectively.…”
Section: Appendix Proof Of Lemmamentioning
confidence: 96%
“…One major challenge is that many computing frameworks are vulnerable to uncertain disturbances, such as node/link failures, communication congestion, and slow-downs [6]. Such disturbances, which can be modeled as stragglers that are slow or even fail in returning results, have been observed in many largescale computing systems such as cloud computing [7], mobile edge computing [8], and fog computing [9].…”
Section: Introductionmentioning
confidence: 99%
“…The first work found the exact expression of the average AoI, while the latter found its entire distribution. Timely applications in the edge/cloud were studied in [9]- [12]. Particularly, in the latter, a feedback loop is inserted as a means of reducing the load at the server side, but only the percentage of aborted tasks is considered as a metric of the advantage of having feedback.…”
Section: Introductionmentioning
confidence: 99%