2021
DOI: 10.48550/arxiv.2105.02510
Preprint

Towards Inference Delivery Networks: Distributing Machine Learning with Optimality Guarantees

Abstract: We present the novel idea of inference delivery networks (IDN), networks of computing nodes that coordinate to satisfy inference requests achieving the best trade-off between latency and accuracy. IDNs bridge the dichotomy between device and cloud execution by integrating inference delivery at the various tiers of the infrastructure continuum (access, edge, regional data center, cloud). We propose a distributed dynamic policy for ML model allocation in an IDN by which each node periodically updates its local s…

Cited by 0 publications
References 55 publications (46 reference statements)