2019 IEEE 12th International Conference on Cloud Computing (CLOUD) 2019
DOI: 10.1109/cloud.2019.00067
|View full text |Cite
|
Sign up to set email alerts
|

TrIMS: Transparent and Isolated Model Sharing for Low Latency Deep Learning Inference in Function-as-a-Service

Abstract: Deep neural networks (DNNs) have become core computation components within low latency Function as a Service (FaaS) prediction pipelines: including image recognition, object detection, natural language processing, speech synthesis, and personalized recommendation pipelines. Cloud computing, as the de-facto backbone of modern computing infrastructure for both enterprise and consumer applications, has to be able to handle user-defined pipelines of diverse DNN inference workloads while maintaining isolation and l… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
10
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
6
3

Relationship

1
8

Authors

Journals

citations
Cited by 16 publications
(10 citation statements)
references
References 31 publications
(26 reference statements)
0
10
0
Order By: Relevance
“…Electronic markets, for example, are increasingly augmented with AI-based systems such as customer service chatbots (Adam et al 2020). Likewise, several cloud providers recently began offering 'AI as a Service' (AIaaS), referring to web services for organizations and individuals interested in training, building, and deploying AI-based systems (Dakkak et al 2019;Rai et al 2019). Although cost-and time-saving opportunities have triggered a widespread implementation of AI-based systems and services in electronic markets, trust persists to play a pivotal role in any buyer-seller relationship (Bauer et al 2019;Marella et al 2020).…”
Section: Introductionmentioning
confidence: 99%
“…Electronic markets, for example, are increasingly augmented with AI-based systems such as customer service chatbots (Adam et al 2020). Likewise, several cloud providers recently began offering 'AI as a Service' (AIaaS), referring to web services for organizations and individuals interested in training, building, and deploying AI-based systems (Dakkak et al 2019;Rai et al 2019). Although cost-and time-saving opportunities have triggered a widespread implementation of AI-based systems and services in electronic markets, trust persists to play a pivotal role in any buyer-seller relationship (Bauer et al 2019;Marella et al 2020).…”
Section: Introductionmentioning
confidence: 99%
“…The RL model takes as input the Experiment GPU Scales: the scale of physical testbed. S (0, 30] M (30,60] L (60, 120] XL (120, ∞] -: no evaluation on a physical cluster or not clearly specified. job time information, resource demand, and accuracy requirements.…”
Section: Efficiencymentioning
confidence: 99%
“…In contrast, with cloud services like IaaS and PaaS, the abstraction is done for hardware and operating system, respectively. Whereas in serverless computing, the abstraction is done at language runtime, and everything is provided using functions, so it gives function as a Service (FaaS) [19]. Fig 2 shows the user's control over the cloud services.…”
Section: A Trends Impacting I Andomentioning
confidence: 99%