2023 IEEE 16th International Conference on Cloud Computing (CLOUD) 2023
DOI: 10.1109/cloud60044.2023.00046
|View full text |Cite
|
Sign up to set email alerts
|

Performance Characterization of Multi-Container Deployment Schemes for Online Learning Inference

Peini Liu,
Jordi Guitart,
Amir Taherkordi

Abstract: Online machine learning (ML) inference services provide users with an interactive way to request for predictions in real-time. To meet the notable computational requirements of such services, they are increasingly being deployed in the Cloud. In this context, the efficient provisioning and optimization of ML inference services in the Cloud is critical to achieve the required performance and meet the dynamic queries by end-users. Existing provisioning solutions focus on framework parameter tuning and infrastruc… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
0
0

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 19 publications
0
0
0
Order By: Relevance
“…Our contributions to achieve this objective are as follows, and have resulted in publications [99][101] [102]. We consider that most of our conclusions in the objective would also apply to BD workloads.…”
Section: Objective 2: Understand the Performance Of Hpc Bd And ML App...mentioning
confidence: 99%
“…Our contributions to achieve this objective are as follows, and have resulted in publications [99][101] [102]. We consider that most of our conclusions in the objective would also apply to BD workloads.…”
Section: Objective 2: Understand the Performance Of Hpc Bd And ML App...mentioning
confidence: 99%