2020 IEEE 13th International Conference on Cloud Computing (CLOUD) 2020
DOI: 10.1109/cloud49709.2020.00070
|View full text |Cite
|
Sign up to set email alerts
|

Proactive Container Auto-scaling for Cloud Native Machine Learning Services

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 16 publications
(2 citation statements)
references
References 15 publications
0
2
0
Order By: Relevance
“…Therefore, the proposed model's ability to predict a burst increase in the needed resources remains unknown. Buchaca et al used the AI4DL framework [19], [20] to characterize workload and discover resource consumption phases. First, the existing technology was advanced to an incremental phase discovery method that applies to more general ML workload types for training and inference.…”
Section: B Proactive Scaling Trendsmentioning
confidence: 99%
“…Therefore, the proposed model's ability to predict a burst increase in the needed resources remains unknown. Buchaca et al used the AI4DL framework [19], [20] to characterize workload and discover resource consumption phases. First, the existing technology was advanced to an incremental phase discovery method that applies to more general ML workload types for training and inference.…”
Section: B Proactive Scaling Trendsmentioning
confidence: 99%
“…Although there are approaches which take stream arrival rates into account [15], [16], they often apply model predictive control principles to predict upcoming load intensities and scale the parallelism of applied streaming operators. The prediction models are either trained on historical data from completed containers [17] or by feedback loop based system which rely on re-calibration processes in case of performance degradation [18]. In contrast, we focus on a single and efficient profiling process which enables the adaptive adjustment of resources based on the initially created model.…”
Section: Related Workmentioning
confidence: 99%