2020
DOI: 10.1007/978-3-030-43229-4_40
|View full text |Cite
|
Sign up to set email alerts
|

Evaluating the Use of Policy Gradient Optimization Approach for Automatic Cloud Resource Provisioning

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
4
1

Relationship

2
3

Authors

Journals

citations
Cited by 7 publications
(8 citation statements)
references
References 10 publications
0
8
0
Order By: Relevance
“…Such a model is used to demonstrate the performance and adaptiveness of the discussed approach to the control of dynamic traffic. In [ 10 ] we demonstrated how a similar algorithm, the Proximal Policy Optimization (PPO) [ 20 ], can be used to horizontally scale cloud resources. The implementation has been limited to control resources of a single type.…”
Section: Related Workmentioning
confidence: 99%
See 3 more Smart Citations
“…Such a model is used to demonstrate the performance and adaptiveness of the discussed approach to the control of dynamic traffic. In [ 10 ] we demonstrated how a similar algorithm, the Proximal Policy Optimization (PPO) [ 20 ], can be used to horizontally scale cloud resources. The implementation has been limited to control resources of a single type.…”
Section: Related Workmentioning
confidence: 99%
“…In our previous research [ 10 ] we experimented with a number of policy gradient methods (Vanilla Policy Gradient, Proximal Policy Optimization, Trust-Region Policy Optimization) out of which the PPO rendered the best empirical results in the automated resources management.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…Reinforcement Learning [11] is a data-driven approach for adaptively applying optimized control policies based on real-time feedback, which models the stochastic process under the framework of Markov Decision Process (MDP) [21]. Policy gradient [7] is one of the most common types of reinforcement learning algorithms. In the policy gradient approach, the optimal actions with model parameters can be learned directly.…”
Section: Policy Gradient Approachmentioning
confidence: 99%