2016 5th International Conference on Informatics, Electronics and Vision (ICIEV) 2016
DOI: 10.1109/iciev.2016.7760166
|View full text |Cite
|
Sign up to set email alerts
|

Reinforcement learning based autonomic virtual machine management in clouds

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
2
0
1

Year Published

2016
2016
2024
2024

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 8 publications
(3 citation statements)
references
References 7 publications
(14 reference statements)
0
2
0
1
Order By: Relevance
“…Nonetheless, they may have drawbacks such as high costs and extensive computing power requirements. The Markov decision process, optimized with reinforcement learning (RL), is another technique applied in this context [36]. Additionally, deep learning can be combined with RL to explore data characteristics and learn scheduling strategies, or it can be applied directly to the VM placement problem [37].…”
Section: Selection Of Destination Server For Vm Migrationmentioning
confidence: 99%
“…Nonetheless, they may have drawbacks such as high costs and extensive computing power requirements. The Markov decision process, optimized with reinforcement learning (RL), is another technique applied in this context [36]. Additionally, deep learning can be combined with RL to explore data characteristics and learn scheduling strategies, or it can be applied directly to the VM placement problem [37].…”
Section: Selection Of Destination Server For Vm Migrationmentioning
confidence: 99%
“…RL has been used to solve many routing problems over the past few years. For RL-CH, we have used Q-learning [8,17,18] to accommodate multiple CHs in the model. Q-learning is a model free RL algorithm where an agent interacts with the environment and learns the optimal policy by yielding some scalar rewards.…”
Section: Rl-ch Data Aggregation Mechanismmentioning
confidence: 99%
“…Diferentemente, neste artigo os estados são baseados inteiramente na banda contratada. Habib e Khan adotam uma proposta de ações similar a de Mera-Gomez et al, com foco na alocação de recursos em nuvens, porém comparando com outro algoritmo de aprendizado por reforço, o SARSA (State-Action-Reward-State-Action) [Habib e Khan, 2016]. O trabalho conclui que o algoritmo Q-learning apresenta melhor desempenho em relação ao tempo de convergência.…”
Section: Os Trabalhos Relacionadosunclassified