Efficient Actor-Critic Algorithm with Hierarchical Model Learning and Planning

Zhong, Shuncong; Liu, Quan; Fu, Qiming

doi:10.1155/2016/4824072

Cited by 5 publications

(1 citation statement)

References 26 publications

(22 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…In this process, VM is an agent and server is the environment, where VM takes an action by interacting with the server at each cycle. MDP can be represented as four-tuple ðS, A, P, RÞ, and these are described as follows [22]: S representing as state space: s t ∊S denotes the state of the server at time period t.…”

Section: Makespan Timementioning

confidence: 99%

Action-Based Load Balancing Technique in Cloud Network Using Actor-Critic-Swarm Optimization

Pradhan

Bisoy

Sain

2022

Wireless Communications and Mobile Computing

View full text Add to dashboard Cite

Increasing scale of task in cloud network leads to problem in load balancing and its improvement in parameters. In this paper, we proposed a hybrid scheduling policy which is hybrid of both Particle Swarm Optimization (PSO) algorithm and actor-critic algorithm named as Hybrid Particle Swarm Optimization Actor Critic (HPSOAC) to solve this issue. This hybrid scheduling policy helps to each agent to improve an individual learning as well as learning through exchanging information among other agents. An experiment is carried out by the help of Python simulator with TensorFlow. Outcome shows that our proposed scheduling policy reduces 5.16% and 10.86% in energy consumption, reduces 7.13% and 10.04% in makespan time, and has marginally better resource utilization over Deep Q-network (DQN) and Q-learning based on Modified Particle Swarm Optimization (QMPSO) algorithm, respectively.

show abstract

Section: Makespan Timementioning

confidence: 99%