Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence 2019
DOI: 10.24963/ijcai.2019/316
|View full text |Cite
|
Sign up to set email alerts
|

Hybrid Actor-Critic Reinforcement Learning in Parameterized Action Space

Abstract: In this paper we propose a hybrid architecture of actor-critic algorithms for reinforcement learning in parameterized action space, which consists of multiple parallel sub-actor networks to decompose the structured action space into simpler action spaces along with a critic network to guide the training of all sub-actor networks. While this paper is mainly focused on parameterized action space, the proposed architecture, which we call hybrid actor-critic, can be extended for more general action spaces which ha… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
20
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
2
2

Relationship

0
8

Authors

Journals

citations
Cited by 58 publications
(20 citation statements)
references
References 2 publications
0
20
0
Order By: Relevance
“…To solve the stochastic optimization problem defined in Section 2.2, we need an algorithm that allows a mixture of continuous (harvesting quantities) and discrete actions (timings of clear-cuts and thinnings). We approach the problem of continuous action and state space using the notion of parameterized action spaces suggested by Fan et al (2019). The idea is to view the overall action as a hierarchical structure instead of a flat set.…”
Section: Rl Algorithm With Parameterized Action Spacesmentioning
confidence: 99%
See 2 more Smart Citations
“…To solve the stochastic optimization problem defined in Section 2.2, we need an algorithm that allows a mixture of continuous (harvesting quantities) and discrete actions (timings of clear-cuts and thinnings). We approach the problem of continuous action and state space using the notion of parameterized action spaces suggested by Fan et al (2019). The idea is to view the overall action as a hierarchical structure instead of a flat set.…”
Section: Rl Algorithm With Parameterized Action Spacesmentioning
confidence: 99%
“…To handle the parameterized action space containing both discrete actions and continuous parameters, Fan et al (2019) have proposed a hybrid proximal policy optimization (H-PPO) algorithm.…”
Section: Rl Algorithm With Parameterized Action Spacesmentioning
confidence: 99%
See 1 more Smart Citation
“…Previous work has touched upon the idea of using trees as a formalization of action spaces with multiple components such as in [16], where the tree structure is referred to as a Hierarchical Action Space. Other works have used action spaces that are similar to those used in this paper as examples of action trees.…”
Section: A Action Treesmentioning
confidence: 99%
“…Unlike [22], we do not transfer any target samples back to the internal model. Similar to the problem settings of [23]- [25], our agent predicts the best parameters for a controller.…”
Section: Related Workmentioning
confidence: 99%