2021
DOI: 10.1016/j.knosys.2021.106844
|View full text |Cite
|
Sign up to set email alerts
|

Forgetful experience replay in hierarchical reinforcement learning from expert demonstrations

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
10
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
5
1
1

Relationship

1
6

Authors

Journals

citations
Cited by 18 publications
(10 citation statements)
references
References 6 publications
0
10
0
Order By: Relevance
“…Since MineRL was held in 2019, many solutions have been proposed to learn to play in Minecraft. There works can be grouped into two categories: 1) end-to-end learning [Amiranashvili et al, 2020;Kanervisto et al, 2020;Scheller et al, 2020]; 2) HRL with human demonstrations [Skrynnik et al, 2021;Mao et al, 2021]. Our approach belongs to the second category.…”
Section: Related Workmentioning
confidence: 99%
See 3 more Smart Citations
“…Since MineRL was held in 2019, many solutions have been proposed to learn to play in Minecraft. There works can be grouped into two categories: 1) end-to-end learning [Amiranashvili et al, 2020;Kanervisto et al, 2020;Scheller et al, 2020]; 2) HRL with human demonstrations [Skrynnik et al, 2021;Mao et al, 2021]. Our approach belongs to the second category.…”
Section: Related Workmentioning
confidence: 99%
“…Our approach belongs to the second category. In this category, prior works leverage the structure of the tasks and learn a hierarchical agent to play in Minecraft -ForgER [Skrynnik et al, 2021] proposed a hierarchical method with forgetful experience replay to allow the agent to learn from low-quality demonstrations; Mao et al [2021] proposed SEIHAI that fully takes advantage of the human demonstrations and the task structure. Sample-efficient Reinforcement Learning.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…In this paper, we propose to use reinforcement learning [9], [12] to generate the behavior of each autonomous agent in a multi-agent partially-observable environment, which in our case is an abstraction for the Internet of vehicles. Modelfree reinforcement learning methods have shown excellent results in behavior generation tasks for single agents [13]- [15] and cooperative environments [16], [17]. Modern deep reinforcement learning algorithms cope well with complex observation space (visual environments) [18] and stochastic environmental conditions [19].…”
Section: Introductionmentioning
confidence: 99%