2021
DOI: 10.48550/arxiv.2102.02639
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Improving Reinforcement Learning with Human Assistance: An Argument for Human Subject Studies with HIPPO Gym

Abstract: Reinforcement learning (RL) is a popular machine learning paradigm for game playing, robotics control, and other sequential decision tasks. However, RL agents often have long learning times with high data requirements because they begin by acting randomly. In order to better learn in complex tasks, this article argues that an external teacher can often significantly help the RL agent learn.OpenAI Gym is a common framework for RL research, including a large number of standard environments and agents, making RL … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 30 publications
0
1
0
Order By: Relevance
“…This study showed that an information-gain objective decreasing uncertainty was well suited to propose effective questions. There has been a recent effort to provide testing environments for human-subject experimentation in reinforcement learning (Taylor et al, 2021) or active querying (Bıyık et al, 2022b). However, existing work has focused on simple feedback types and user interfaces, and more extensive human studies as possible future research directions have been highlighted.…”
Section: Related Workmentioning
confidence: 99%
“…This study showed that an information-gain objective decreasing uncertainty was well suited to propose effective questions. There has been a recent effort to provide testing environments for human-subject experimentation in reinforcement learning (Taylor et al, 2021) or active querying (Bıyık et al, 2022b). However, existing work has focused on simple feedback types and user interfaces, and more extensive human studies as possible future research directions have been highlighted.…”
Section: Related Workmentioning
confidence: 99%