Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence 2019
DOI: 10.24963/ijcai.2019/331
|View full text |Cite
|
Sign up to set email alerts
|

Using Natural Language for Reward Shaping in Reinforcement Learning

Abstract: Recent reinforcement learning (RL) approaches have shown strong performance in complex domains such as Atari games, but are often highly sample inefficient. A common approach to reduce interaction time with the environment is to use reward shaping, which involves carefully designing reward functions that provide the agent intermediate rewards for progress towards the goal. However, designing appropriate shaping rewards is known to be difficult as well as time-consuming. In this work, we address this problem by… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
58
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
3
1
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 50 publications
(59 citation statements)
references
References 7 publications
0
58
0
Order By: Relevance
“…As illustrated in Figure 1, we separate the literature into language-conditional RL (in which interaction with language is necessitated by the problem formulation itself) and language-assisted RL (in which language is used to facilitate learning). The two categories are not mutually exclusive, in that for some languageconditional RL tasks, NLP methods or additional textual corpora are used to assist learning Goyal et al, 2019].…”
Section: Current Use Of Natural Language In Rlmentioning
confidence: 99%
See 4 more Smart Citations
“…As illustrated in Figure 1, we separate the literature into language-conditional RL (in which interaction with language is necessitated by the problem formulation itself) and language-assisted RL (in which language is used to facilitate learning). The two categories are not mutually exclusive, in that for some languageconditional RL tasks, NLP methods or additional textual corpora are used to assist learning Goyal et al, 2019].…”
Section: Current Use Of Natural Language In Rlmentioning
confidence: 99%
“…Methods developed for languageconditional tasks are relevant for language-assisted RL as they both deal with the problem of grounding natural language sentences in the context of RL. Moreover, in tasks such as following sequences of instructions, the full instructions are often not necessary to solve the underlying RL problem but they assist learning by structuring the policy [Andreas et al, 2017] or by providing auxiliary rewards [Goyal et al, 2019].…”
Section: Language-conditional Rlmentioning
confidence: 99%
See 3 more Smart Citations