2018
DOI: 10.1561/9781680834116
|View full text |Cite
|
Sign up to set email alerts
|

An Algorithmic Perspective on Imitation Learning

Abstract: As robots and other intelligent agents move from simple environments and problems to more complex, unstructured settings, manually programming their behavior has become increasingly challenging and expensive. Often, it is easier for a teacher to demonstrate a desired behavior rather than attempt to manually engineer it. This process of learning from demonstrations, and the study of algorithms to do so, is called imitation learning. This work provides an introduction to imitation learning. It covers the underly… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
47
0

Year Published

2018
2018
2021
2021

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 174 publications
(47 citation statements)
references
References 18 publications
0
47
0
Order By: Relevance
“…This baseline directly learns a policy from an initial set of demonstrations using supervised learning. This approach is called behavioral cloning (see the survey of imitation learning given by Osa et al (2018)); in each of our experiments, we describe the policy models used. It is important to note that this approach requires fully observed demonstrations.…”
Section: Experimental Methodologymentioning
confidence: 99%
See 1 more Smart Citation
“…This baseline directly learns a policy from an initial set of demonstrations using supervised learning. This approach is called behavioral cloning (see the survey of imitation learning given by Osa et al (2018)); in each of our experiments, we describe the policy models used. It is important to note that this approach requires fully observed demonstrations.…”
Section: Experimental Methodologymentioning
confidence: 99%
“…This fully observed setting is typical of the literature on imitation learning (Osa et al, 2018), where one uses a function approximation to learn the state to action mapping from these samples. However, one may have a more limited access to a limited supervisor, where only the states are observed.…”
Section: Problem Setupmentioning
confidence: 99%
“…where i is the index of a sequence in a batch, B the batch size, y i t = τ d,i pk,t , τ d,i pa,t the vector of labels, i t the loss of time step t, L the loss of a batch with sequences of length T . The L1 loss is used instead of the L2 loss because of its robustness to outliers [33].…”
Section: Training Proceduresmentioning
confidence: 99%
“…Some papers mention the significance of evaluating predictive uncertainty to ensure the safety of the controller in the context of imitation learning. Previous papers have pointed out that the main problems in imitation learning lies in the inherent ambiguity of demonstrations (Goo and Niekum, 2019 ) or the discrepancy between training and test conditions that can lead robots to perform unexpected actions (Pomerleau, 1989 ; Osa et al, 2018 ). In practice, one possible solution is measuring the predictive uncertainty, and if the robots are uncertain about their prediction, they can stop performing actions and request that experts provide additional demonstrations (Thakur et al, 2019 ).…”
Section: Related Workmentioning
confidence: 99%