2020
DOI: 10.48550/arxiv.2008.08972
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Online inverse reinforcement learning with limited data

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

0
8
0

Year Published

2020
2020
2020
2020

Publication Types

Select...
1

Relationship

1
0

Authors

Journals

citations
Cited by 1 publication
(8 citation statements)
references
References 0 publications
0
8
0
Order By: Relevance
“…Inspired by recent results in online Reinforcement Learning methods [16]- [18], IRL has been extended to online implementations where the objective is to learn from a single demonstration or trajectory [19]- [22]. In [20], [21], batch IRL techniques are developed to estimate reward functions in the presence of unmeasureable system states and/or uncertain dynamics for both linear and nonlinear systems.…”
Section: Introductionmentioning
confidence: 99%
See 4 more Smart Citations
“…Inspired by recent results in online Reinforcement Learning methods [16]- [18], IRL has been extended to online implementations where the objective is to learn from a single demonstration or trajectory [19]- [22]. In [20], [21], batch IRL techniques are developed to estimate reward functions in the presence of unmeasureable system states and/or uncertain dynamics for both linear and nonlinear systems.…”
Section: Introductionmentioning
confidence: 99%
“…In [20], [21], batch IRL techniques are developed to estimate reward functions in the presence of unmeasureable system states and/or uncertain dynamics for both linear and nonlinear systems. The case where the trajectories being monitored are suboptimal due to an external disturbance is addressed in [23], and [22] estimates a feedback policy and generates artificial data using the estimated policy to compensate for the sparsity of data in online implementations. However, results such as [19]- [23], either require full state feedback, or rely on state estimators that require dynamical systems in Brunovsky Canonical form.…”
Section: Introductionmentioning
confidence: 99%
See 3 more Smart Citations