2020
DOI: 10.48550/arxiv.2011.02057
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Online Observer-Based Inverse Reinforcement Learning

Ryan Self,
Kevin Coleman,
He Bai
et al.

Abstract: In this paper, a novel approach to the outputfeedback inverse reinforcement learning (IRL) problem is developed by casting the IRL problem, for linear systems with quadratic cost functions, as a state estimation problem. Two observer-based techniques for IRL are developed, including a novel observer method that re-uses previous state estimates via history stacks. Theoretical guarantees for convergence and robustness are established under appropriate excitation conditions. Simulations demonstrate the performanc… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(3 citation statements)
references
References 14 publications
(31 reference statements)
0
3
0
Order By: Relevance
“…Proof: Bring such Q * , R, γ into (19a), since Q T , R T , γ T satisfy (6), then one has P * = cP T and K * = K T . Then, using this result in (11) for V (x) * and comparing it with V T (x T ) * in (4) shows that V (x) * = cV T (x T ) * .…”
Section: Non-uniqueness Of Solutionmentioning
confidence: 94%
See 2 more Smart Citations
“…Proof: Bring such Q * , R, γ into (19a), since Q T , R T , γ T satisfy (6), then one has P * = cP T and K * = K T . Then, using this result in (11) for V (x) * and comparing it with V T (x T ) * in (4) shows that V (x) * = cV T (x T ) * .…”
Section: Non-uniqueness Of Solutionmentioning
confidence: 94%
“…Lemma 1. Consider the two-player learner agent (7) with x(t 0 ) = x 0 , t ≥ t 0 and ( 8) and (11). Assume there exists a positive definite symmetric matrix P ∈ R n×n such that…”
Section: B Stability and Optimality Analysismentioning
confidence: 99%
See 1 more Smart Citation