2019 International Conference on Robotics and Automation (ICRA) 2019
DOI: 10.1109/icra.2019.8793828
|View full text |Cite
|
Sign up to set email alerts
|

Formal Policy Learning from Demonstrations for Reachability Properties

Abstract: We consider the problem of learning structured, closed-loop policies (feedback laws) from demonstrations in order to control under-actuated robotic systems, so that formal behavioral specifications such as reaching a target set of states are satisfied. Our approach uses a "counterexample-guided" iterative loop that involves the interaction between a policy learner, a demonstrator and a verifier. The learner is responsible for querying the demonstrator in order to obtain the training data to guide the construct… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(1 citation statement)
references
References 29 publications
0
1
0
Order By: Relevance
“…This work will consider the controller synthesis problem for reachability specification for unknown dynamical systems. In the past few decades, there have been several works in the literature addressing reachability problem (see [10], [11], [12], [13], [14]) for known dynamical systems. To solve this problem, we leverage the funnel-based control approaches [15] that have been extensively used for controlling systems with prescribed performance constraints (see [16] and references therein for examples).…”
Section: Introductionmentioning
confidence: 99%
“…This work will consider the controller synthesis problem for reachability specification for unknown dynamical systems. In the past few decades, there have been several works in the literature addressing reachability problem (see [10], [11], [12], [13], [14]) for known dynamical systems. To solve this problem, we leverage the funnel-based control approaches [15] that have been extensively used for controlling systems with prescribed performance constraints (see [16] and references therein for examples).…”
Section: Introductionmentioning
confidence: 99%