2021 American Control Conference (ACC) 2021
DOI: 10.23919/acc50511.2021.9482725
|View full text |Cite
|
Sign up to set email alerts
|

Recurrent Neural Network Controllers for Signal Temporal Logic Specifications Subject to Safety Constraints

Abstract: In this paper, we consider the problem of learning a neural network controller for a system required to satisfy a Signal Temporal Logic (STL) specification. We exploit STL quantitative semantics to define a notion of robust satisfaction. Guaranteeing the correctness of a neural network controller, i.e., ensuring the satisfaction of the specification by the controlled system, is a difficult problem that received a lot of attention recently. We provide a general procedure to construct a set of trainable High Ord… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
25
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
5

Relationship

1
4

Authors

Journals

citations
Cited by 8 publications
(25 citation statements)
references
References 38 publications
(84 reference statements)
0
25
0
Order By: Relevance
“…Similarly to generating the initial dataset, each time after the control policy is improved, we sample N initial states, starting from which the safe control inputs in (12) are applied until arriving at the time horizon T . We add all the system transition data (totally N T data pairs) to the dataset D. Then the FNN is retrained on the new dataset D to minimize the loss function C in (7) (see Alg.…”
Section: A System Model Learningmentioning
confidence: 99%
See 3 more Smart Citations
“…Similarly to generating the initial dataset, each time after the control policy is improved, we sample N initial states, starting from which the safe control inputs in (12) are applied until arriving at the time horizon T . We add all the system transition data (totally N T data pairs) to the dataset D. Then the FNN is retrained on the new dataset D to minimize the loss function C in (7) (see Alg.…”
Section: A System Model Learningmentioning
confidence: 99%
“…In this section, we evaluate our approach on two case studies. We compare the results with the approach in [12], where an RNN controller is trained via imitation learning, i.e., a dataset containing success trajectories is first generated and the controller is trained on that dataset. The system dynamics in [12] are assumed to be known.…”
Section: Case Studiesmentioning
confidence: 99%
See 2 more Smart Citations
“…Other learning-based related works include modeling with Gaussian processes [6,23,24,25], or use neural networks [26,27,28,29,30,31,32,33,30,34,35,36] to accomplish reachability, verification or temporal logic specifications. Nevertheless, the aforementioned works either use partial information on the underlying robot dynamics, or do not consider them at all.…”
Section: Related Workmentioning
confidence: 99%