Recurrent Neural Network Controllers for Signal Temporal Logic Specifications Subject to Safety Constraints

Li, Wenliang; Mehdipour, Noushin; Belta, Călin

doi:10.23919/acc50511.2021.9482725

Cited by 8 publications

(25 citation statements)

References 38 publications

(84 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Similarly to generating the initial dataset, each time after the control policy is improved, we sample N initial states, starting from which the safe control inputs in (12) are applied until arriving at the time horizon T . We add all the system transition data (totally N T data pairs) to the dataset D. Then the FNN is retrained on the new dataset D to minimize the loss function C in (7) (see Alg.…”

Section: A System Model Learningmentioning

confidence: 99%

“…In this section, we evaluate our approach on two case studies. We compare the results with the approach in [12], where an RNN controller is trained via imitation learning, i.e., a dataset containing success trajectories is first generated and the controller is trained on that dataset. The system dynamics in [12] are assumed to be known.…”

Section: Case Studiesmentioning

confidence: 99%

“…The RNN was implemented using Pytorch [32]. We used a Mac with a 2.6GHz Core i7 CPU and 16GB of RAM for both the method developed in this paper and for the one from [12].…”

Section: Case Studiesmentioning

confidence: 99%

“…For instance, if a specification requires an agent to visit region A and then region B, the agent needs to know whether it has already visited region A in order to decide which region it should go at the current time. A Recurrent Neural Network (RNN) controller, which has memory, was proposed in [12] to satisfy STL specifications. The RNN was trained on a dataset containing trajectories generated from optimization methods, which is again computationally expensive.…”

Section: Introductionmentioning

confidence: 99%

See 3 more Smart Citations

Safe Model-based Control from Signal Temporal Logic Specifications Using Recurrent Neural Networks

Belta

2021

Preprint

Self Cite

View full text Add to dashboard Cite

We propose a policy search approach to learn controllers from specifications given as Signal Temporal Logic (STL) formulae. The system model is unknown, and it is learned together with the control policy. The model is implemented as a feedforward neural network (FNN). To capture the history dependency of the STL specification, we use a recurrent neural network (RNN) to implement the control policy. In contrast to prevalent model-free methods, the learning approach proposed here takes advantage of the learned model and is more efficient. We use control barrier functions (CBFs) with the learned model to improve the safety of the system. We validate our algorithm via simulations. The results show that our approach can satisfy the given specification within very few system runs, and therefore it has the potential to be used for on-line control.

show abstract

Section: A System Model Learningmentioning

confidence: 99%

Section: Case Studiesmentioning

confidence: 99%

“…The RNN was implemented using Pytorch [32]. We used a Mac with a 2.6GHz Core i7 CPU and 16GB of RAM for both the method developed in this paper and for the one from [12].…”

Section: Case Studiesmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

Safe Model-based Control from Signal Temporal Logic Specifications Using Recurrent Neural Networks

Belta

2021

Preprint

Self Cite

View full text Add to dashboard Cite

show abstract

“…Other learning-based related works include modeling with Gaussian processes [6,23,24,25], or use neural networks [26,27,28,29,30,31,32,33,30,34,35,36] to accomplish reachability, verification or temporal logic specifications. Nevertheless, the aforementioned works either use partial information on the underlying robot dynamics, or do not consider them at all.…”

Section: Related Workmentioning

confidence: 99%

Non-Parametric Neuro-Adaptive Control Subject to Task Specifications

Verginis,

Xu,

Topcu

2021

Preprint

View full text Add to dashboard Cite

We develop a learning-based algorithm for the control of robotic systems governed by unknown, nonlinear dynamics to satisfy tasks expressed as signal temporal logic specifications. Most existing algorithms either assume certain parametric forms for the dynamic terms or resort to unnecessarily large control inputs (e.g., using reciprocal functions) in order to provide theoretical guarantees. The proposed algorithm avoids the aforementioned drawbacks by innovatively integrating neural network-based learning with adaptive control. More specifically, the algorithm learns a controller, represented as a neural network, using training data that correspond to a collection of different tasks and robot parameters. It then incorporates this neural network into an online closed-loop adaptive control mechanism in such a way that the resulting behavior satisfies a user-defined task. The proposed algorithm does not use any information on the unknown dynamic terms or any approximation schemes. We provide formal theoretical guarantees on the satisfaction of the task and we demonstrate the effectiveness of the algorithm in a virtual simulator using a 6-DOF robotic manipulator.

show abstract