Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence 2020
DOI: 10.24963/ijcai.2020/570
|View full text |Cite
|
Sign up to set email alerts
|

Verifiable RNN-Based Policies for POMDPs Under Temporal Logic Constraints

Abstract: Recurrent neural networks (RNNs) have emerged as an effective representation of control policies in sequential decision-making problems. However, a major drawback in the application of RNN-based policies is the difficulty in providing formal guarantees on the satisfaction of behavioral specifications, e.g. safety and/or reachability. By integrating techniques from formal methods and machine learning, we propose an approach to automatically extract a finite-state controller (FSC) from an RNN, which, … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
16
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
5
3
1

Relationship

2
7

Authors

Journals

citations
Cited by 30 publications
(21 citation statements)
references
References 2 publications
0
16
0
Order By: Relevance
“…Thus, finding a suitable under-approximative value function reduces to finding "good" policies for M, e.g. by using randomly guessed fm-policies, machine learning methods [12], or a transformation to a parametric model [27].…”
Section: 1 Belief Cut-offsmentioning
confidence: 99%
See 1 more Smart Citation
“…Thus, finding a suitable under-approximative value function reduces to finding "good" policies for M, e.g. by using randomly guessed fm-policies, machine learning methods [12], or a transformation to a parametric model [27].…”
Section: 1 Belief Cut-offsmentioning
confidence: 99%
“…Previously proposed methods to solve the problem are e.g. to use approximate value iteration [21], optimisation and search techniques [1,11], dynamic programming [6], Monte Carlo simulation [43], game-based abstraction [51], and machine learning [12,13,18]. Other approaches restrict the memory size of the policies [34].…”
Section: Introductionmentioning
confidence: 99%
“…The last result, from [1], builds on a novel connection between training of recurrent neural networks and probabilistic model checking. More specifically, recurrent neural networks have emerged as an effective representation of control policies in sequential decisionmaking problems.…”
Section: Ccs Conceptsmentioning
confidence: 99%
“…Building on preliminary results in Carr et al (2019Carr et al ( , 2020, the current paper makes the following contributions. First, it presents an iterative method that employs state-of-the art tools from machine learning and formal verification to find policies that ensure that an agent in a POMDP satisfies any given linear temporal logic specification.…”
Section: Introductionmentioning
confidence: 99%