2020
DOI: 10.1007/978-3-030-57628-8_14
|View full text |Cite
|
Sign up to set email alerts
|

Probabilistic Guarantees for Safe Deep Reinforcement Learning

Abstract: Deep reinforcement learning is an increasingly popular technique for synthesising policies to control an agent's interaction with its environment. There is also growing interest in formally verifying that such policies are correct and execute safely. Progress has been made in this area by building on existing work for verification of deep neural networks and of continuous-state dynamical systems. In this paper, we tackle the problem of verifying probabilistic policies for deep reinforcement learning, which are… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
12
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
3
2

Relationship

0
9

Authors

Journals

citations
Cited by 22 publications
(12 citation statements)
references
References 46 publications
(35 reference statements)
0
12
0
Order By: Relevance
“…-Google Scholar 14 -Engineering Village (including Compendex) 15 -Web of Science 16 -Science Direct 17 -Scopus 18 -ACM Digital Library 19 -IEEE Xplore 20 Certification of machine learning systems is a relatively new topic. To review the state-of-the-art techniques, we limited the publication date from January 2015 to September 2020 (the month when we started this work) in our searches.…”
Section: Paper Searchmentioning
confidence: 99%
See 1 more Smart Citation
“…-Google Scholar 14 -Engineering Village (including Compendex) 15 -Web of Science 16 -Science Direct 17 -Scopus 18 -ACM Digital Library 19 -IEEE Xplore 20 Certification of machine learning systems is a relatively new topic. To review the state-of-the-art techniques, we limited the publication date from January 2015 to September 2020 (the month when we started this work) in our searches.…”
Section: Paper Searchmentioning
confidence: 99%
“…An algorithm for measuring the safety of RL agents has been proposed in [17]. A controller is modelled to characterize the actions taken by the agent and the possibilities that result from taking them.…”
Section: Stability Certification Of Rl-based Controllersmentioning
confidence: 99%
“…Our abstraction mechanism is based on the assumption that a trained controller usually adopts the same action for those concrete states that are adjacent [2]. We consider a concrete state s to be a vector of n (n ≥ 1) real numbers.…”
Section: State Discretization and Abstractionmentioning
confidence: 99%
“…Various recent works consider verification or synthesis of RL schemes against reachability specifications [Sun et al, 2019, Könighofer et al, 2020, Bacci and Parker, 2020. None of these approaches, however, support both continuous state-action spaces and probabilistic models, as in this work.…”
Section: Introductionmentioning
confidence: 98%