2020
DOI: 10.1109/lcsys.2019.2954102
|View full text |Cite
|
Sign up to set email alerts
|

Model-Free Stochastic Reachability Using Kernel Distribution Embeddings

Abstract: We present a data-driven solution to the terminalhitting stochastic reachability problem for a Markov control process. We employ a nonparametric representation of the stochastic kernel as a conditional distribution embedding within a reproducing kernel Hilbert space (RKHS). This representation avoids intractable integrals in the dynamic recursion of the stochastic reachability problem since the expectations can be calculated as an inner product within the RKHS. We demonstrate this approach on a high-dimensiona… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
13
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
2

Relationship

1
6

Authors

Journals

citations
Cited by 12 publications
(15 citation statements)
references
References 38 publications
(92 reference statements)
0
13
0
Order By: Relevance
“…By the reproducing property, for any function ∈ H X , we can compute the expectation of with respect to the distribution (• | , ), where ( , ) ∈ X ×U, as an inner product in Hilbert space with the embedding | , [20]. Thus, we can compute the safety probabilities for the first-hitting time problem and the terminal-hitting time problem by computing the expectations in ( 4) and (2) as Hilbert space inner products.…”
Section: (Rkhs)mentioning
confidence: 99%
See 3 more Smart Citations
“…By the reproducing property, for any function ∈ H X , we can compute the expectation of with respect to the distribution (• | , ), where ( , ) ∈ X ×U, as an inner product in Hilbert space with the embedding | , [20]. Thus, we can compute the safety probabilities for the first-hitting time problem and the terminal-hitting time problem by computing the expectations in ( 4) and (2) as Hilbert space inner products.…”
Section: (Rkhs)mentioning
confidence: 99%
“…Thus, we can approximate the expectation of a function ∈ H X using an inner product ˆ | , , H X . As shown in [20,22], we can substitute the inner product into the backward recursion in ( 4) and (2) to approximate the safety probabilities. We have implemented this in SReachTools as KernelEmbeddings.…”
Section: Kernel Distribution Embeddingsmentioning
confidence: 99%
See 2 more Smart Citations
“…The SReachTools Kernel Module [89] is the most recent addition to the toolbox, which implements a collection of nonparametric data-driven algorithms for stochastic reachability based on kernel methods. These algorithms leverage techniques from functional analysis and statistical learning theory and can handle discrete-time linear and nonlinear dynamical systems with arbitrary stochastic disturbances [86,90].…”
Section: Figaro Workbenchmentioning
confidence: 99%