2020
DOI: 10.48550/arxiv.2007.12501
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Secure Control in Partially Observable Environments to Satisfy LTL Specifications

Abstract: This paper studies the synthesis of control policies for an agent that has to satisfy a temporal logic specification in a partially observable environment, in the presence of an adversary. The interaction of the agent (defender) with the adversary is modeled as a partially observable stochastic game. The goal is to generate a defender policy to maximize satisfaction of a given temporal logic specification under any adversary policy. The search for policies is limited to the space of finite state controllers, w… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
4
0

Year Published

2020
2020
2020
2020

Publication Types

Select...
1

Relationship

1
0

Authors

Journals

citations
Cited by 1 publication
(4 citation statements)
references
References 40 publications
(63 reference statements)
0
4
0
Order By: Relevance
“…Therefore, the problem of maximizing the probability of satisfaction of ϕ under any adversary policy is equivalent to reaching a subset of states under these policies of the product game G ϕ that composes representations of the environment and the LTL objective. Moreover, results in [15]- [18] establish convergence of these policies to an equilibrium of the Stackelberg game between the defender and adversary.…”
Section: A Synthesis Of Policies To Satisfy Ltl Objectivementioning
confidence: 95%
See 3 more Smart Citations
“…Therefore, the problem of maximizing the probability of satisfaction of ϕ under any adversary policy is equivalent to reaching a subset of states under these policies of the product game G ϕ that composes representations of the environment and the LTL objective. Moreover, results in [15]- [18] establish convergence of these policies to an equilibrium of the Stackelberg game between the defender and adversary.…”
Section: A Synthesis Of Policies To Satisfy Ltl Objectivementioning
confidence: 95%
“…Maximizing the probability of satisfaction in this setting corresponds to reaching an equilibrium of a zero-sum Stackelberg game between the defender and adversary. A careful justification of this assertion with detailed proofs for fully and partially observable environments has been studied in our earlier works [15]- [18]. We only state relevant results that will be useful in our goal to further establish differential privacy of trajectories in G ϕ that will satisfy ϕ under the respective agent policies.…”
Section: A Synthesis Of Policies To Satisfy Ltl Objectivementioning
confidence: 99%
See 2 more Smart Citations