52nd IEEE Conference on Decision and Control 2013
DOI: 10.1109/cdc.2013.6760491

Optimal control of MDPs with temporal logic constraints

Abstract: In this paper, we focus on formal synthesis of control policies for finite Markov decision processes with non-negative real-valued costs. We develop an algorithm to automatically generate a policy that guarantees the satisfaction of a correctness specification expressed as a formula of Linear Temporal Logic, while at the same time minimizing the expected average cost between two consecutive satisfactions of a desired property. The existing solutions to this problem are sub-optimal. By leveraging ideas from aut…
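
To make the objective in the abstract concrete, here is a minimal sketch (not the paper's synthesis algorithm) that evaluates the average cost between two consecutive visits to a set of "satisfying" states, for a Markov chain obtained by fixing one policy. It assumes the resulting chain is ergodic, so that the long-run cost per step divided by the long-run frequency of satisfying visits gives the cost per cycle (a renewal-reward argument). The function names, the transition matrix `P`, the per-state costs, and the goal set are all hypothetical and chosen only for illustration.

```python
def stationary_distribution(P, iterations=1000):
    """Approximate the stationary distribution of an ergodic chain by power iteration."""
    n = len(P)
    pi = [1.0] + [0.0] * (n - 1)  # start with all mass on state 0
    for _ in range(iterations):
        # One step of the chain: pi_next[j] = sum_i pi[i] * P[i][j]
        pi = [sum(pi[i] * P[i][j] for i in range(n)) for j in range(n)]
    return pi


def average_cost_per_cycle(P, cost, goal):
    """Long-run cost per step divided by long-run rate of visits to `goal`.

    Assumption: a cost cost[s] is paid for every step spent in state s, and a
    "satisfaction" is counted on every step spent in a goal state.
    """
    pi = stationary_distribution(P)
    cost_rate = sum(p * c for p, c in zip(pi, cost))
    visit_rate = sum(pi[s] for s in goal)
    return cost_rate / visit_rate


# Hypothetical 3-state chain induced by some fixed policy (rows sum to 1).
P = [
    [0.5, 0.5, 0.0],
    [0.0, 0.5, 0.5],
    [0.5, 0.0, 0.5],
]
cost = [1.0, 2.0, 3.0]  # hypothetical per-step cost of each state
goal = {2}              # hypothetical set of satisfying states

print(average_cost_per_cycle(P, cost, goal))  # approximately 6.0
```

In this toy chain the stationary distribution is uniform, so the mean cost per step is 2 and the goal state is visited on one step in three, giving about 6 cost units per cycle. A synthesis procedure of the kind the abstract describes would search over policies to minimize this quantity while still satisfying the temporal logic specification; that search is not shown here.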

Cited by 38 publications (31 citation statements)
References 19 publications (28 reference statements)
“…The authors present a solution that is optimal only in special cases. In [34], we prove that our approach results in an optimal solution for any MDP and LTL formula. We also strongly believe that a similar approach can be used for non-deterministic transition systems but one first needs to properly define the optimization objective APPC that accounts for the non-determinism of transitions.…”
Section: Discussion (mentioning)
confidence: 95%
“…Our preliminary results on this topic can be found in [34], where we consider the analogous problem for MDPs with static costs. This problem was also investigated in [35].…”
Section: Discussion (mentioning)
confidence: 99%
“…Since the satisfactory runs for a temporal logic are usually of infinite length, it is not trivial to assign costs to a particular run. Accumulative, average, weight average, and expected average costs for transitions or between two consecutive satisfactions are all considered in the literature, see e.g., [71,72,75].…”
Section: Optimality (mentioning)
confidence: 99%
“…In [16], the authors synthesized a control policy such that the MDP satisfies the given specification almost surely, if such a policy exists. The control strategies synthesis for MDP so to minimize the expected average cost between two consecutive satisfactions of a desired LTL property was considered in [72] by using results from the game theory [10].…”
Section: Optimality (mentioning)
confidence: 99%
“…While the synthesis problem also has a long tradition [9,3,22], it has gained significant attention in formal methods more recently. These techniques are being deployed in control and path planning in particular: model checking techniques can be adapted to synthesize (optimal) controllers for deterministic finite systems [24,14], Büchi and Rabin games can be reformulated as control strategies for nondeterministic systems [29,27], and probabilistic games can be used to control finite probabilistic systems such as Markov decision processes [23,18].…”
Section: Introduction (mentioning)
confidence: 99%