2014
DOI: 10.2168/lmcs-10(1:13)2014
Markov Decision Processes with Multiple Long-run Average Objectives

Abstract: We study Markov decision processes (MDPs) with multiple limit-average (or mean-payoff) functions. We consider two different objectives, namely, expectation and satisfaction objectives. Given an MDP with k limit-average functions, in the expectation objective the goal is to maximize the expected limit-average value, and in the satisfaction objective the goal is to maximize the probability of runs such that the limit-average value stays above a given vector. We show that under the expectation objec…
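The distinction the abstract draws between the two objectives can be illustrated with a minimal sketch (not from the paper): assume each run already has a known limit-average vector, and compare the expected value against the probability of componentwise satisfying a threshold. All names and the toy distribution are illustrative.

```python
# Toy distribution over runs: (probability, limit-average value vector).
# Two runs, each good in one dimension and poor in the other.
runs = [
    (0.5, (3.0, 1.0)),
    (0.5, (1.0, 3.0)),
]

def expected_value(runs):
    """Expectation objective: the expected limit-average vector."""
    k = len(runs[0][1])
    return tuple(sum(p * v[i] for p, v in runs) for i in range(k))

def satisfaction(runs, threshold):
    """Satisfaction objective: probability that a run's limit-average
    vector stays componentwise at or above the threshold vector."""
    return sum(p for p, v in runs
               if all(vi >= ti for vi, ti in zip(v, threshold)))

print(expected_value(runs))            # (2.0, 2.0)
print(satisfaction(runs, (2.0, 2.0)))  # 0.0
```

The example shows why the objectives differ: the expected vector (2.0, 2.0) meets the threshold, yet no individual run does, so the satisfaction probability is 0.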


Cited by 33 publications (54 citation statements). References 27 publications (76 reference statements).
“…The need for infinite memory was proved in [6, Section 5] for the problem of ensuring thresholds: in Fig. 4, v_1 = v_2 = 0.5 and α = 1 can be ensured by an infinite-memory strategy, and finite-memory strategies can only achieve these thresholds with probability 0.…”
Section: Percentiles on Multi-dimensional MP
confidence: 99%
“…The linear program follows the ideas of [18,6]. Note that the first two lines of (L) correspond to the multiple reachability LP of [18] for absorbing target states.…”
Section: Percentiles on Multi-dimensional MP
confidence: 99%
“…For example, although at multiple places we build on the techniques of [13] and [2] which allow us to deal with maximal end components (sometimes called strongly communicating sets) of an MDP separately, we often need to extend these techniques. Unlike the works [13] and [2] which study multiple "independent" objectives, in the case of the global variance any change of value in the expected mean payoff implies a change of value of the variance.…”
Section: (Zero Variance)
confidence: 99%
“…Then the user gets a 2 Mbits/sec connection almost surely, but since the individual runs are apparently "unstable", he may still see a lot of stuttering in the video stream. As an appropriate measure for the stability of individual runs, we propose local variance, which is defined as the long-run average of (r_i(ω) − mp(ω))², where r_i(ω) is the reward of the i-th action executed in a run ω and mp(ω) is the mean payoff of ω. Hence, local variance says how much the rewards of the actions executed along a given run deviate from the mean payoff of the run on average.…”
Section: Introduction
confidence: 99%
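The local-variance definition quoted above can be sketched numerically: for a finite run prefix, approximate the long-run averages by plain averages over the prefix. This is an illustrative approximation under that assumption; the function names are not from the cited work.

```python
def mean_payoff(rewards):
    """Average reward of a finite run prefix (approximates mp(omega))."""
    return sum(rewards) / len(rewards)

def local_variance(rewards):
    """Average squared deviation of the step rewards from the mean
    payoff, approximating the long-run average of (r_i - mp)^2."""
    mp = mean_payoff(rewards)
    return sum((r - mp) ** 2 for r in rewards) / len(rewards)

# A run alternating rewards 0 and 4 and a constant-2 run have the same
# mean payoff (2.0), but very different stability:
print(local_variance([0, 4, 0, 4]))  # 4.0 — unstable run
print(local_variance([2, 2, 2, 2]))  # 0.0 — perfectly stable run
```

This mirrors the video-streaming example: both runs deliver the same average rate, but only the constant run avoids "stuttering", and local variance is exactly the quantity that separates them.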