2011 IEEE 26th Annual Symposium on Logic in Computer Science 2011
DOI: 10.1109/lics.2011.10
|View full text |Cite
|
Sign up to set email alerts
|

Two Views on Multiple Mean-Payoff Objectives in Markov Decision Processes

Abstract: We study Markov decision processes (MDPs) with multiple limit-average (or mean-payoff) functions. We consider two different objectives, namely, expectation and satisfaction objectives. Given an MDP with k k k reward functions, in the expectation objective the goal is to maximize the expected limit-average value, and in the satisfaction objective the goal is to maximize the probability of runs such that the limit-average value stays above a given vector. We show that under the expectation objective, in contrast… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

1
114
0

Year Published

2012
2012
2022
2022

Publication Types

Select...
7

Relationship

1
6

Authors

Journals

citations
Cited by 43 publications
(115 citation statements)
references
References 16 publications
1
114
0
Order By: Relevance
“…Similar concepts for a strategy σ of the Spoiler are defined analogously. In this paper we use an alternative formulation of strategy [1] that generalises the concept of strategy automata [6].…”
Section: Definition 1 (Stochastic Game Arena)mentioning
confidence: 99%
See 3 more Smart Citations
“…Similar concepts for a strategy σ of the Spoiler are defined analogously. In this paper we use an alternative formulation of strategy [1] that generalises the concept of strategy automata [6].…”
Section: Definition 1 (Stochastic Game Arena)mentioning
confidence: 99%
“…Strategies are expressed as strategy automata [6,1] that consist of-i) a set of memory elements, ii) a memory update function that specifies how memory is updated as the transitions occur in the game arena, and iii) a next move function that specifies a distribution over the successors of game state, depending on the memory element. Memory update functions in strategy automata can be either deterministic or stochastic [1].…”
Section: Introductionmentioning
confidence: 99%
See 2 more Smart Citations
“…For non-stochastic games, multi-dimensional objectives have been considered in [6,23]. For MDPs, multiple discounted objectives [5], long-run objectives [2], ω-regular objectives [9] and total rewards [12] have been analysed. The objectives that we study in this paper are a special case of branching time temporal logics for stochastic games [3,1].…”
Section: Introductionmentioning
confidence: 99%