2007
DOI: 10.1007/978-1-84628-690-2
|View full text |Cite
|
Sign up to set email alerts
|

Simulation-based Algorithms for Markov Decision Processes

Abstract: British Library Cataloguing in Publication Data Simulation-based algorithms for Markov decision processes. -(Communications and control engineering) 1. Decision making -Mathematical models 2. Markov processes I. Chang, Hyeong Soo engineering -Mathematics 3. Matrices 4. Linear systems 658.4'033 ISBN-13: 9781846286896

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

1
92
0

Year Published

2010
2010
2015
2015

Publication Types

Select...
5
2

Relationship

1
6

Authors

Journals

citations
Cited by 168 publications
(93 citation statements)
references
References 0 publications
1
92
0
Order By: Relevance
“…Even if the spirit of improving all policies in a set is similar to parallel rollout in unconstrained MDPs (Chang, Hu, Fu, & Marcus, 2013), the approach here is different due to the constrained setting. The parallel rollout method for a set of policies uses the maximum value function over the value functions of the policies in the set at each possible next state when it obtains an action prescribed by an improving policy at a state.…”
Section: Multi-policy Improvementmentioning
confidence: 98%
“…Even if the spirit of improving all policies in a set is similar to parallel rollout in unconstrained MDPs (Chang, Hu, Fu, & Marcus, 2013), the approach here is different due to the constrained setting. The parallel rollout method for a set of policies uses the maximum value function over the value functions of the policies in the set at each possible next state when it obtains an action prescribed by an improving policy at a state.…”
Section: Multi-policy Improvementmentioning
confidence: 98%
“…Reference [37] gives a survey of EC applied to noisy environments. Recent work by [13] and [12] has produced provably convergent algorithms for solving Markov Decision Processes. Reference [32] extend this work to solving problems with the form of (1).…”
Section: Stochastic Combinatorial Optimization Literaturementioning
confidence: 99%
“…Evolutionary Policy Iteration (EPI) was proposed in [13] and [12]. It was suggested as a method to find optimal policies in Markov decision processes (MDP).…”
Section: Competing Algorithmsmentioning
confidence: 99%
See 2 more Smart Citations