2003
DOI: 10.1613/jair.1000
|View full text |Cite
|
Sign up to set email alerts
|

Efficient Solution Algorithms for Factored MDPs

Abstract: This paper addresses the problem of planning under uncertainty in large Markov Decision Processes (MDPs). Factored MDPs represent a complex state space using state variables and the transition model using a dynamic Bayesian network. This representation often allows an exponential reduction in the representation size of structured MDPs, but the complexity of exact solution algorithms for such MDPs can grow exponentially in the representation size. In this paper, we present two approximate solution algorithms th… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

2
254
0
11

Year Published

2004
2004
2023
2023

Publication Types

Select...
4
4
1

Relationship

0
9

Authors

Journals

citations
Cited by 256 publications
(267 citation statements)
references
References 33 publications
2
254
0
11
Order By: Relevance
“…Morrison and Kumar (1999) formulate approximate linear programming algorithms for queueing problems with a specific choice of basis functions that renders all but a relatively small number of constraints redundant. Guestrin et al (2003) exploit the structure arising when factored linear architectures are used for approximating the cost-to-go function in factored MDPs. In some special cases, this allows for efficient exact solution of the ALP, and in others, this motivates alternative approximate solution methods.…”
mentioning
confidence: 99%
See 1 more Smart Citation
“…Morrison and Kumar (1999) formulate approximate linear programming algorithms for queueing problems with a specific choice of basis functions that renders all but a relatively small number of constraints redundant. Guestrin et al (2003) exploit the structure arising when factored linear architectures are used for approximating the cost-to-go function in factored MDPs. In some special cases, this allows for efficient exact solution of the ALP, and in others, this motivates alternative approximate solution methods.…”
mentioning
confidence: 99%
“…Schuurmans and Patrascu (2002) devise a constraint generation scheme, also especially designed for factored MDPs with factored linear architectures. The worst-case computation time of this scheme grows exponentially with the number of state variables, even for special cases treated effectively by the methods of Guestrin et al (2003). However, the proposed scheme requires a smaller amount of computation time, on average.…”
mentioning
confidence: 99%
“…Although the generality is attractive, this is precisely what makes general dynamic programs impossible to solve. It is far more effective to use a factored representation (Boutilier et al 1999, Guestrin et al 2003, where we retain the specific structure of the state variable. The term "flat representation" is also used in the AI community to distinguish nonhierarchical from hierarchical representations.…”
Section: Modeling Dynamic Programsmentioning
confidence: 99%
“…De plus, elle n'est pas très utile en pratique puisque la plupart des opérateurs d'approximation et algorithmes d'AS résolvent un problème de minimisation en norme L 1 ou L 2 (bien que certains approximateurs de fonction utilisant la norme L ∞ , tels les averagers, aient été étudiés dans le cadre de la PD (Gordon, 1995, Guestrin et al, 2001). …”
Section: Préliminairesunclassified