1991
DOI: 10.1109/9.67293
|View full text |Cite
|
Sign up to set email alerts
|

Aggregation of the policy iteration method for nearly completely decomposable Markov chains

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
51
0

Year Published

1994
1994
2019
2019

Publication Types

Select...
3
3

Relationship

0
6

Authors

Journals

citations
Cited by 36 publications
(51 citation statements)
references
References 23 publications
0
51
0
Order By: Relevance
“…Since here the AIB and IB methods coincide 4 , this example shows that the relaxation of the optimization problem does not necessarily lead to the optimal partition.…”
Section: Examplementioning
confidence: 87%
See 2 more Smart Citations
“…Since here the AIB and IB methods coincide 4 , this example shows that the relaxation of the optimization problem does not necessarily lead to the optimal partition.…”
Section: Examplementioning
confidence: 87%
“…Given the transition matrix of a Markov chain, they obtained a bipartition of its state space via alternating projection. Aldhaheri and Khalil considered optimal control of nearly completely decomposable Markov chains and adapted Howard's algorithm to work on an aggregated model [4]. The work of Jia considers state aggregation of Markov decision processes optimal w.r.t.…”
Section: A Contributions and Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Figure 1 shows a 150-point sample of the chain when c = 0.1 in (34). Notice the weak interaction between the two groups (1,2) and (3,4] and the strong interaction between the states in each group. The solid line in Figure 3 shows the error probabilities of the state estimates.…”
Section: Simulation Examplesmentioning
confidence: 99%
“…These techniques have since been studied in the context of singularly perturbed systems. 3, 4 The ideas behind Courtois aggregation have been extended, resulting in the technique called stochastic complementation, Stochastic complementation is applicable whether or not the system is NCD. Unlike Courtois aggregation, the exact steady state probability distribution for the full system can be reconstructed.…”
Section: Introductionmentioning
confidence: 99%