1980 19th IEEE Conference on Decision and Control Including the Symposium on Adaptive Processes 1980
DOI: 10.1109/cdc.1980.272034
|View full text |Cite
|
Sign up to set email alerts
|

Decentralized control of finite state Markov processes

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
9
0

Year Published

1998
1998
2016
2016

Publication Types

Select...
2
2
1

Relationship

0
5

Authors

Journals

citations
Cited by 9 publications
(9 citation statements)
references
References 0 publications
0
9
0
Order By: Relevance
“…We start investigating communication that arrives with a one-step delay (1-SD), which is referred to as the 'one-step delay observation sharing pattern' in the control literature [Witsenhausen, 1971, Varaiya and Walrand, 1978, Hsu and Marcus, 1982. The consequence is that during execution at stage t the agents knowθ t−1 , the joint action-observation history up to time step t − 1, and the joint action a t−1 that was taken at the previous time step.…”
Section: One-step Delayed Communicationmentioning
confidence: 99%
See 1 more Smart Citation
“…We start investigating communication that arrives with a one-step delay (1-SD), which is referred to as the 'one-step delay observation sharing pattern' in the control literature [Witsenhausen, 1971, Varaiya and Walrand, 1978, Hsu and Marcus, 1982. The consequence is that during execution at stage t the agents knowθ t−1 , the joint action-observation history up to time step t − 1, and the joint action a t−1 that was taken at the previous time step.…”
Section: One-step Delayed Communicationmentioning
confidence: 99%
“…For the motivation to indicate this value using the letter 'V' and for the relation to other value functions, please see the discussion by Oliehoek [2010, chap. 3] Hsu andMarcus [1982] already showed that the value function for 1-step delayed communication is piecewise-linear and convex (PWLC); i.e., representable using sets of vectors. Not surprisingly, more recent approximation methods for POMDPs, e.g., Perseus [Spaan and Vlassis, 2005], can be transferred to its computation [Oliehoek et al, 2007].…”
Section: One-step Delayed Communicationmentioning
confidence: 99%
“…This situation arises in team problems with special information structure (see [16,22]), and in control problems with a random environment [4, 14, 151. (iii) Coupling of the dp equation with other linear equations. This occurred in equations describing the time evolution of free choice Petri-nets (which are useful for the modeling of parallel computing), see [7].…”
Section: Altman and Koolementioning
confidence: 99%
“…The team problem with decentralized information can be transformed into an equivalent Partially Observable Markov Decision Process (PO-MOP), that can be solved using dynamic programming once we transform it to an equivalent Completely Observable Markov Decision Process (CO-MOP) see [5], [2], [3], [4]. The problem is that this transformation comes at a cost of enlarging the state space.…”
Section: Introductionmentioning
confidence: 99%