Decentralized control of finite state Markov processes

Hsü,; Marcús,

doi:10.1109/cdc.1980.272034

Cited by 9 publications

(9 citation statements)

References 0 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…We start investigating communication that arrives with a one-step delay (1-SD), which is referred to as the 'one-step delay observation sharing pattern' in the control literature [Witsenhausen, 1971, Varaiya and Walrand, 1978, Hsu and Marcus, 1982. The consequence is that during execution at stage t the agents knowθ t−1 , the joint action-observation history up to time step t − 1, and the joint action a t−1 that was taken at the previous time step.…”

Section: One-step Delayed Communicationmentioning

confidence: 99%

“…For the motivation to indicate this value using the letter 'V' and for the relation to other value functions, please see the discussion by Oliehoek [2010, chap. 3] Hsu andMarcus [1982] already showed that the value function for 1-step delayed communication is piecewise-linear and convex (PWLC); i.e., representable using sets of vectors. Not surprisingly, more recent approximation methods for POMDPs, e.g., Perseus [Spaan and Vlassis, 2005], can be transferred to its computation [Oliehoek et al, 2007].…”

Section: One-step Delayed Communicationmentioning

confidence: 99%

See 1 more Smart Citation

A Concise Introduction to Decentralized POMDPs

Oliehoek

Amato

2016

SpringerBriefs in Intelligent Systems

523

233

View full text Add to dashboard Cite

Section: One-step Delayed Communicationmentioning

confidence: 99%

Section: One-step Delayed Communicationmentioning

confidence: 99%

A Concise Introduction to Decentralized POMDPs

Oliehoek

Amato

2016

SpringerBriefs in Intelligent Systems

523

233

View full text Add to dashboard Cite

“…This situation arises in team problems with special information structure (see [16,22]), and in control problems with a random environment [4, 14, 151. (iii) Coupling of the dp equation with other linear equations. This occurred in equations describing the time evolution of free choice Petri-nets (which are useful for the modeling of parallel computing), see [7].…”

Section: Altman and Koolementioning

confidence: 99%

On submodular value functions and complex dynamic programming

Altman

Koole

1998

Communications in Statistics. Stochastic Models

View full text Add to dashboard Cite

We investigate in this paper submodular value functions using complex dynamic programming. In complex dynamic programming (dp) we consider concatenations and linear combinations of standard dp operators, as well as combinations of maximizations and minimizations. These value functions have many applications and interpretations, both in stochastic control (and stochastic zero-sum games) as well as in the analysis of (noncontrolled) discrete-event dynamic systems. The submodularity implies the monotonicity of the selectors appearing in the dp equations, which translates, in the context of stochastic control and stochastic games, to monotone optimal policies. Our work is based on the score-space approach of Glasserman and Yao.

show abstract

“…The team problem with decentralized information can be transformed into an equivalent Partially Observable Markov Decision Process (PO-MOP), that can be solved using dynamic programming once we transform it to an equivalent Completely Observable Markov Decision Process (CO-MOP) see [5], [2], [3], [4]. The problem is that this transformation comes at a cost of enlarging the state space.…”

Section: Introductionmentioning

confidence: 99%

Stochastic games with one step delay sharing information pattern with application to power control

Altman

Kambley

Silva

2009

2009 International Conference on Game Theory for Networks

View full text Add to dashboard Cite

International audienceNon-cooperative game theory has gained much interest as a paradigm for decentralized control in communication networks. It allows to get rid of the need for a centralized controller. Decentralizing the decision making may result in situations where agents (decision makers) do not have the same view of the network: the information available to agents vary from one agent to another. The global view of the network state cannot be available to an agent as fast as the information on its local state. Incorporating into the decentralized control paradigm this information asymmetry renders it applicable to a much wider class of situations. In this paper we model the above information asymmetry using the one-step delay sharing information pattern from team theory and generalize it to the context of non-cooperative games. We study its properties and apply it to distributed power control problem

show abstract

Decentralized control of finite state Markov processes

Cited by 9 publications

References 0 publications

A Concise Introduction to Decentralized POMDPs

A Concise Introduction to Decentralized POMDPs

On submodular value functions and complex dynamic programming

Stochastic games with one step delay sharing information pattern with application to power control

Contact Info

Product

Resources

About