Dynamic fictitious play, dynamic gradient play, and distributed convergence to Nash equilibria

Shamma, Jeff S.; Arslan, Gürdal

doi:10.1109/tac.2005.843878

Cited by 335 publications

(305 citation statements)

References 35 publications

Citation statement types: Supporting, Mentioning (294), Contrasting


“…Learning-based equilibrium-seeking algorithms in both discrete [97] and continuous [122] settings provide a possible approach for tightening the error bounds and modeling the interaction over an extended time period. Investigating whether the supermodularity of the objective function for the system can be exploited to improve the bounds on these algorithms is an open problem.…”

Section: Future Work (mentioning)

confidence: 99%

Submodularity in Dynamics and Control of Networked Systems

Clark, Alomair, Bushnell, et al. 2016

Communications and Control Engineering


Submodularity in Dynamics and Control of Networked Systems. Andrew Clark. Co-Chairs of the Supervisory Committee: Radha Poovendran (Electrical Engineering), Linda Bushnell (Electrical Engineering).

Controlling a networked dynamical system to reach a desired state is a fundamental challenge in applications including transportation, energy, social, and biological systems. One scalable approach to controlling such systems is to directly control the states of a subset of leader nodes, while relying on local interactions to steer the remaining nodes towards their desired states. The choice of leader nodes is known to affect system metrics including robustness to noise, rate of convergence to a desired state, and controllability of the system.

Selecting an optimal subset of leader nodes, however, is inherently a combinatorial problem, making optimal leader node selection intractable in general.

This thesis presents a submodular optimization framework for selecting leader nodes for control of networked systems. We investigate the problem of selecting a subset of leader nodes in order to minimize node state errors due to noise in the communication links between nodes. We prove that the error due to link noise is a supermodular function of the set of leader nodes, leading to the first polynomial-time algorithms for minimizing error due to link noise with provable optimality bounds. We develop our approach for networks with static topologies, as well as dynamic topologies due to random link failures, switching between predefined topologies, and arbitrary mobility.

We study selecting leader nodes in order to minimize convergence error, defined as the error in the intermediate node states prior to reaching their desired values. We derive upper bounds for a class of convergence error metrics based on the hitting time of a random walk on the network, which we prove to be a supermodular function of the set of input nodes.

We present polynomial-time algorithms for minimizing convergence error with provable optimality bounds, for static as well as dynamic networks.

Efficient algorithms have recently been proposed for selecting leader nodes to satisfy controllability, defined as the ability to drive the non-input nodes from any initial state to any desired state in finite time. These algorithms, however, do not incorporate performance criteria including robustness to noise and convergence rate. We study the problem of leader selection for joint performance and controllability, and prove that controllability can be formulated as a matroid constraint on the set of leader nodes. We propose efficient algorithms with a provable optimality gap for selecting leader nodes for joint performance and controllability, and characterize the submodular structure of the largest controllable subgraph of a network.

We also investigate selecting input nodes for guaranteeing synchronization in networked systems. Using the widely-studied Kuramoto model of nonlinear phase-coupled oscillators, we develop novel threshold-based conditions for a set of input nodes to ...


“…With a broad set of existing results for learning in potential games (Arslan & Shamma, 2004; Fudenberg & Levine, 1998; Marden, Arslan, & Shamma, 2009b; Marden, Young, Arslan, & Shamma, 2009; Shamma & Arslan, 2005; Young, 1998, 2005, 1993), the primary focus of this work is on the development of methodologies for designing the interaction framework as a potential game while meeting constraints and objectives relevant to multiagent systems, e.g., locality of agent objective functions, and efficiency guarantees for resulting equilibria, among many others. Unfortunately, the framework of potential games is not broad enough to meet this diverse set of challenges as several limitations are beginning to emerge.…”

Section: Introduction (mentioning)

confidence: 99%

“…The framework of state based potential games is rich enough to overcome the aforementioned limitations as highlighted in Section 7. Interestingly, state based potential games can be thought of in a complementary fashion to recent work in distributed learning algorithms (Pradelski & Young, 2012; Shamma & Arslan, 2005; Young, 2009) where an underlying state space is introduced into the learning environment to help coordinate behavior. For example, in Young (2009) the authors introduce moods for each agent that impact the agent's behavior.…”

Section: Introduction (mentioning)

confidence: 99%

State based potential games

2012


“…It would be much less reasonable in alternative models in which agents make simple forecasts about the likely directions of change in the performances of their actions before deciding which action to play. The use of such forecasts can generate dynamics with excellent convergence properties; see Shamma and Arslan (2005) and Arslan and Shamma (2006). [Footnote 3: For background on population games and evolutionary dynamics, see Sandholm (2010).]…”

Section: Motivation: Excess Payoff Dynamics In Contractive Games (mentioning)

confidence: 99%

Probabilistic Interpretations of Integrability for Game Dynamics

Sandholm

2013

Dyn Games Appl


In models of evolution and learning in games, a variety of proofs of convergence rely on the assumption that the players' choice functions are integrable. This assumption does not have an obvious game-theoretic interpretation. We address this question by introducing probability models defined in terms of piecewise smooth closed curves through R^n; these curves describe cycles in the performances of the available actions. We establish that a choice function is integrable if and only if in the probability model induced by each such curve, the rate at which players switch to a randomly drawn action is uncorrelated with a certain binary signal. The binary signal specifies whether the performance of the randomly drawn action is improving or worsening, and can also be interpreted as a signal about the performances of actions other than the one randomly drawn.
