2004
DOI: 10.1007/978-3-540-30115-8_18
|View full text |Cite
|
Sign up to set email alerts
|

Analyzing Multi-agent Reinforcement Learning Using Evolutionary Dynamics

Abstract: Abstract. In this paper, we show how the dynamics of Q-learning can be visualized and analyzed from a perspective of Evolutionary Dynamics (ED). More specifically, we show how ED can be used as a model for Qlearning in stochastic games. Analysis of the evolutionary stable strategies and attractors of the derived ED from the Reinforcement Learning (RL) application then predict the desired parameters for RL in MultiAgent Systems (MASs) to achieve Nash equilibriums with high utility. Secondly, we show how the der… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2005
2005
2013
2013

Publication Types

Select...
3
2

Relationship

1
4

Authors

Journals

citations
Cited by 9 publications
(2 citation statements)
references
References 18 publications
0
2
0
Order By: Relevance
“…In this section we will show the dynamics of Cross learning and Q-learning and illustrate them on different examples [40,45,48]. First we discuss the main result of Börgers and Sarin [5], i.e.…”
Section: Relating Evolution and Reinforcement Learningmentioning
confidence: 99%
“…In this section we will show the dynamics of Cross learning and Q-learning and illustrate them on different examples [40,45,48]. First we discuss the main result of Börgers and Sarin [5], i.e.…”
Section: Relating Evolution and Reinforcement Learningmentioning
confidence: 99%
“…Many such tools are particularly suited to only certain learning methods, and only a few offer a common framework for multiple learning techniques. Evolutionary game theory is the main tool in this second category, and it was successfully used to study the properties of cooperative coevolution [78,296], to visualize basins of attraction to Nash equilibria for cooperative coevolution [191], and to study trajectories of concurrent Q-learning processes [271,263]. Another tool for modeling and predicting the dynamics of concurrent multi-agent learners was recently proposed in Vidal and Durfee [276,277].…”
Section: The Dynamics Of Learningmentioning
confidence: 99%