2006
DOI: 10.1016/j.jebo.2004.02.007
|View full text |Cite
|
Sign up to set email alerts
|

Adapting behaviors through a learning process

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
17
0

Year Published

2009
2009
2021
2021

Publication Types

Select...
4
3
2

Relationship

0
9

Authors

Journals

citations
Cited by 26 publications
(17 citation statements)
references
References 12 publications
0
17
0
Order By: Relevance
“…We first study the convergence speed of the expected rewards defined in Equation (8). The result is depicted in Fig.…”
Section: Numerical Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…We first study the convergence speed of the expected rewards defined in Equation (8). The result is depicted in Fig.…”
Section: Numerical Resultsmentioning
confidence: 99%
“…where the so-called reference points [8], c i and π i (a i ), are specific conjecture and probability, and ω i is a positive scalar. We propose a simple rule for the SUs to configure their reference points.…”
Section: A Conjecture Based Multi-agent Qq-learning Approachmentioning
confidence: 99%
“…In the cases when both the strategy and the local expected payoff are to be learned, the AC-like, multiple-timescale learning algorithms [73] provide an efficient strategy-learning approach (e.g., stochastic FP [33]) for the agents. Further, when the joint action or the payoff of the adversary agents is not directly observable, conjecture-variation-based learning [74] works as an alternative way of the aforementioned learning algorithms. In the literature, these joint policy-value-iteration mechanisms for games are also known as the COmbined fully DIstributed PAyoff and Strategy-Reinforcement Learning (CODIPAS-RL) mechanisms [37].…”
Section: Multi-agent Strategy Learning In the Context Of Gamesmentioning
confidence: 99%
“…Herein, is the belief factor, and and are called the reference points [10]. The belief functions deployed by the SUs are based on the concept of reciprocity, which refers to the interaction mechanism that if the SUs realize the probabilities of interacting with each other in the future is high, they will consider their in uence on other SUs' strategies.…”
Section: A the Belief Functionmentioning
confidence: 99%