2003
DOI: 10.1007/978-3-540-45173-0_6
|View full text |Cite
|
Sign up to set email alerts
|

Homo Egualis Reinforcement Learning Agents for Load Balancing

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
7
0

Year Published

2005
2005
2018
2018

Publication Types

Select...
3
1
1

Relationship

1
4

Authors

Journals

citations
Cited by 5 publications
(7 citation statements)
references
References 4 publications
0
7
0
Order By: Relevance
“…In a common interest game, ESRL is able to find one of the Pareto optimal solutions of the game. In a conflicting interest game, we show that ESRL agents learn optimal fair, possibly periodical policies [17,26]. Important to know is that ESRL agents are independent in the sense that they only use their own action choices and rewards to base their decisions on, that ESRL agents are flexible in learning different solution concepts and they can handle both stochastic, possible delayed rewards and asynchronous action selection.…”
Section: Introductionmentioning
confidence: 98%
See 1 more Smart Citation
“…In a common interest game, ESRL is able to find one of the Pareto optimal solutions of the game. In a conflicting interest game, we show that ESRL agents learn optimal fair, possibly periodical policies [17,26]. Important to know is that ESRL agents are independent in the sense that they only use their own action choices and rewards to base their decisions on, that ESRL agents are flexible in learning different solution concepts and they can handle both stochastic, possible delayed rewards and asynchronous action selection.…”
Section: Introductionmentioning
confidence: 98%
“…Important to know is that ESRL agents are independent in the sense that they only use their own action choices and rewards to base their decisions on, that ESRL agents are flexible in learning different solution concepts and they can handle both stochastic, possible delayed rewards and asynchronous action selection. In [26] a job scheduling experiment is solved by conflicting interest ESRL agents. In this paper, we describe the problem of adaptive load-balancing parallel applications, handled by ESRL agents as a common interest game.…”
Section: Introductionmentioning
confidence: 99%
“…In previous literature [17,18], distributed algorithms for discovering such sequences found suboptimal ones. We will show optimal solutions, albeit with non-distributed algorithms.…”
Section: Infinite-length Gamesmentioning
confidence: 99%
“…The starting point in our research on long-term fairness was the work in [18] on "periodic policies." Their reward model comes in the form of a normal-form game, but the players are actually cooperative learning agents (rather than self-interested).…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation