2012
DOI: 10.1007/s11227-011-0738-6
|View full text |Cite
|
Sign up to set email alerts
|

Towards a Multiple-Lookahead-Levels agent reinforcement-learning technique and its implementation in integrated circuits

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2013
2013
2022
2022

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(1 citation statement)
references
References 11 publications
0
1
0
Order By: Relevance
“…This scheme can attempt to determine a policy and learn a maximizing cumulative reward for a faster optimal path [15,16]. RL is typically used in multi-agent-based monitoring systems to solve the problem of learning strategies using an autonomous agent [7, 17,18]. It has emerged as an area of memory capacity and computational power since the start of the use of learning algorithms [19] in multi-agent systems.…”
Section: Introductionmentioning
confidence: 99%
“…This scheme can attempt to determine a policy and learn a maximizing cumulative reward for a faster optimal path [15,16]. RL is typically used in multi-agent-based monitoring systems to solve the problem of learning strategies using an autonomous agent [7, 17,18]. It has emerged as an area of memory capacity and computational power since the start of the use of learning algorithms [19] in multi-agent systems.…”
Section: Introductionmentioning
confidence: 99%