2010
DOI: 10.1007/s11227-010-0451-x
|View full text |Cite
|
Sign up to set email alerts
|

Reinforcement learning technique using agent state occurrence frequency with analysis of knowledge sharing on the agent’s learning process in multiagent environments

Abstract: Reinforcement learning techniques like the Q-Learning one as well as the Multiple-Lookahead-Levels one that we introduced in our prior work require the agent to complete an initial exploratory path followed by as many hypothetical and physical paths as necessary to find the optimal path to the goal. This paper introduces a reinforcement learning technique that uses a distance measure to the goal as a primary gauge for an autonomous agent's action selection. In this paper, we take advantage of the first random … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2010
2010
2022
2022

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 11 publications
(2 citation statements)
references
References 16 publications
(26 reference statements)
0
2
0
Order By: Relevance
“…This scheme can attempt to determine a policy and learn a maximizing cumulative reward for a faster optimal path [15,16]. RL is typically used in multi-agent-based monitoring systems to solve the problem of learning strategies using an autonomous agent [7, 17,18]. It has emerged as an area of memory capacity and computational power since the start of the use of learning algorithms [19] in multi-agent systems.…”
Section: Introductionmentioning
confidence: 99%
“…This scheme can attempt to determine a policy and learn a maximizing cumulative reward for a faster optimal path [15,16]. RL is typically used in multi-agent-based monitoring systems to solve the problem of learning strategies using an autonomous agent [7, 17,18]. It has emerged as an area of memory capacity and computational power since the start of the use of learning algorithms [19] in multi-agent systems.…”
Section: Introductionmentioning
confidence: 99%
“…In particular, a multi-agent Q-learning algorithm for service composition based on this model was also proposed. Authors in [15] presented a learning automatabased adaptive uniform fractional guard channel algorithm, and authors in [16] concentrated on the reinforcement learning technique application to the agent state occurrence frequency with analysis of knowledge sharing on the agent's learning process in multi-agent environments, and [17] applied reinforcement learning to large state spaces.…”
Section: Introductionmentioning
confidence: 99%