2021
DOI: 10.1155/2021/9963018
|View full text |Cite
|
Sign up to set email alerts
|

A Novel Behavioral Strategy for RoboCode Platform Based on Deep Q‐Learning

Abstract: This paper addresses a new machine learning-based behavioral strategy using the deep Q-learning algorithm for the RoboCode simulation platform. According to this strategy, a new model is proposed for the RoboCode platform, providing an environment for simulated robots that can be programmed to battle against other robots. Compared to Atari Games, RoboCode has a fairly wide set of actions and situations. Due to the challenges of training a CNN model for such a continuous action space problem, the inputs obtaine… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 24 publications
(32 reference statements)
0
2
0
Order By: Relevance
“…Recently, the use of deep reinforcement learning (DRL) algorithms as the fundamental control [ 26 ], navigation [ 27 ], localization [ 28 ], and planning [ 29 ] systems has gained popularity. The flexibility of artificial neural networks, along with the capacity to simulate, train, and utilize them as end-to-end solutions, has garnered interest from the research community.…”
Section: Related Workmentioning
confidence: 99%
“…Recently, the use of deep reinforcement learning (DRL) algorithms as the fundamental control [ 26 ], navigation [ 27 ], localization [ 28 ], and planning [ 29 ] systems has gained popularity. The flexibility of artificial neural networks, along with the capacity to simulate, train, and utilize them as end-to-end solutions, has garnered interest from the research community.…”
Section: Related Workmentioning
confidence: 99%
“…[32], [36] A more recent decentralized navigation research is done in [37] utilizing Neural networks, however, the proposed method utilizes a centralized unit for local communication and sharing learned features by nearby agents so it is not fully decentralized. The main subject of this research "Q-Learning" is utilized in diverse kinds of problems, such as inventory management [38], behavior design or navigation in robotics [12], [39], [40] and game play [41]- [43], and it is typically utilized in continuous state space problems by using parametric function approximation methods [40], [44]. Authors in [51] address the problem of coordinating multiagents in a decentralized fashion.…”
Section: C) Related Workmentioning
confidence: 99%