Scenario-Assisted Deep Reinforcement Learning

Yerushalmi, Raz; Amir, Guy; Elyasaf, Achiya; Harel, David; Katz, Guy; Marron, Assaf

doi:10.48550/arxiv.2202.04337

Cited by 1 publication

(1 citation statement)

References 32 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…In 2022, Raz et al 2 developed a Scenario-Assisted Deep Reinforcement Learning technique for enhancing the reinforcement learning training process, which allowed engineers to directly contribute their domain knowledge, making the agent under training more likely to comply with various relevant constraints. The authors modified the reward calculation based on the constraints relevant to internet traffic control domain knowledge.…”

Section: Introductionmentioning

confidence: 99%

A new deep Q-learning method with dynamic epsilon adjustment and path planner assisted techniques for Turtlebot mobile robot

Cheng¹,

Zhang²,

Zhong³

2023

Synthetic Data for Artificial Intelligence and Machine Learning: Tools, Techniques, and Applications

View full text Add to dashboard Cite

Deep Q-learning (DQL) method has been proven a great success in autonomous mobile robots. However, the routine of DQL can often yield improper agent behavior (multiple circling-in-place actions) that comes with long training episodes until convergence. To address such problem, this project develops novel techniques that improve DQL training in both simulations and physical experiments. Specifically, the Dynamic Epsilon Adjustment method is integrated to reduce the frequency of non-ideal agent behaviors and therefore improve the control performance (i.e., goal rate). A Dynamic Window Approach (DWA) global path planner is designed in the physical training process so that the agent can reach more goals with less collision within a fixed amount of episodes. The GMapping Simultaneous Localization and Mapping (SLAM) method is also applied to provide a SLAM map to the path planner. The experiment results demonstrate that our developed approach can significantly improve the training performance in both simulation and physical training environment.

show abstract

Section: Introductionmentioning

confidence: 99%