2023
DOI: 10.1109/access.2023.3249572
Bridging the Reality Gap Between Virtual and Physical Environments Through Reinforcement Learning

Abstract: Creating reinforcement learning (RL) agents that can perform tasks on real-world robotic systems remains challenging due to inconsistencies between the virtual and the real world. This is known as the "reality gap", which hinders the performance of an RL agent trained in a virtual environment. The research describes the techniques used to train the models, generate randomized environments, design the reward function, and transfer the model to the physical environment for evaluation. For t…

Cited by 3 publications (11 citation statements)
References 33 publications (31 reference statements)
“…The Godot game engine provides a versatile and accessible platform that can be used to train DRL models. This framework was utilized in a recent study [29] to bridge the reality gap between virtual reality and reality using a 3-DoF Stewart platform. This research demonstrates the versatility and efficacy of using Godot game engines in DRL applications.…”
Section: Discussion
confidence: 99%
“…In this case, the environment is created in a game engine, which allows the randomization of attributes such as simulated fidelity, physics dynamics, lighting conditions, specular highlights, textures, and object positions and their orientation [22]. These augmentations in the virtual environment create complexity and diversity, allowing the agent to adjust its behavior progressively over time [29].…”
Section: Domain Randomization
confidence: 99%
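The domain randomization described in the statement above — resampling attributes such as physics dynamics, lighting, and textures so the agent generalizes across variations — can be illustrated with a minimal sketch. The parameter names and ranges below are hypothetical, not taken from the cited papers:

```python
import random
from dataclasses import dataclass

@dataclass
class EnvConfig:
    """One randomized draw of simulated environment attributes."""
    gravity: float          # physics dynamics
    friction: float         # surface property
    light_intensity: float  # lighting conditions
    texture_id: int         # which texture to apply

def randomize_environment(rng: random.Random) -> EnvConfig:
    # Resample every attribute at the start of each training episode,
    # so no single simulator configuration is ever memorized.
    return EnvConfig(
        gravity=rng.uniform(9.0, 10.6),
        friction=rng.uniform(0.4, 1.2),
        light_intensity=rng.uniform(0.5, 1.5),
        texture_id=rng.randrange(16),
    )

rng = random.Random(0)
configs = [randomize_environment(rng) for _ in range(1000)]
```

In practice each `EnvConfig` would be pushed into the game engine (e.g. via an exported scene parameter in Godot) before the episode begins, widening the training distribution so the real world looks like just another sample.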
“…Additionally, Iriondo A et al employed the Twin Delayed Deep Deterministic Policy Gradient (TD3) method [17] to study the operation of picking up objects from a table using a mobile manipulator. Ranaweera M and colleagues enhanced training outcomes through domain randomization and the introduction of noise during the reinforcement learning process [18]. These methods share a core principle of incorporating probabilistic approaches to significantly reduce the impact of ineffective actions.…”
Section: Introduction
confidence: 99%
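The noise injection credited to Ranaweera et al. above can be sketched as perturbing each observation with zero-mean Gaussian noise before it reaches the policy. This is a minimal illustration assuming an observation-noise model; the exact noise type and placement in [18] may differ:

```python
import random

def add_observation_noise(obs: list[float], sigma: float,
                          rng: random.Random) -> list[float]:
    """Perturb each observation component with zero-mean Gaussian noise,
    mimicking the sensor imprecision the agent will face on real hardware."""
    return [x + rng.gauss(0.0, sigma) for x in obs]

rng = random.Random(42)
clean = [0.1, -0.3, 1.2]       # e.g. platform tilt angles from simulation
noisy = add_observation_noise(clean, sigma=0.05, rng=rng)
```

Training on `noisy` rather than `clean` observations forces the policy to be robust to measurement error, which is one concrete way such probabilistic perturbations reduce the impact of ineffective actions at transfer time.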