Occupancy Reward-Driven Exploration with Deep Reinforcement Learning for Mobile Robot System

Kamalova, Albina; Lee, Suk Gyu; Kwon, Soonhak

doi:10.3390/app12189249

Cited by 4 publications

(3 citation statements)

References 50 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Related studies proposed by different researchers have been based CME [ 14 ], meta-heuristics [ 37 ], and hybrid methods combining meta-heuristic with deterministic algorithms [ 38 , 39 , 40 ]. Some studies focused solely on static sensor coverage faults and robot movements in uncharted surroundings [ 41 , 42 , 43 ] as well as exploration with deep reinforcement learning for mobile robots [ 44 ]; however, they coincide in meaning but the goal of both studies is to create a finite map. The rest of the paper is organized as follows:…”

Section: Related Workmentioning

confidence: 99%

Multi-Robot Exploration of Unknown Space Using Combined Meta-Heuristic Salp Swarm Algorithm and Deterministic Coordinated Multi-Robot Exploration

Romeh

Mirjalili

2023

Sensors

View full text Add to dashboard Cite

Multi-robot exploration means constructing a finite map using a group of robots in an obstacle chaotic space. Uncertainties are reduced by distributing search tasks to robots and computing the best action in real time. Many previous methods are based on deterministic or meta-heuristic algorithms, but limited work has combined both techniques to consolidate both classes’ benefits and alleviate their drawbacks. This paper proposes a new hybrid method based on deterministic coordinated multi-robot exploration (CME) and the meta-heuristic salp swarm algorithm (SSA) to perform the search of a space. The precedence of adjacent cells around a robot is determined by deterministic CME using cost and utility. Then, the optimization process of the search space, improving the overall solution, is achieved utilizing the SSA. Three performance measures are considered to evaluate the performance of the proposed method: run time, percentage of the explored area, and the number of times when a method failed to continue a complete run. Experimental results compared four different methods, CME-GWO, CME-GWOSSA, CME-SCA, and CME, over seven maps with extra complexity varying from simple to complex. The results demonstrate how the proposed CME-SSA can outperform the four other methods. Moreover, the simulation results demonstrate that the proposed CME-SSA effectively distributes the robots over the search space to run successfully and obtain the highest exploration rate in less time.

show abstract

Section: Related Workmentioning

confidence: 99%

Multi-Robot Exploration of Unknown Space Using Combined Meta-Heuristic Salp Swarm Algorithm and Deterministic Coordinated Multi-Robot Exploration

Romeh

Mirjalili

2023

Sensors

View full text Add to dashboard Cite

show abstract

“…Hyperparameter tuning was critical to the success of this method [8]. A different approach, called Occupancy-Reward-Driven Exploration [9], has been applied in robotics to explore uncharted territories within the state space. In this technique, an occupancy map is utilized to acquire information about the environment through sensors such as a laser sensor.…”

Section: Introductionmentioning

confidence: 99%

“…The robot's reward is then determined by the number of new segments discovered within the occupancy map at each time step. This approach can also improve the robot's power efficiency [9].…”

Section: Introductionmentioning

confidence: 99%

Signal Novelty Detection as an Intrinsic Reward for Robotics

Kubovčík

Ľuptáková

Pospı́chal

2023

Sensors

View full text Add to dashboard Cite

In advanced robot control, reinforcement learning is a common technique used to transform sensor data into signals for actuators, based on feedback from the robot’s environment. However, the feedback or reward is typically sparse, as it is provided mainly after the task’s completion or failure, leading to slow convergence. Additional intrinsic rewards based on the state visitation frequency can provide more feedback. In this study, an Autoencoder deep learning neural network was utilized as novelty detection for intrinsic rewards to guide the search process through a state space. The neural network processed signals from various types of sensors simultaneously. It was tested on simulated robotic agents in a benchmark set of classic control OpenAI Gym test environments (including Mountain Car, Acrobot, CartPole, and LunarLander), achieving more efficient and accurate robot control in three of the four tasks (with only slight degradation in the Lunar Lander task) when purely intrinsic rewards were used compared to standard extrinsic rewards. By incorporating autoencoder-based intrinsic rewards, robots could potentially become more dependable in autonomous operations like space or underwater exploration or during natural disaster response. This is because the system could better adapt to changing environments or unexpected situations.

show abstract

Mastering the Cosmos: Leveraging Optimization Methods for Advances in Space Exploration

Gul,

Mir

2023

Handbook of Formal Optimization

View full text Add to dashboard Cite

Occupancy Reward-Driven Exploration with Deep Reinforcement Learning for Mobile Robot System

Cited by 4 publications

References 50 publications

Multi-Robot Exploration of Unknown Space Using Combined Meta-Heuristic Salp Swarm Algorithm and Deterministic Coordinated Multi-Robot Exploration

Multi-Robot Exploration of Unknown Space Using Combined Meta-Heuristic Salp Swarm Algorithm and Deterministic Coordinated Multi-Robot Exploration

Signal Novelty Detection as an Intrinsic Reward for Robotics

Mastering the Cosmos: Leveraging Optimization Methods for Advances in Space Exploration

Contact Info

Product

Resources

About