2021
DOI: 10.1007/s40313-021-00786-x

An Improved Q-learning Approach with Kalman Filter for Self-balancing Robot Using OpenAI

Abstract: A two-wheeled self-balancing robot (SBR) is a typical example in control systems that works on the principle of an inverted pendulum. In this paper, we experiment to see how the learning and stability performance varies based on Kalman filter introduction for IMU noise filtering and controlling the robot using reinforcement learning. All the implementation is performed in ROS and Gazebo, and Q-learning is implemented using OpenAI (toolkit for development of Reinforcement learning) for ROS, i.e., Openai_ros pac…

Cited by 5 publications (7 citation statements); references 8 publications.
“…Recently, the authors in Reference 18 published research involving the use of KF to improve reinforcement learning applied to the control of a two-wheeled robot, similar to the inverted pendulum problem. Different from our proposal, which uses EKF as a predictor of future robot states, the authors apply KF to filter out noisy information in sensory data before being used as a state in Q-Learning.…”

Section: Related Work (confidence: 99%)
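The filtering step described in this statement — smoothing a noisy IMU reading with a Kalman filter before discretizing it into a Q-learning state — can be sketched as follows. This is a minimal illustrative sketch, not the paper's actual implementation: the scalar random-walk model, the noise constants, and the tilt-angle bins are all assumptions chosen for demonstration.

```python
import numpy as np

def kalman_step(x, P, z, Q=1e-4, R=0.05):
    """One predict/update cycle of a 1-D Kalman filter (random-walk model)."""
    # Predict: the tilt angle is modeled as constant; uncertainty grows by Q.
    P = P + Q
    # Update: blend the prediction with measurement z via the Kalman gain K.
    K = P / (P + R)
    x = x + K * (z - x)
    P = (1.0 - K) * P
    return x, P

def discretize(angle, bins):
    """Map a filtered angle to a discrete Q-learning state index."""
    return int(np.digitize(angle, bins))

rng = np.random.default_rng(0)
true_angle = 0.12                    # rad; robot tilt held constant here
bins = np.linspace(-0.5, 0.5, 11)    # illustrative tilt-angle bin edges

x, P = 0.0, 1.0
for _ in range(200):
    z = true_angle + rng.normal(0.0, 0.2)  # noisy IMU measurement
    x, P = kalman_step(x, P, z)

state = discretize(x, bins)
print(round(x, 3), state)
```

Feeding the filtered estimate `x` (rather than the raw reading `z`) into the state index is the core idea the citing authors attribute to Reference 18: the Q-table then indexes on a far less noisy tilt signal.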
“…This technique has been shown to improve the accuracy of states measured by a sensor, in this case the IMU (inertial measurement unit), thus it increases the stabilization of the obtained rewards and the transient response of the control system. Therefore, from the results presented in Reference 18, we are led to investigate if in addition to providing greater stabilization, KF can accelerate a newer or current reinforcement learning technique such as DQN.…”

Section: Related Work (confidence: 99%)
“…The true values of the weight parameters of the deep Q-network may have some deviation or fluctuation from the values we train, which may come from factors such as noise in the training data, randomness of the optimization algorithm, and complexity of the network structure. The uncertainty of network parameters can affect the performance and stability of the network, so it is necessary to analyze and filter out the uncertainty of network parameters [16].…”

Section: DQN-EKF Algorithm (confidence: 99%)
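The idea of filtering parameter uncertainty can be illustrated with an EKF-style update that tracks both the value and the variance of a single Q-function weight, treating each noisy TD target as an observation. This is a hypothetical scalar sketch under assumed constants, not the DQN-EKF algorithm of the citing paper; for a linear observation model h(w) = feat · w the Jacobian is just the feature itself.

```python
import numpy as np

def ekf_weight_update(w, P, feat, target, Q=1e-3, R=0.5):
    """EKF step for a linear-in-features Q(s, a) = feat * w (scalar case)."""
    # Predict: random-walk model for the weight; uncertainty grows by Q.
    P = P + Q
    # Observation model h(w) = feat * w, so the Jacobian H is simply feat.
    H = feat
    innovation = target - H * w   # TD target minus current prediction
    S = H * P * H + R             # innovation variance
    K = P * H / S                 # Kalman gain
    w = w + K * innovation
    P = (1.0 - K * H) * P
    return w, P

rng = np.random.default_rng(1)
w_true = 2.0                      # weight the noisy targets are generated from
w, P = 0.0, 1.0
for _ in range(300):
    feat = rng.uniform(0.5, 1.5)
    target = feat * w_true + rng.normal(0.0, 0.5)  # noisy TD target
    w, P = ekf_weight_update(w, P, feat, target)

print(round(w, 2), round(P, 4))
```

The shrinking variance `P` is what "filtering out the uncertainty of network parameters" amounts to in this toy setting: the weight estimate converges while its tracked uncertainty collapses toward a small steady-state value.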