The process and measurement noise covariance matrices strongly influence the performance of the Extended Kalman Filter (EKF) and are often hand-tuned in practice, a tedious task. Q-learning, a well-known reinforcement learning method, has recently been applied to adapt the noise covariance matrices of the EKF, owing to its simplicity and its ability to handle uncertain environments. However, designing a Q-learning-based EKF (QLEKF) typically involves heuristics, such as choosing the grid size and the covariance matrix values for each state, which degrade the estimation performance when they are poorly chosen. We propose a dynamic grid-based Q-learning EKF (DG-QLEKF) to overcome this drawback, introducing two novelties: an updated ϵ-greedy algorithm and a dynamic grid strategy. Together, the algorithm and strategy thoroughly exploit an arbitrary search scope and find appropriate values for the noise covariance matrices. The effectiveness of DG-QLEKF, applied to attitude and bias estimation in navigation, is validated through Monte Carlo simulations and real flight data from an unmanned aerial vehicle. DG-QLEKF achieves substantially better state estimation than both the QLEKF and the traditional EKF.
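To make the core idea concrete, the following is a minimal sketch of ϵ-greedy Q-learning over a discretized grid of candidate noise-covariance scale factors, treated here as a stateless (bandit-style) problem for brevity. All names and numbers (`q_grid`, the decaying ϵ schedule, the toy reward) are illustrative assumptions, not the paper's actual design; DG-QLEKF additionally adapts the grid itself, which is omitted here.

```python
import numpy as np

rng = np.random.default_rng(0)

# Candidate scale factors for the process-noise covariance Q (assumed grid).
q_grid = np.logspace(-3, 1, 5)
# One Q-value per grid cell (stateless bandit view of the tuning problem).
q_table = np.zeros(len(q_grid))

def select_action(q_table, epsilon, rng):
    """Epsilon-greedy: explore a random cell with prob. epsilon, else exploit."""
    if rng.random() < epsilon:
        return int(rng.integers(len(q_table)))
    return int(np.argmax(q_table))

def update_q(q_table, action, reward, alpha=0.1):
    """Q-learning update, reduced to the bandit case (no next-state term)."""
    q_table[action] += alpha * (reward - q_table[action])

# Toy loop: the reward stands in for filter quality (e.g. negative squared
# innovation); here the best grid cell is index 2 by construction.
true_best = 2
for step in range(500):
    epsilon = max(0.05, 1.0 - step / 300)   # decaying exploration schedule
    a = select_action(q_table, epsilon, rng)
    reward = -(a - true_best) ** 2 + rng.normal(scale=0.1)
    update_q(q_table, a, reward)

best_scale = q_grid[int(np.argmax(q_table))]
```

In the full method, the reward would be derived from the filter's innovation statistics at each step, and the selected scale would be applied to the EKF's Q matrix before the next prediction.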