2020
DOI: 10.1002/asjc.2336

Q‐learning for noise covariance adaptation in extended Kalman filter

Abstract: The extended Kalman filter (EKF) is a widely used method in navigation applications. The EKF suffers from noise covariance uncertainty, which can cause it to perform poorly in practice. This paper attempts to suppress the unfavorable effect of noise covariance uncertainty on the EKF in the framework of reinforcement learning. The proposed state estimation algorithm combines the EKF and a Q-learning method, where a covariance adaptation strategy is designed based on the Q-values, leading to a gradual im…

Cited by 25 publications (12 citation statements) | References 40 publications
“…Each time after executing an action a, the agent receives a response from the environment, which is translated into a reward (R) indicating how good the action is. Significantly, Q-learning at its core seeks to maximize the cumulative reward by performing the best action at each state [12]. The cumulative reward is stored as the Q-value through the Q-learning update rule

Q(s, a) ← Q(s, a) + α[R + γ max_{a′} Q(s′, a′) − Q(s, a)],  (11)

where Q(s, a) ∈ R is the Q-value for action a in state s, R ∈ R is the reward gained by executing action a in state s, s′ is the next state, α is the learning rate, and γ is the discount factor.…”
Section: Preliminaries On Q-learning Approach
Mentioning confidence: 99%
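
For reference, a minimal Python sketch of the tabular update rule (11); the state/action encoding, reward, and hyperparameter values here are illustrative assumptions, not taken from the cited paper.

# Tabular Q-learning update as in Eq. (11):
# Q(s,a) <- Q(s,a) + alpha * (R + gamma * max_a' Q(s',a') - Q(s,a))
from collections import defaultdict

ALPHA = 0.1   # learning rate alpha (illustrative value)
GAMMA = 0.9   # discount factor gamma (illustrative value)

Q = defaultdict(float)  # Q[(state, action)] -> Q-value, zero-initialized

def q_update(state, action, reward, next_state, actions):
    # Greedy bootstrap over the successor state's actions.
    best_next = max(Q[(next_state, a)] for a in actions)
    td_error = reward + GAMMA * best_next - Q[(state, action)]
    Q[(state, action)] += ALPHA * td_error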
“…Recent advancements in Reinforcement Learning (RL) have made it appealing for coping with uncertain environments. Specifically, this work is motivated by the strength of the Q-learning method [9]–[12], in which an intelligent agent learns how to take actions in an environment with uncertain parameters. To the best of our knowledge, the adaptation of the process and measurement noise covariance matrices of the EKF based on Q-learning has not yet been addressed within the scope of attitude and related state estimation with MARG sensors.…”
Section: Introduction
Mentioning confidence: 99%
“…The first group comprises methods that tune the parameters of the KF, such as the process noise covariance matrix Q, the measurement noise covariance matrix R, weighting factors, etc., where those parameters are predicted by AI techniques [28]–[44]. In particular, many studies in this group focus on tuning the noise covariance matrices.…”
Section: Tuning Parameters Of KF
Mentioning confidence: 99%
“…Second, it is sufficient for Q-learning to explore and exploit all directions from the center element. Third, all of its 9 elements can be visited within a shorter learning period, compared to the larger grids in [8], [10]. Fourth, by dynamically updating the ratios q and r and the central element (Q_c, R_c), it allows Q-learning to search an arbitrarily large scope (by manipulating q and r) with arbitrarily high precision (by setting q and r close to 1).…”
Section: A. The Dynamic Grid And Updated ϵ-Greedy Algorithm
Mentioning confidence: 99%
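
To make the grid concrete, here is a hedged Python sketch assuming scalar center elements Q_c and R_c scaled by the ratios q and r to form the 3×3 grid, with an ϵ-greedy choice over its 9 elements; the cited work operates on covariance matrices and re-centers the grid dynamically, so this is a simplification.

import random

def make_grid(Qc, Rc, q, r):
    # 9 candidate (Q, R) pairs: scale Qc by {1/q, 1, q} and Rc by {1/r, 1, r};
    # ratios closer to 1 give a finer, higher-precision grid.
    return [(Qc * fq, Rc * fr)
            for fq in (1.0 / q, 1.0, q)
            for fr in (1.0 / r, 1.0, r)]

def epsilon_greedy(q_values, epsilon):
    # Explore a random grid index with probability epsilon, else exploit.
    if random.random() < epsilon:
        return random.randrange(len(q_values))
    return max(range(len(q_values)), key=q_values.__getitem__)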
“…As one of the most important reinforcement learning methods, Q-learning [6] has drawn increasing interest for adapting the noise covariance matrices of the EKF [7], [8], owing to its model-free algorithm, low computational demand, and capability of achieving optimality in Markov decision processes [9]. In our previous work [10], a Q-learning-based EKF (QLEKF) was proposed to autonomously adapt the values of the process and measurement noise covariance matrices in the attitude estimation of a rigid body.…”
Section: Introduction
Mentioning confidence: 99%
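
For intuition, a schematic sketch of one QLEKF step; the ekf and agent interfaces are hypothetical placeholders, and the innovation-based reward is an assumption made here for illustration, not necessarily the reward defined in [10].

import numpy as np

def qlekf_step(ekf, agent, z):
    # The agent picks a (Q, R) pair, e.g. epsilon-greedy over its grid.
    Qk, Rk = agent.select_action()
    ekf.predict(Q=Qk)                     # time update with the chosen Q
    innovation = z - ekf.predicted_measurement()
    ekf.update(z, R=Rk)                   # measurement update with the chosen R
    reward = -float(np.linalg.norm(innovation))
    agent.learn(reward)                   # Q-value update as in Eq. (11)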