How to train your differentiable filter

Kloss, Alina; Martius, Georg; Bohg, Jeannette

doi:10.1007/s10514-021-09990-9

Cited by 30 publications

(45 citation statements)

References 16 publications

(37 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…This is particularly true for some of the richest, most information-dense sensing modalities, such as images, audio, or tactile feedback, as well as in interaction-rich applications that need to reason about difficult-to-model contact dynamics. These systems also tend to have more complex, heteroscedastic (variable) noise profiles [6]. For example, an object position estimate from an image might rapidly switch from being extremely precise under nominal operating conditions to completely useless under occlusion or poor lighting.…”

Section: Stanfordmentioning

confidence: 99%

“…To retain the benefits of a probabilistic state estimator while circumventing the need for analytical models, a recent line of work has shown that we can treat Bayesian filters as a differentiable component of a computation graph [6][7][8][9][10]. These differentiable filters allow end-to-end estimation errors to be backpropagated directly through the structure of the estimator itself, enabling data-driven learning for system models and uncertainties that are optimized for a specific state estimation setting.…”

Section: Stanfordmentioning

confidence: 99%

“…Each of these works demonstrate that the structure of a Bayesian filter can be used to improve the performance of a learned state estimator when compared to LSTM-based methods on visual robot localization tasks. In addition to EKFs and Particle filters, Kloss et al [6] analyses differentiable Unscented Kalman filters and the value of learning heteroscedastic noise models for both KITTI and a planar pushing task. Lee et al [10] explore differentiable filtering architectures for manipulation tasks involving both vision and touch.…”

Section: B Differentiable Filteringmentioning

confidence: 99%

“…In the first set of experiments, we study a synthetic visual tracking environment that has been used to evaluate differentiable filters [6,7]. This environment enables full control over the process noise and observation complexity of the underlying system.…”

Section: A Visual Trackingmentioning

confidence: 99%

See 3 more Smart Citations

Differentiable Factor Graph Optimization for Learning Smoothers

Yi¹,

Lee²,

Kloss³

et al. 2021

Preprint

Self Cite

View full text Add to dashboard Cite

Section: Stanfordmentioning

confidence: 99%

Section: Stanfordmentioning

confidence: 99%

Section: B Differentiable Filteringmentioning

confidence: 99%

Section: A Visual Trackingmentioning

confidence: 99%

See 2 more Smart Citations

Differentiable Factor Graph Optimization for Learning Smoothers

Yi¹,

Lee²,

Kloss³

et al. 2021

Preprint

Self Cite

View full text Add to dashboard Cite

“…In [25], they utilise conditional normalisation flows to construct flexible probability distributions for differentiable particle filters. A comparison of differentiable filters can be seen in [26]. The approach described in this paper differs from this body of previous work since we focus on ensuring resampling is differentiable without having to change how resampling operates.…”

Section: Introductionmentioning

confidence: 99%

Efficient Learning of the Parameters of Non-Linear Models using Differentiable Resampling in Particle Filters

Rosato,

Beraud,

Horridge

et al. 2021

Preprint

View full text Add to dashboard Cite

It has been widely documented that the sampling and resampling steps in particle filters cannot be differentiated. The reparameterisation trick was introduced to allow the sampling step to be reformulated into a differentiable function. We extend the reparameterisation trick to include the stochastic input to resampling therefore limiting the discontinuities in the gradient calculation after this step. Knowing the gradients of the prior and likelihood allows us to run particle Markov Chain Monte Carlo (p-MCMC) and use the No-U-Turn Sampler (NUTS) as the proposal when estimating parameters.We compare the Metropolis-adjusted Langevin algorithm (MALA), Hamiltonian Monte Carlo with different number of steps and NUTS. We consider two state-space models and show that NUTS improves the mixing of the Markov chain and can produce more accurate results in less computational time.

show abstract