Amaury Gouverneur scite author profile

Amaury Gouverneur

4Publications

4Citation Statements Received

34Citation Statements Given

How they've been cited

How they cite others

Affiliations

KTH Royal Institute of Technology, Université Catholique de Louvain

Publications

Order By: Most citations

Optimal Measurement Budget Allocation For Particle Filtering

Aspeel

Gouverneur

Jungers

et al. 2020

View full text Add to dashboard Cite

Particle filtering is a powerful tool for target tracking. When the budget for observations is restricted, it is necessary to reduce the measurements to a limited amount of samples carefully selected. A discrete stochastic nonlinear dynamical system is studied over a finite time horizon. The problem of selecting the optimal measurement times for particle filtering is formalized as a combinatorial optimization problem. We propose an approximated solution based on the nesting of a genetic algorithm, a Monte Carlo algorithm and a particle filter. Firstly, an example demonstrates that the genetic algorithm outperforms a random trial optimization. Then, the interest of non-regular measurements versus measurements performed at regular time intervals is illustrated and the efficiency of our proposed solution is quantified: better filtering performances are obtained in 87.5% of the cases and on average, the relative improvement is 27.7%.

show abstract

Optimal measurement budget allocation for particle filtering

Aspeel¹,

Gouverneur²,

Jungers³

et al. 2020

Preprint

View full text Add to dashboard Cite

An Information-Theoretic Analysis of Bayesian Reinforcement Learning

Gouverneur

Rodríguez-Gálvez

Oechtering

et al. 2022

View full text Add to dashboard Cite

An Information-Theoretic Analysis of Bayesian Reinforcement Learning

Gouverneur¹,

Rodríguez-Gálvez²,

Oechtering³

et al. 2022

Preprint

View full text Add to dashboard Cite

Building on the framework introduced by Xu and Raginksy [1] for supervised learning problems, we study the best achievable performance for model-based Bayesian reinforcement learning problems. With this purpose, we define minimum Bayesian regret (MBR) as the difference between the maximum expected cumulative reward obtainable either by learning from the collected data or by knowing the environment and its dynamics. We specialize this definition to reinforcement learning problems modeled as Markov decision processes (MDPs) whose kernel parameters are unknown to the agent and whose uncertainty is expressed by a prior distribution. One method for deriving upper bounds on the MBR is presented and specific bounds based on the relative entropy and the Wasserstein distance are given. We then focus on two particular cases of MDPs, the multi-armed bandit problem (MAB) and the online optimization with partial feedback problem. For the latter problem, we show that our bounds can recover from below the current information-theoretic bounds by Russo and Van Roy [2].

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.