Valentin Thomas scite author profile

Valentin Thomas

4Publications

3Citation Statements Received

45Citation Statements Given

How they've been cited

How they cite others

Affiliations

Publications

Order By: Most citations

Beyond Target Networks: Improving Deep $Q$-learning with Functional Regularization

Piche¹,

Thomas²,

Marino³

et al. 2021

Preprint

View full text Add to dashboard Cite

Target networks are at the core of recent success in Reinforcement Learning. They stabilize the training by using old parameters to estimate the Q-values, but this also limits the propagation of newly-encountered rewards which could ultimately slow down the training. In this work, we propose an alternative training method based on functional regularization which does not have this deficiency. Unlike target networks, our method uses up-to-date parameters to estimate the target Q-values, thereby speeding up training while maintaining stability. Surprisingly, in some cases, we can show that target networks are a special, restricted type of functional regularizers. Using this approach, we show empirical improvements in sample efficiency and performance across a range of Atari and simulated robotics environments.

show abstract

« La police, avec nous » ? Politisation et rapport aux institutions policières dans un contexte de répression

Devaux¹,

Lang²,

Lévêque³

et al. 2022

View full text Add to dashboard Cite

On the interplay between noise and curvature and its effect on optimization and generalization

Thomas¹,

Pedregosa²,

Merriënboer³

et al. 2019

Preprint

View full text Add to dashboard Cite

Introduction

Aguiton¹,

Déplaude²,

Jas³

et al. 2021

View full text Add to dashboard Cite

HAL is a multi-disciplinary open access archive for the deposit and dissemination of scientific research documents, whether they are published or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers. L'archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d'enseignement et de recherche français ou étrangers, des laboratoires publics ou privés. Copyright

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Valentin Thomas

Beyond Target Networks: Improving Deep $Q$-learning with Functional Regularization

« La police, avec nous » ? Politisation et rapport aux institutions policières dans un contexte de répression

On the interplay between noise and curvature and its effect on optimization and generalization

Introduction

Contact Info

Product

Resources

About