2023
DOI: 10.3390/e25020188
|View full text |Cite
|
Sign up to set email alerts
|

Maximum Entropy Exploration in Contextual Bandits with Neural Networks and Energy Based Models

Abstract: Contextual bandits can solve a huge range of real-world problems. However, current popular algorithms to solve them either rely on linear models or unreliable uncertainty estimation in non-linear models, which are required to deal with the exploration–exploitation trade-off. Inspired by theories of human cognition, we introduce novel techniques that use maximum entropy exploration, relying on neural networks to find optimal policies in settings with both continuous and discrete action spaces. We present two cl… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
references
References 44 publications
(66 reference statements)
0
0
0
Order By: Relevance