2020
DOI: 10.1155/2020/2672537
|View full text |Cite
|
Sign up to set email alerts
|

Exploration Entropy for Reinforcement Learning

Abstract: The training process analysis and termination condition of the training process of a Reinforcement Learning (RL) system have always been the key issues to train an RL agent. In this paper, a new approach based on State Entropy and Exploration Entropy is proposed to analyse the training process. The concept of State Entropy is used to denote the uncertainty for an RL agent to select the action at every state that the agent will traverse, while the Exploration Entropy denotes the action selection uncertainty of … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
7
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
1
1
1

Relationship

0
9

Authors

Journals

citations
Cited by 14 publications
(7 citation statements)
references
References 35 publications
(36 reference statements)
0
7
0
Order By: Relevance
“…This draws on the Exploration Entropy in a full reinforcement learning problem [40] where multiple states are associated with an agent.…”
Section: Uncertainty Evaluation Of Proactive Caching Systemsmentioning
confidence: 99%
“…This draws on the Exploration Entropy in a full reinforcement learning problem [40] where multiple states are associated with an agent.…”
Section: Uncertainty Evaluation Of Proactive Caching Systemsmentioning
confidence: 99%
“…e problem of cold start is also unable to adapt to the short-term interest changes of users and make effective information recommendations. erefore, many scholars began to try to use reinforcement learning [23] to solve the problems in the recommendation system. Reinforcement learning is a learning algorithm based on the interaction of the environment.…”
Section: Related Workmentioning
confidence: 99%
“…Later, through the study of many scientists, a relatively complete system, approximate dynamic programming was formed. Reinforcement learning [23] is a dynamic interactive learning strategy algorithm.…”
Section: Related Workmentioning
confidence: 99%
“…This draws on the Exploration Entropy in a full reinforcement learning problem [38] where multiple states are associated with an agent.…”
Section: Uncertainty Evaluation Of Proactive Caching Systemsmentioning
confidence: 99%