2021
DOI: 10.48550/arxiv.2103.12726
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Policy Information Capacity: Information-Theoretic Measure for Task Complexity in Deep Reinforcement Learning

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 0 publications
0
1
0
Order By: Relevance
“…In particular, certain forms of UCB-induced exploration lead to regret bounds that are comparable to model-based methods when used in conjunction with the model-free Q-learning algorithm in the tabular case [19]. In a distinct, but related direction, the recent work [14] proposes a novel, information-theoretic measure of task complexity, called policy(-optimal) information capacity, and empirically demonstrates how it can be used to determine the difficulty of a range of common RL problems.…”
Section: Related Workmentioning
confidence: 99%
“…In particular, certain forms of UCB-induced exploration lead to regret bounds that are comparable to model-based methods when used in conjunction with the model-free Q-learning algorithm in the tabular case [19]. In a distinct, but related direction, the recent work [14] proposes a novel, information-theoretic measure of task complexity, called policy(-optimal) information capacity, and empirically demonstrates how it can be used to determine the difficulty of a range of common RL problems.…”
Section: Related Workmentioning
confidence: 99%