2017
DOI: 10.48550/arxiv.1710.10044
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Distributional Reinforcement Learning with Quantile Regression

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
33
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 17 publications
(33 citation statements)
references
References 0 publications
0
33
0
Order By: Relevance
“…To address the issue of risk in RL, recent works [6], [7] have introduced the concept of distributional RL. Distributional RL learns the distribution of accumulated rewards, Fig.…”
Section: Imentioning
confidence: 99%
See 2 more Smart Citations
“…To address the issue of risk in RL, recent works [6], [7] have introduced the concept of distributional RL. Distributional RL learns the distribution of accumulated rewards, Fig.…”
Section: Imentioning
confidence: 99%
“…Empirically, distributional RL algorithms have shown superior performance and sample efficiency in many game domains [7], [36]. It has been argued that this is because predicting quantiles serves as an auxiliary task that enhances representation learning, but as yet there is little supporting evidence for this conjecture [12].…”
Section: Distributional Rl and Risk-sensitive Policiesmentioning
confidence: 99%
See 1 more Smart Citation
“…We propose several Reverse RL algorithms and prove their convergence under linear function approximation. We also propose Distributional Reverse RL algorithms akin to Distributional RL (Bellemare et al, 2017;Dabney et al, 2017;Rowland et al, 2018) to compute the probability of an event for anomaly detection. We demonstrate empirically the utility of Reverse GVFs in anomaly detection and representation learning.…”
Section: L1 L2 L4 L3mentioning
confidence: 99%
“…We now provide a practical algorithm to approximate η s π based on quantile regression, akin to Dabney et al (2017). We use N quantiles with quantile levels {τ i } i=1,...,N , where τ i .…”
Section: Reverse General Value Functionmentioning
confidence: 99%