2019
DOI: 10.48550/arxiv.1903.07438
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Exploiting Hierarchy for Learning and Transfer in KL-regularized RL

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
43
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 13 publications
(43 citation statements)
references
References 0 publications
0
43
0
Order By: Relevance
“…While KL-regularized RL has achieved success across various settings [4,7,19,12], recently Tirumala et al [14] proposed a hierarchical extension where policy π and prior π 0 are augmented with latent variables, π(a, z|x, k) = π H (z|x, k)π L (a|z, x) and π 0 (a, z|x) = π H 0 (z|x)π L 0 (a|z, x), where subscripts H and L denote the higher and lower hierarchical levels. This structure encourages the shared low-level policy (π L = π L 0 ) to discover task-agnostic behavioural primitives, whilst the high-level discovers higher-level skills relevant to each task.…”
Section: Hierarchical Kl-regularized Rlmentioning
confidence: 99%
See 4 more Smart Citations
“…While KL-regularized RL has achieved success across various settings [4,7,19,12], recently Tirumala et al [14] proposed a hierarchical extension where policy π and prior π 0 are augmented with latent variables, π(a, z|x, k) = π H (z|x, k)π L (a|z, x) and π 0 (a, z|x) = π H 0 (z|x)π L 0 (a|z, x), where subscripts H and L denote the higher and lower hierarchical levels. This structure encourages the shared low-level policy (π L = π L 0 ) to discover task-agnostic behavioural primitives, whilst the high-level discovers higher-level skills relevant to each task.…”
Section: Hierarchical Kl-regularized Rlmentioning
confidence: 99%
“…However, when combined, one can in theory discover and leverage multiple skill abstractions. Whilst prior works [14] have attempted this, they were unable to yield transfer benefits from a learnt prior.…”
Section: Introductionmentioning
confidence: 99%
See 3 more Smart Citations