2019
DOI: 10.48550/arxiv.1910.14033
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Plan Arithmetic: Compositional Plan Vectors for Multi-Task Control

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2020
2020
2020
2020

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 0 publications
0
1
0
Order By: Relevance
“…Ordered-goal tasks have also been recently proposed in RL for training agents with symbolic and temporal rules [13][14][15][16][17][18][19][20]. Policy-concatenation (stitching) has also been proposed as a means of skill-learning in the Options framework [21][22][23], as well as Hierarchical Control [24,12], and other schemes for concatenating polices such as policy sketches (using modular polices and task-specific reward functions) [25], or compositional plan vectors [26]. Lastly, a compositional Boolean task algebra was recently introduced demonstrating zero-shot transfer for non-sequential polices [27].…”
Section: Introductionmentioning
confidence: 99%
“…Ordered-goal tasks have also been recently proposed in RL for training agents with symbolic and temporal rules [13][14][15][16][17][18][19][20]. Policy-concatenation (stitching) has also been proposed as a means of skill-learning in the Options framework [21][22][23], as well as Hierarchical Control [24,12], and other schemes for concatenating polices such as policy sketches (using modular polices and task-specific reward functions) [25], or compositional plan vectors [26]. Lastly, a compositional Boolean task algebra was recently introduced demonstrating zero-shot transfer for non-sequential polices [27].…”
Section: Introductionmentioning
confidence: 99%