2022
DOI: 10.48550/arxiv.2205.15460
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Critic Sequential Monte Carlo

Abstract: We introduce CriticSMC, a new algorithm for planning as inference built from a novel composition of sequential Monte Carlo with learned soft-Q function heuristic factors. This algorithm is structured so as to allow using large numbers of putative particles leading to efficient utilization of computational resource and effective discovery of high reward trajectories even in environments with difficult reward surfaces such as those arising from hard constraints. Relative to prior art our approach is notably stil… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Publication Types

Select...

Relationship

0
0

Authors

Journals

citations
Cited by 0 publications
references
References 37 publications
(69 reference statements)
0
0
0
Order By: Relevance

No citations

Set email alert for when this publication receives citations?