2022
DOI: 10.48550/arxiv.2202.01461
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

ExPoSe: Combining State-Based Exploration with Gradient-Based Online Search

Abstract: A tree-based online search algorithm iteratively simulates trajectories and updates Q-value information on a set of states represented by a tree structure. Alternatively, policy gradient based online search algorithms update the information obtained from simulated trajectories directly onto the parameters of the policy and has been found to be effective. While tree-based methods limit the updates from simulations to the states that exist in the tree and do not interpolate the information to nearby states, poli… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Publication Types

Select...

Relationship

0
0

Authors

Journals

citations
Cited by 0 publications
references
References 16 publications
(29 reference statements)
0
0
0
Order By: Relevance

No citations

Set email alert for when this publication receives citations?