2021
DOI: 10.48550/arxiv.2112.07746
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

CEM-GD: Cross-Entropy Method with Gradient Descent Planner for Model-Based Reinforcement Learning

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(3 citation statements)
references
References 0 publications
0
3
0
Order By: Relevance
“…Remark 3. The matrix F and vector e (see (13)) have the dimension of 3(n o + 2)n p × 3n v and 3(n 0 + 2)n p and their complexity grow linearly with the number of obstacles n o and the planning horizon n p .…”
Section: Discussionmentioning
confidence: 99%
See 2 more Smart Citations
“…Remark 3. The matrix F and vector e (see (13)) have the dimension of 3(n o + 2)n p × 3n v and 3(n 0 + 2)n p and their complexity grow linearly with the number of obstacles n o and the planning horizon n p .…”
Section: Discussionmentioning
confidence: 99%
“…IV. CONNECTIONS TO EXISTING WORKS Connection to CEM-GD: Alternate approaches of combining sampling and gradient-based approach were presented recently in [4], [13]. In these two cited works, the projection at line 5 of Alg.1 is replaced with a gradient step of the form ξ i = ξ i − σ∇ ξ c 1 , for some learning-rate σ.…”
Section: Decenteralized Priest (D-priest)mentioning
confidence: 99%
See 1 more Smart Citation