2022
DOI: 10.1109/tro.2022.3184837
|View full text |Cite
|
Sign up to set email alerts
|

Model-Based Policy Search Using Monte Carlo Gradient Estimation With Real Systems Application

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
3

Relationship

0
6

Authors

Journals

citations
Cited by 10 publications
(6 citation statements)
references
References 23 publications
0
6
0
Order By: Relevance
“…In this section, we evaluate the performance of the control policies learned by the different VF-MC-PILCO setups and by MC-PILCO4PMS. Notice that MC-PILCO4PMS achieved results comparable to or better than other state-of-the-art GPbased MBRL algorithms, see [11]. The cumulative costs and success rates obtained at each trial in the 50 experiments are reported in Fig.…”
Section: B Policy Learning Resultsmentioning
confidence: 78%
See 4 more Smart Citations
“…In this section, we evaluate the performance of the control policies learned by the different VF-MC-PILCO setups and by MC-PILCO4PMS. Notice that MC-PILCO4PMS achieved results comparable to or better than other state-of-the-art GPbased MBRL algorithms, see [11]. The cumulative costs and success rates obtained at each trial in the 50 experiments are reported in Fig.…”
Section: B Policy Learning Resultsmentioning
confidence: 78%
“…We compare the proposed approach with the s.o.t.a. MBRL algorithm specifically designed to deal with partial state measurability of real mechanical systems, MC-PILCO4PMS [11]. MC-PILCO4PMS follows a particle-based policy gradient framework similar to the one depicted in Sec.…”
Section: Policy Structurementioning
confidence: 99%
See 3 more Smart Citations