2021
DOI: 10.1609/aaai.v35i7.16765
|View full text |Cite
|
Sign up to set email alerts
|

CMAX++ : Leveraging Experience in Planning and Execution using Inaccurate Models

Abstract: Given access to accurate dynamical models, modern planning approaches are effective in computing feasible and optimal plans for repetitive robotic tasks. However, it is difficult to model the true dynamics of the real world before execution, especially for tasks requiring interactions with objects whose parameters are unknown. A recent planning approach, CMAX, tackles this problem by adapting the planner online during execution to bias the resulting plans away from inaccurately modeled regions. CMAX, while bei… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
2
2

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(3 citation statements)
references
References 20 publications
0
3
0
Order By: Relevance
“…Therefore, with the same amount of resources, we should be able to learn a more accurate uncertainty model than a transition model. (Vemula et al 2020;Vemula, Bagnell, and Likhachev 2021) also mention the same intuition, which motivated their approach. To illustrate this with an example, we designed a toy environment called the 2-Way GridWorld.…”
Section: Learning the Uncertainty Vs Learning The Transition Functionmentioning
confidence: 83%
See 1 more Smart Citation
“…Therefore, with the same amount of resources, we should be able to learn a more accurate uncertainty model than a transition model. (Vemula et al 2020;Vemula, Bagnell, and Likhachev 2021) also mention the same intuition, which motivated their approach. To illustrate this with an example, we designed a toy environment called the 2-Way GridWorld.…”
Section: Learning the Uncertainty Vs Learning The Transition Functionmentioning
confidence: 83%
“…While previous approaches to using search with imperfect models exist (Vemula et al 2020;Vemula, Bagnell, and Likhachev 2021), to the best of our knowledge, there is no prior work that directly adapts MCTS to deal with model uncertainty. In our work, we define transition uncertainty as a measure of difference between the state transitions in the perfect model and in the model that is available to the agent.…”
Section: Introductionmentioning
confidence: 99%
“…Because LQR control can be seen as model‐dependent RL method, RL can also be interpreted as data‐driven LQR control. [29] presents a theoretical study for the performance of both ILC and misspecified model on LQR problems with unknown transition dynamics, where ILC can be seen as LQR under the worst case modeling error, that is to say ILC is also a model‐independent LQR.…”
Section: Introductionmentioning
confidence: 99%