2016
DOI: 10.1016/j.artint.2016.07.004
|View full text |Cite
|
Sign up to set email alerts
|

A synthesis of automated planning and reinforcement learning for efficient, robust decision-making

Abstract: Automated planning and reinforcement learning are characterized by complementary views on decision making: the former relies on previous knowledge and computation, while the latter on interaction with the world, and experience. Planning allows robots to carry out different tasks in the same domain, without the need to acquire knowledge about each one of them, but relies strongly on the accuracy of the model. Reinforcement learning, on the other hand, does not require previous knowledge, and allows robots to ro… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
61
0

Year Published

2017
2017
2022
2022

Publication Types

Select...
4
4
1

Relationship

2
7

Authors

Journals

citations
Cited by 85 publications
(61 citation statements)
references
References 21 publications
0
61
0
Order By: Relevance
“…In [9], Leonetti et al investigated a low level integration of RL and external controllers where the RL algorithm only explores with feasible actions provided by the planner, these heuristics can not be discarded, both for training and testing. Therefore, the performance of the learner very depends on, if not limited by, the capability of the heuristics.…”
Section: Related Workmentioning
confidence: 99%
“…In [9], Leonetti et al investigated a low level integration of RL and external controllers where the RL algorithm only explores with feasible actions provided by the planner, these heuristics can not be discarded, both for training and testing. Therefore, the performance of the learner very depends on, if not limited by, the capability of the heuristics.…”
Section: Related Workmentioning
confidence: 99%
“…The integration of symbolic planning with reinforcement learning has been studied in a variety of approaches [12,22,29,31]. These methods focus on leveraging the strengths of one of the paradigms to enhance the other.…”
Section: Related Workmentioning
confidence: 99%
“…4. (lines 8,13,14) Suppose that the condition on line 8 is false. We then do not create a new state.…”
Section: (Line 6-7)mentioning
confidence: 99%