DOI: 10.29007/s8jk

Learning to Plan from Raw Data in Grid-based Games

Abstract: An agent that autonomously learns to act in its environment must acquire a model of the domain dynamics. This can be a challenging task, especially in real-world domains, where observations are high-dimensional and noisy. Although in automated planning the dynamics are typically given, there are action schema learning approaches that learn symbolic rules (e.g. STRIPS or PDDL) to be used by traditional planners. However, these algorithms rely on logical descriptions of environment observations. In contrast, r…
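To make the target representation concrete, the following is a minimal, hypothetical PDDL action schema of the kind such action-schema learners aim to recover for a grid-based game. The domain, predicate, and action names are illustrative and do not come from the paper.

;; A minimal sketch, assuming an untyped STRIPS-style grid domain;
;; names (grid-game, at, adjacent, clear, move) are hypothetical.
(define (domain grid-game)
  (:requirements :strips)
  (:predicates
    (at ?c)           ; the agent occupies cell ?c
    (adjacent ?a ?b)  ; cells ?a and ?b are neighbouring grid cells
    (clear ?c))       ; cell ?c contains no obstacle
  (:action move
    :parameters (?from ?to)
    :precondition (and (at ?from) (adjacent ?from ?to) (clear ?to))
    :effect (and (at ?to) (not (at ?from)))))

A classical planner can use such a schema directly; the learning problem sketched in the abstract is to induce the preconditions and effects from raw, high-dimensional observations rather than from logical state descriptions.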

Cited by 3 publications (5 citation statements)
References 15 publications
“…The experimental results show that SPOTTER achieved higher overall rewards than the baselines in the given time frame, and did so more quickly. Crucially, the agent learned the missing operator for moving the blue ball out of the way in Level 2 and was immediately able to use this operator in Level 3. This is demonstrated both by the fact that the agent did not experience any drop in performance when transitioning to Level 3, and by the fact that, as we know from running the experiment, the agent did not enter learn or gen-precon in Level 3.…”
Section: Methods (mentioning)
confidence: 99%
“…Accordingly, we did not compare against any deep RL baselines. We also did not compare transfer learning and curriculum learning approaches as these approaches… [Footnote 7: Code implementing SPOTTER and the baselines along with experiments will be made available post-review.] [Footnote 8: In the supplementary material, we provide the learned operator described in PDDL, learning curves for the baselines over 2,000,000 episodes, and videos showing SPOTTER's integrated planning and learning.]…”
Section: Methods (mentioning)
confidence: 99%
“…Neural networks have also been applied to other aspects of planning. For instance, (Dittadi et al., 2018) trains a NN that learns a planning domain just from visual observations, assuming that actions have local preconditions and effects. The learned domain is generalizable across different problems of the same domain and, thus, can be used by a planner to solve these problems.…”
Section: Related Work (mentioning)
confidence: 99%
“…Neural networks have also been applied to other aspects of planning. For instance, (Dittadi, Bolander, and Winther 2018) trains a NN that learns a planning domain just from visual observations, assuming that actions have local preconditions and effects. The learnt domain is generalizable across different problems of the same domain and, thus, can be used by a planner to solve these problems.…”
Section: Related Work (mentioning)
confidence: 99%
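As a concrete reading of the "local preconditions and effects" assumption mentioned in the two statements above, here is a hypothetical push operator in the same PDDL style: every cell it references is adjacent to the previous one, so the rule depends only on a local patch of the observation. The predicate and action names are again illustrative, and the collinearity constraint of a real pushing action is omitted for brevity.

;; Hypothetical Sokoban-style operator with purely local scope;
;; ball-at and push are illustrative names, not taken from the paper.
(:action push
  :parameters (?agent ?ball ?dest)
  :precondition (and (at ?agent)
                     (ball-at ?ball)
                     (adjacent ?agent ?ball)
                     (adjacent ?ball ?dest)
                     (clear ?dest))
  :effect (and (at ?ball) (not (at ?agent))         ; agent steps into the ball's cell
               (ball-at ?dest) (not (ball-at ?ball)))) ; ball is pushed one cell onward

Because the precondition and effect mention only cells reachable within two adjacency steps of the agent, an operator like this can in principle be induced from local windows of the raw observation, which is the property the cited work exploits to generalize the learned domain across problems.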