2019
DOI: 10.1609/aaai.v33i01.33012387
|View full text |Cite
|
Sign up to set email alerts
|

Bounded Suboptimal Search with Learned Heuristics for Multi-Agent Systems

Abstract: A wide range of discrete planning problems can be solved optimally using graph search algorithms. However, optimal search quickly becomes infeasible with increased complexity of a problem. In such a case, heuristics that guide the planning process towards the goal state can increase performance considerably. Unfortunately, heuristics are often unavailable or need manual and time-consuming engineering. Building upon recent results on applying deep learning to learn generalized reactive policies, we propose to l… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
13
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
3
3

Relationship

1
5

Authors

Journals

citations
Cited by 7 publications
(13 citation statements)
references
References 13 publications
0
13
0
Order By: Relevance
“…An important technique used in this work is to learn an imitation policy from an expert solver or human, such that this policy can be used online but with less solution time or improved generalization. Imitation learning has been widely used for various purposes, e.g., autonomous driving, robot motion control [12], [25], and multi-robot coordination [8], [26]. Most of the these work has a strong focus on learning low-level control policy from raw visual inputs, without considering high-level tasks.…”
Section: Imitation Learningmentioning
confidence: 99%
See 3 more Smart Citations
“…An important technique used in this work is to learn an imitation policy from an expert solver or human, such that this policy can be used online but with less solution time or improved generalization. Imitation learning has been widely used for various purposes, e.g., autonomous driving, robot motion control [12], [25], and multi-robot coordination [8], [26]. Most of the these work has a strong focus on learning low-level control policy from raw visual inputs, without considering high-level tasks.…”
Section: Imitation Learningmentioning
confidence: 99%
“…Most of the these work has a strong focus on learning low-level control policy from raw visual inputs, without considering high-level tasks. Furthermore, training data can be generated from a complete solver [8], [25], [27] or expert demonstrations [10]. They are commonly represented by deep neural networks (DNN) such as CNN [8], [26], GNN [25], and VAE [28].…”
Section: Imitation Learningmentioning
confidence: 99%
See 2 more Smart Citations
“…by using AVIN to generate an informed heuristic for A * . Different methods to combine search-and learning-based planners have been proposed in [6], [13] and [23].…”
Section: Planning 3d Locomotion With Footprint Considerationmentioning
confidence: 99%