Bounded Suboptimal Search with Learned Heuristics for Multi-Agent Systems

Spies, Markus; Todescato, Marco; Becker, Hannes; Kesper, Patrick; Waniek, Nicolai; Guo, Meng

doi:10.1609/aaai.v33i01.33012387

Cited by 7 publications

(13 citation statements)

References 13 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…An important technique used in this work is to learn an imitation policy from an expert solver or human, such that this policy can be used online but with less solution time or improved generalization. Imitation learning has been widely used for various purposes, e.g., autonomous driving, robot motion control [12], [25], and multi-robot coordination [8], [26]. Most of the these work has a strong focus on learning low-level control policy from raw visual inputs, without considering high-level tasks.…”

Section: Imitation Learningmentioning

confidence: 99%

“…Most of the these work has a strong focus on learning low-level control policy from raw visual inputs, without considering high-level tasks. Furthermore, training data can be generated from a complete solver [8], [25], [27] or expert demonstrations [10]. They are commonly represented by deep neural networks (DNN) such as CNN [8], [26], GNN [25], and VAE [28].…”

Section: Imitation Learningmentioning

confidence: 99%

“…Furthermore, training data can be generated from a complete solver [8], [25], [27] or expert demonstrations [10]. They are commonly represented by deep neural networks (DNN) such as CNN [8], [26], GNN [25], and VAE [28]. High-dimensional sensory inputs such as images in [12], [26] or point clouds in [20] enable direct reasoning over raw inputs, while direct state information such as object poses in [25] provides easy interfaces to the existing motion planners.…”

Section: Imitation Learningmentioning

confidence: 99%

“…Due to the repetitive nature of industrial processes, an intriguing question to ask is "Can we solve the problem faster after solving it 100 times?". Thus some recent work proposes to learn such heuristics from expert solutions or via reinforcement learning, see e.g., [8], [9], [10]. However, most of these approaches require enormous amount of training data even for relatively simple tasks.…”

Section: Introductionmentioning

confidence: 99%

See 3 more Smart Citations

Geometric Task Networks: Learning Efficient and Explainable Skill Coordination for Object Manipulation

Guo¹,

Bürger²

2021

Preprint

Self Cite

View full text Add to dashboard Cite

Complex manipulation tasks can contain various execution branches of primitive skills in sequence or in parallel under different scenarios. Manual specifications of such branching conditions and associated skill parameters are not only error-prone due to corner cases but also quickly untraceable given a large number of objects and skills. On the other hand, learning from demonstration has increasingly shown to be an intuitive and effective way to program such skills for industrial robots. Parameterized skill representations allow generalization over new scenarios, which however makes the planning process much slower thus unsuitable for online applications. In this work, we propose a hierarchical and compositional planning framework that learns a Geometric Task Network (GTN) from exhaustive planners, without any manual inputs. A GTN is a goal-dependent task graph that encapsulates both the transition relations among skill representations and the geometric constraints underlying these transitions. This framework has shown to improve dramatically the offline learning efficiency, the online performance and the transparency of decision process, by leveraging the taskparameterized models. We demonstrate the approach on a 7-DoF robot arm both in simulation and on hardware solving various manipulation tasks.

show abstract

Section: Imitation Learningmentioning

confidence: 99%

Section: Imitation Learningmentioning

confidence: 99%

Section: Imitation Learningmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

Geometric Task Networks: Learning Efficient and Explainable Skill Coordination for Object Manipulation

Guo¹,

Bürger²

2021

Preprint

Self Cite

View full text Add to dashboard Cite

show abstract

“…by using AVIN to generate an informed heuristic for A * . Different methods to combine search-and learning-based planners have been proposed in [6], [13] and [23].…”

Section: Planning 3d Locomotion With Footprint Considerationmentioning

confidence: 99%

Value Iteration Networks on Multiple Levels of Abstraction

Schleich¹,

Klamt²,

Behnke³

2019

Robotics: Science and Systems XV

View full text Add to dashboard Cite

Learning-based methods are promising to plan robot motion without performing extensive search, which is needed by many non-learning approaches. Recently, Value Iteration Networks (VINs) received much interest since-in contrast to standard CNN-based architectures-they learn goal-directed behaviors which generalize well to unseen domains. However, VINs are restricted to small and low-dimensional domains, limiting their applicability to real-world planning problems.To address this issue, we propose to extend VINs to representations with multiple levels of abstraction. While the vicinity of the robot is represented in sufficient detail, the representation gets spatially coarser with increasing distance from the robot. The information loss caused by the decreasing resolution is compensated by increasing the number of features representing a cell. We show that our approach is capable of solving significantly larger 2D grid world planning tasks than the original VIN implementation. In contrast to a multiresolution coarse-to-fine VIN implementation which does not employ additional descriptive features, our approach is capable of solving challenging environments, which demonstrates that the proposed method learns to encode useful information in the additional features. As an application for solving real-world planning tasks, we successfully employ our method to plan omnidirectional driving for a search-and-rescue robot in cluttered terrain.

show abstract

K-Focal Search for Slow Learned Heuristics

Greco,

Toro,

Hernández

et al. 2024

IEEE Access

View full text Add to dashboard Cite

Bounded suboptimal heuristic search is a family of search algorithms capable of solving hard combinatorial problems, returning suboptimal solutions within a given bound. Recent machine learning approaches have been shown to learn accurate heuristic functions. Learned heuristics, however, are slow to compute; concretely, given a single search state s and a learned heuristic h, evaluating h(s) is typically very slow relative to expansion time, since state-of-the-art learned heuristics are implemented as neural networks. However, by using a Graphics Processing Unit (GPU), it is possible to compute heuristics using batched computation. Existing approaches to batched heuristic computation are specific to satisficing search and have not studied the problem in the context of bounded-suboptimal search. In this paper, we present K-Focal Search, a bounded suboptimal search algorithm that in each iteration expands K states from the FOCAL list and computes the learned heuristic values of the successors using a GPU. We experiment over the 24puzzle and Rubik's Cube using DeepCubeA, a very effective and inadmissible learned heuristic. Our results show that K-Focal Search benefits both from batched computation and from the diversity in the search introduced by its expansion strategy. Over standard Focal Search, K-Focal Search improves runtime by a factor of 6, expansions by up to three orders of magnitude, and finds better quality solutions, keeping the theoretical guarantees of Focal Search.

show abstract

Bounded Suboptimal Search with Learned Heuristics for Multi-Agent Systems

Cited by 7 publications

References 13 publications

Geometric Task Networks: Learning Efficient and Explainable Skill Coordination for Object Manipulation

Geometric Task Networks: Learning Efficient and Explainable Skill Coordination for Object Manipulation

Value Iteration Networks on Multiple Levels of Abstraction

K-Focal Search for Slow Learned Heuristics

Contact Info

Product

Resources

About