2019
DOI: 10.1109/lra.2019.2903261
|View full text |Cite
|
Sign up to set email alerts
|

PRIMAL: Pathfinding via Reinforcement and Imitation Multi-Agent Learning

Abstract: Multi-agent path finding (MAPF) is an essential component of many large-scale, real-world robot deployments, from aerial swarms to warehouse automation. However, despite the community's continued efforts, most state-of-the-art MAPF planners still rely on centralized planning and scale poorly past a few hundred agents. Such planning approaches are maladapted to real-world deployments, where noise and uncertainty often require paths be recomputed online, which is impossible when planning times are in seconds to … Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
180
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 242 publications
(197 citation statements)
references
References 28 publications
(57 reference statements)
1
180
0
Order By: Relevance
“…Also, based on the cooperation method used whether it's a centralized, distributed or decentralized, the team of robots need to maintain communication especially if the team shares information. Most of the reviewed work assumes perfect communication and utilizes centralized type of cooperation as presented in [3,20,44,70] which is subject to scalability, overhead and single point of failure problems. Some of the work presented a decentralized CPP approach for multi-robot systems as presented in [8,52,54,70].…”
Section: Discussion and Future Research Directionsmentioning
confidence: 99%
See 2 more Smart Citations
“…Also, based on the cooperation method used whether it's a centralized, distributed or decentralized, the team of robots need to maintain communication especially if the team shares information. Most of the reviewed work assumes perfect communication and utilizes centralized type of cooperation as presented in [3,20,44,70] which is subject to scalability, overhead and single point of failure problems. Some of the work presented a decentralized CPP approach for multi-robot systems as presented in [8,52,54,70].…”
Section: Discussion and Future Research Directionsmentioning
confidence: 99%
“…The common assumption in literature is that communication is unlimited in range and bandwidth. Nearly all centralized systems assume that the individual robots can communicate directly with the central controller such as [44,70], and algorithms that create maps assume global communication [53,57]. Since communication systems are often down in the aftermath of a disaster [51,61], achieving coverage with limited communication must be a critical aspect that must to be considered.…”
Section: Communication and Task Allocationmentioning
confidence: 99%
See 1 more Smart Citation
“…In addition to imitation learning, modern reinforcement learning methods have also been used to generate MRS behavior [11]. The approach in [12] combines both reinforcement learning and imitation learning for extracting multi-agent navigation policies. The approach can deal with partially-observable domains, variable team-sizes, as well as complex environments and mazes.…”
Section: Related Workmentioning
confidence: 99%
“…Since the vehicles share the same environment, their coordination [13] and cooperation [14] is central when solving the planning problem. The methods presented in [14,15], and [16] are examples of multi-vehicle path planning algorithms that use artificial intelligence-based approaches (primarily reinforcement learning), to solve the problem of planning in multi-vehicle systems, demonstrating strong scalability properties [16].…”
Section: Path Planningmentioning
confidence: 99%