2016
DOI: 10.1177/0278364916679611
Policy search for multi-robot coordination under uncertainty

Abstract: We introduce a principled method for multi-robot coordination based on a general model (termed a MacDec-POMDP) of multi-robot cooperative planning in the presence of stochasticity, uncertain sensing, and communication limitations. A new MacDec-POMDP planning algorithm is presented that searches over policies represented as finite-state controllers, rather than the previous policy tree representation. Finite-state controllers can be much more concise than trees, are much easier to interpret, and can operate ove…
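The finite-state controller representation described in the abstract maps each controller node to an action and transitions between nodes on observations, so a fixed-size controller can represent an infinite-horizon policy. Below is a minimal illustrative sketch of such a controller for a single agent; all names, actions, and observations are hypothetical and not taken from the paper.

```python
# Minimal sketch of a finite-state controller (FSC) policy for one agent.
# The structure, node labels, actions, and observations are illustrative
# assumptions, not the paper's implementation.

class FiniteStateController:
    def __init__(self, action_map, transition_map, start_node=0):
        self.action_map = action_map          # node -> action to execute
        self.transition_map = transition_map  # (node, observation) -> next node
        self.node = start_node

    def act(self, observation=None):
        # On the first step there is no observation yet.
        if observation is not None:
            self.node = self.transition_map[(self.node, observation)]
        return self.action_map[self.node]

# Two-node controller: search until the target is seen, then approach it.
fsc = FiniteStateController(
    action_map={0: "search", 1: "approach"},
    transition_map={
        (0, "target-seen"): 1,
        (0, "nothing"): 0,
        (1, "target-seen"): 1,
        (1, "nothing"): 0,
    },
)
print(fsc.act())               # search (start node)
print(fsc.act("target-seen"))  # approach
print(fsc.act("nothing"))      # search
```

Because the controller's memory is just its current node, its size is fixed regardless of horizon length, which is the conciseness advantage over policy trees that the abstract highlights.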


Cited by 46 publications (43 citation statements)
References 26 publications
“…Because tree-based representations become intractable as the horizon grows, we also developed multiple methods for optimizing finite-state controllers in macro-action Dec-POMDPs using the MacDec-POMDP [Amato et al, 2015a] and Dec-POSMDP [Omidshafiei et al, 2015] models. Some of these approaches can provide solutions with only a high-level model of the macro-actions (i.e., distributions over time and outcomes) instead of a full model of the underlying Dec-POMDP [Amato et al, 2015a; 2017; 2017a]. Another approach automatically generates the macro-actions from low-level (continuous) dynamics models, while another method generates (macro-)observations from low-level sensor (e.g., camera) information [Omidshafiei et al, 2017b].…”
Section: Macro-action-based Methods
confidence: 99%
“…We call this model a MacDec-POMDP [Amato et al, 2014] when the low-level Dec-POMDP model and the policies of the macro-actions are known, and a decentralized partially observable semi-Markov decision process (Dec-POSMDP) when a high-level model is defined which includes time to completion [Amato et al, 2015a] (but a simulator can be used in place of a model in each case). While these high-level models still include the states of the Dec-POMDP, they do not include the Dec-POMDP actions and observations.…”
Section: Macro-actions in Dec-POMDPs
confidence: 99%
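The macro-actions referred to in the excerpt above are temporally extended actions: a low-level policy executed until a termination condition holds. The following is a generic, hypothetical sketch of that idea; the class names, the example "go-to-door" macro, and the toy environment are assumptions for illustration, not the models defined in the cited work.

```python
# Illustrative macro-action: a low-level policy plus a termination test.
# All names and the toy environment are hypothetical.

class MacroAction:
    def __init__(self, name, low_level_policy, terminates):
        self.name = name
        self.policy = low_level_policy  # low-level obs -> primitive action
        self.terminates = terminates    # low-level obs -> bool

def run_macro_action(macro, env_step, obs, max_steps=100):
    # Execute primitive actions until the termination condition or step cap;
    # the number of steps taken is the macro-action's (stochastic) duration.
    steps = 0
    while not macro.terminates(obs) and steps < max_steps:
        obs = env_step(obs, macro.policy(obs))
        steps += 1
    return obs, steps

# Toy example: drive forward until the distance to a door reaches zero.
macro = MacroAction(
    name="go-to-door",
    low_level_policy=lambda dist: "forward",
    terminates=lambda dist: dist == 0,
)
final_dist, steps = run_macro_action(
    macro, env_step=lambda dist, action: dist - 1, obs=3
)
# final_dist == 0, steps == 3
```

A high-level model in the sense of the excerpt would then describe only the distribution over `steps` and terminal outcomes for each macro-action, without exposing the primitive actions and observations inside the loop.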
“…An exploration algorithm determines the shortest, lowest-cost path for a robot based on the environment's size and type, the cell size, and the number of robots in the exploration team, and updates the environment map with the gathered information [3, 24]. A set of common exploration algorithms will be discussed in section 4.…”
Section: Exploration Algorithm
confidence: 99%
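A common building block of the exploration algorithms described in that excerpt is finding the shortest path from a robot's cell to the nearest unexplored cell of an occupancy grid. The sketch below uses breadth-first search for that step; it is a generic illustration under assumed grid conventions, not the algorithm of the cited paper.

```python
# Illustrative frontier search: BFS on an occupancy grid for the shortest
# path from a robot's cell to the nearest unexplored cell. The grid
# encoding and helper names are assumptions for illustration.
from collections import deque

FREE, UNKNOWN, WALL = 0, 1, 2

def shortest_path_to_frontier(grid, start):
    rows, cols = len(grid), len(grid[0])
    queue = deque([(start, [start])])
    seen = {start}
    while queue:
        (r, c), path = queue.popleft()
        if grid[r][c] == UNKNOWN:
            return path  # first unknown cell reached = nearest frontier
        for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            nr, nc = r + dr, c + dc
            if (0 <= nr < rows and 0 <= nc < cols
                    and grid[nr][nc] != WALL and (nr, nc) not in seen):
                seen.add((nr, nc))
                queue.append(((nr, nc), path + [(nr, nc)]))
    return None  # no unexplored cells remain: map fully explored

grid = [
    [FREE, FREE, WALL],
    [FREE, WALL, UNKNOWN],
    [FREE, FREE, FREE],
]
path = shortest_path_to_frontier(grid, (0, 0))
# path ends at the unknown cell (1, 2), routed around the walls
```

Because BFS expands cells in order of distance, the first unknown cell dequeued is guaranteed to be the nearest frontier, which matches the "shortest path" criterion in the quoted passage (cost-weighted variants would substitute Dijkstra's algorithm).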