Learning Multirobot Hose Transportation and Deployment by Distributed Round-Robin Q-Learning

Fernández-Gauna, Borja; Etxeberria‐Agiriano, Ismael; Graña, Manuel

doi:10.1371/journal.pone.0127129

Cited by 16 publications

(5 citation statements)

References 23 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Regarding applications of multi-agent learning, there have been many studies using traditional MARL methods to solve various problems such as controlling a group of autonomous vehicles or drones [43], robot soccer [102], controlling traffic signals [75], coordinating collaborative bots in factories and warehouse [47], controlling electrical power networks [93] or optimizing distributed sensor networks [37], automated trading [88], machine bidding in competitive e-commerce and financial markets [9], resource management [44], transportation [21], and phenomena of social sciences [62]. Since the emergence of DQN [73], efforts to extend traditional RL to deep RL in the multi-agent domain have been found in the literature but they are still very limited (see Table 4 for applications available in the current literature).…”

Section: Conclusion and Research Directionsmentioning

confidence: 99%

Deep Reinforcement Learning for Multiagent Systems: A Review of Challenges, Solutions, and Applications

Nguyen

Nahavandi

2020

IEEE Trans. Cybern.

742

309

View full text Add to dashboard Cite

Reinforcement learning (RL) algorithms have been around for decades and employed to solve various sequential decision-making problems. These algorithms however have faced great challenges when dealing with high-dimensional environments. The recent development of deep learning has enabled RL methods to drive optimal policies for sophisticated and capable agents, which can perform efficiently in these challenging environments. This paper addresses an important aspect of deep RL related to situations that require multiple agents to communicate and cooperate to solve complex tasks. A survey of different approaches to problems related to multi-agent deep RL (MADRL) is presented, including non-stationarity, partial observability, continuous state and action spaces, multi-agent training schemes, multi-agent transfer learning. The merits and demerits of the reviewed methods will be analyzed and discussed, with their corresponding applications explored. It is envisaged that this review provides insights about various MADRL methods and can lead to future development of more robust and highly useful multi-agent learning methods for solving real-world problems.

show abstract

Section: Conclusion and Research Directionsmentioning

confidence: 99%

Deep Reinforcement Learning for Multiagent Systems: A Review of Challenges, Solutions, and Applications

Nguyen

Nahavandi

2020

IEEE Trans. Cybern.

742

309

View full text Add to dashboard Cite

show abstract

“…However, this research relied on a path planning algorithm and the function of the DQN algorithm was only to adjust the robots whenever a pre-planned trajectory is not accessible. In [29], a multi-agent reinforcement learning algorithm was proposed to deal with a hose transportation problem. The proposed algorithm was based on the original Q-learning, thus was not capable of continuous state inputs.…”

Section: Related Workmentioning

confidence: 99%

Decentralized Control of Multi-Robot System in Cooperative Object Transportation Using Deep Reinforcement Learning

et al. 2020

View full text Add to dashboard Cite

Object transportation could be a challenging problem for a single robot due to the oversize and/or overweight issues. A multi-robot system can take the advantage of increased driving power and more flexible configuration to solve such a problem. However, an increased number of individuals also changed the dynamics of the system which makes control of a multi-robot system more complicated. Even worse, if the whole system is sitting on a centralized decision making unit, the data flow could be easily overloaded due to the upscaling of the system. In this research, we propose a decentralized control scheme on a multi-robot system with each individual equipped with a deep Q-network (DQN) controller to perform an oversized object transportation task. DQN is a deep reinforcement learning algorithm, thus does not require the knowledge of system dynamics, instead, it enables the robots to learn appropriate control strategies through trial-and-error style interactions within the task environment. Since analogous controllers are distributed on the individuals, the computational bottleneck is avoided systematically. We demonstrate such a system in a scenario of carrying an oversized rod through a doorway by a two-robot team. The presented multi-robot system learns abstract features of the task and cooperative behaviors are observed. The decentralized DQN-style controller is showing strong robustness against uncertainties. In addition, We propose a universal metric to assess the cooperation quantitatively.

show abstract

“…Many decentralized methods have referenced the design of POMDP, varying reliance on schemes and can only handle intermittent communication resource scheduling. Reinforcement learning (RL) [ 15 ] is a paradigm to solve POMDP problems, and it is inspired by a learning theory which has good performance in multi-robots decision applications [ 16 , 17 ]. For most RL-based multi-agent systems, the rewards are more achieved by long-team learning, which is the expected accumulated reward that the agent expects to receive in the future under the policy, and can be specified by update value function.…”

Section: State Of the Artmentioning

confidence: 99%

Decentralized Opportunistic Spectrum Resources Access Model and Algorithm toward Cooperative Ad-Hoc Networks

Liu

Mohammed

2016

PLoS ONE

View full text Add to dashboard Cite

Limited communication resources have gradually become a critical factor toward efficiency of decentralized large scale multi-agent coordination when both system scales up and tasks become more complex. In current researches, due to the agent’s limited communication and observational capability, an agent in a decentralized setting can only choose a part of channels to access, but cannot perceive or share global information. Each agent’s cooperative decision is based on the partial observation of the system state, and as such, uncertainty in the communication network is unavoidable. In this situation, it is a major challenge working out cooperative decision-making under uncertainty with only a partial observation of the environment. In this paper, we propose a decentralized approach that allows agents cooperatively search and independently choose channels. The key to our design is to build an up-to-date observation for each agent’s view so that a local decision model is achievable in a large scale team coordination. We simplify the Dec-POMDP model problem, and each agent can jointly work out its communication policy in order to improve its local decision utilities for the choice of communication resources. Finally, we discuss an implicate resource competition game, and show that, there exists an approximate resources access tradeoff balance between agents. Based on this discovery, the tradeoff between real-time decision-making and the efficiency of cooperation using these channels can be well improved.

show abstract

Learning Multirobot Hose Transportation and Deployment by Distributed Round-Robin Q-Learning

Cited by 16 publications

References 23 publications

Deep Reinforcement Learning for Multiagent Systems: A Review of Challenges, Solutions, and Applications

Deep Reinforcement Learning for Multiagent Systems: A Review of Challenges, Solutions, and Applications

Decentralized Control of Multi-Robot System in Cooperative Object Transportation Using Deep Reinforcement Learning

Decentralized Opportunistic Spectrum Resources Access Model and Algorithm toward Cooperative Ad-Hoc Networks

Contact Info

Product

Resources

About