2021
DOI: 10.48550/arxiv.2106.15691
Preprint

Multiagent Deep Reinforcement Learning: Challenges and Directions Towards Human-Like Approaches

Abstract: This paper surveys the field of multiagent deep reinforcement learning. The combination of deep neural networks with reinforcement learning has gained increased traction in recent years and is slowly shifting the focus from single-agent to multiagent environments. Dealing with multiple agents is inherently more complex as (a) the future rewards depend on multiple players' joint actions and (b) the computational complexity of functions increases. We present the most common multiagent problem representations and…

Cited by 6 publications (8 citation statements)
References 67 publications
“…This section develops a solution to the joint beamforming and codebook design problem by leveraging the powerful exploration capability of MA-DRL to find a near-optimal solution over the huge search space mentioned above. MA-DRL differs from its single-agent counterpart in that the agents cooperate and act jointly to achieve a common ultimate reward [15]. MA-DRL is especially suitable for complex problems that can be decomposed into sub-problems, each of which is handled by a single DRL agent.…”
Section: Multi-agent Deep Reinforcement Learning Based Joint Beamform… (mentioning)
confidence: 99%
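The decomposition the statement describes, several agents acting on sub-problems while sharing one ultimate reward, can be made concrete with a deliberately minimal sketch. The Python snippet below is a hypothetical toy, not the cited beamforming system: two independent tabular learners (stand-ins for deep networks) each control one sub-decision, and both receive the same reward only when the joint action hits an assumed target pair. The action sizes, target, and bandit-style update are all illustrative assumptions.

```python
import numpy as np

# Minimal sketch: two independent learners share one "ultimate" reward,
# illustrating cooperative sub-problem decomposition. Tabular Q-values
# stand in for deep networks; the setup is illustrative, not the
# cited paper's beamforming/codebook formulation.

rng = np.random.default_rng(0)
N_ACTIONS = 4          # hypothetical: agent 0 picks a beam, agent 1 a codeword
TARGET = (2, 1)        # hypothetical jointly optimal pair

q = [np.zeros(N_ACTIONS) for _ in range(2)]  # one value table per agent
alpha, eps = 0.1, 0.2

def shared_reward(joint_action):
    # Both agents receive the same scalar reward for the joint action.
    return 1.0 if joint_action == TARGET else 0.0

for _ in range(5000):
    # Each agent acts independently (epsilon-greedy on its own values).
    joint = tuple(
        int(rng.integers(N_ACTIONS)) if rng.random() < eps else int(np.argmax(qi))
        for qi in q
    )
    r = shared_reward(joint)
    # Each agent updates toward the common reward (stateless bandit update).
    for qi, a in zip(q, joint):
        qi[a] += alpha * (r - qi[a])

print([int(np.argmax(qi)) for qi in q])  # converges to the target pair
```

Because the reward is shared, each agent's values are meaningful only relative to what the other agent currently does; this coordination dependence is exactly what distinguishes MA-DRL from its single-agent counterpart.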
“…At each timestep t, each agent attempts to maximize its own reward by learning a policy that optimizes the total expected discounted future reward. We refer the reader to high-quality reviews that have been written on MARL (Hernandez-Leal et al., 2019; Nguyen et al., 2020; Wong et al., 2021). Here, we highlight that, among others, low sample efficiency is one of the greatest challenges for MARL, as millions of interactions with the environment are usually needed for agents to learn.…”
Section: Steps Toward Social Neuro AI (mentioning)
confidence: 99%
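The objective referred to here is the standard per-agent discounted return. In conventional MARL notation (not taken verbatim from either paper), agent $i$ maximizes

$$
G_{i,t} = \sum_{k=0}^{\infty} \gamma^{k}\, r_{i,\,t+k+1},
\qquad
\pi_i^{*} = \arg\max_{\pi_i}\; \mathbb{E}_{\pi_i,\,\pi_{-i}}\!\left[ G_{i,t} \right],
$$

where $\gamma \in [0,1)$ is the discount factor and $\pi_{-i}$ denotes the other agents' policies. The expectation over $\pi_{-i}$ is precisely why each agent's future rewards depend on the joint actions, as the surveyed paper's abstract notes.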
“…We consider the distributed controller proposed in [47], consisting of a number of fully-connected, feed-forward Artificial Neural Networks (ANNs), one for every voxel. In particular, we adopt the "homogeneous" variant presented in [49], where the ANNs share the same parameters: Medvet et al. [49] showed that such a homogeneous representation is comparable to one where parameters differ for every ANN, with the additional benefit of a more compact search space, similar to what happens in most multi-agent reinforcement learning systems [76]. Moreover, parameter sharing makes the controller agnostic with respect to the morphology, placing us in a good position to test generalization to unseen morphologies.…”
Section: Controller (mentioning)
confidence: 99%
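Parameter sharing as described here is easy to make concrete. The sketch below is an illustrative assumption, not the cited controller's actual architecture: a single weight set defines one small MLP that is applied to every module's local observation, so the search space does not grow with the number of modules and the same controller runs unchanged on morphologies of any size.

```python
import numpy as np

# Minimal sketch of parameter sharing: one set of weights drives every
# module (voxel/agent), so the controller is independent of how many
# modules the morphology has. Layer sizes and the tanh MLP are
# illustrative assumptions, not the cited architecture.

rng = np.random.default_rng(0)
OBS, HID, ACT = 8, 16, 1   # per-module observation/hidden/action sizes

# A single shared parameter set, i.e. one MLP reused everywhere.
W1 = rng.normal(0.0, 0.1, (HID, OBS))
b1 = np.zeros(HID)
W2 = rng.normal(0.0, 0.1, (ACT, HID))
b2 = np.zeros(ACT)

def shared_policy(obs):
    # The same weights applied to one module's local observation.
    h = np.tanh(W1 @ obs + b1)
    return np.tanh(W2 @ h + b2)

def control(all_obs):
    # Apply the identical network to every module, whatever their number:
    # the search space stays |theta|, not |theta| * n_modules.
    return np.stack([shared_policy(o) for o in all_obs])

# Works unchanged for morphologies with different numbers of modules.
print(control(rng.normal(size=(5, OBS))).shape)   # (5, 1)
print(control(rng.normal(size=(9, OBS))).shape)   # (9, 1)
```

The compactness benefit mentioned in the quote falls out directly: the number of trainable parameters is fixed by the shared network, regardless of the number of voxels.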
“…Modularity is also ubiquitous in the field of artificial intelligence: it appears in Graph Neural Networks (GNNs) [61], Cellular Automata (CA) [51], and multi-agent systems [76]. In evolutionary robotics, Voxel-based Soft Robots (VSRs) [29] are simulated aggregations of mechanically identical elastic blocks: as such, they have emerged as a relevant formalism for modeling state-of-the-art robotic systems, e.g., soft robots [59].…”
Section: Introduction (mentioning)
confidence: 99%