2021
DOI: 10.3390/app112210870
Applications of Multi-Agent Deep Reinforcement Learning: Models and Algorithms

Abstract: Recent advancements in deep reinforcement learning (DRL) have led to its application in multi-agent scenarios to solve complex real-world problems, such as network resource allocation and sharing, network routing, and traffic signal controls. Multi-agent DRL (MADRL) enables multiple agents to interact with each other and with their operating environment, and learn without the need for external critics (or teachers), thereby solving complex problems. Significant performance enhancements brought about by the use…
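As a rough, hedged illustration of the setting the abstract describes (multiple agents interacting with a shared environment and learning from their own reward signals, with no external critic or teacher), below is a minimal sketch using tabular independent Q-learning on a toy two-agent coordination game. The game, agent count, and hyperparameters are illustrative assumptions and are not taken from the paper.

```python
# Minimal sketch (illustrative only): two independent tabular Q-learners in a toy
# coordination game. Each agent updates from its own reward signal, with no
# external critic or teacher, mirroring the MADRL setting described above.
# The game, agent count, and hyperparameters are assumptions, not from the paper.
import random
from collections import defaultdict

N_AGENTS = 2
ACTIONS = [0, 1]
EPISODES = 5000
ALPHA = 0.1      # learning rate
EPSILON = 0.1    # exploration probability

def joint_rewards(actions):
    # Toy cooperative game: every agent gets reward 1 if all agents agree.
    return [1.0 if len(set(actions)) == 1 else 0.0] * N_AGENTS

# One Q-table per agent; the game has a single state, so each table maps action -> value.
q_tables = [defaultdict(float) for _ in range(N_AGENTS)]

for _ in range(EPISODES):
    actions = []
    for q in q_tables:
        if random.random() < EPSILON:
            actions.append(random.choice(ACTIONS))            # explore
        else:
            actions.append(max(ACTIONS, key=lambda a: q[a]))  # exploit
    for q, a, r in zip(q_tables, actions, joint_rewards(actions)):
        q[a] += ALPHA * (r - q[a])   # each agent learns only from its own reward

print([{a: round(q[a], 2) for a in ACTIONS} for q in q_tables])
```

With these toy settings, both agents typically converge on the same action; the sketch is only meant to show the interaction-and-update loop, not any of the algorithms surveyed in the paper.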

Cited by 13 publications (8 citation statements) | References 75 publications
“…9(c) and Fig. 9(d), when |N| ∈ [5,20], we can see that the Propose-MDP scheme has the highest average revenue and profit, followed by the proposed scheme with χ = 4, 3, 2, the NoPre-SA, NoPre-DRL, NoPre-Reset schemes, the proposed scheme with χ = 1, and finally the NoPre-Random scheme. Note that the performance of the Propose-MDP and NoPre-SA schemes decreases significantly with N, which demonstrates that exhaustive search-based schemes struggle to handle scenarios with large numbers of BSs.…”
Section: Evaluation Results Versus Different Penalties
confidence: 90%
“…For instance, the authors in [19] discussed the effect of noise on the observation of agents and tried to extract real and complete observations from the original ones. In [20] and [21], the authors studied the problem of unstable feedback of agents when dealing with highly dynamic environments. In addition, the combination of future information prediction and DRL has also been extensively studied [32].…”
Section: B. Slice Migration and Resource Allocation
confidence: 99%
“…Deep learning using artificial neural networks (ANN) is increasingly used in the field of NMR with significant growth in the last few years (Cobas, 2020). Different algorithms are applied for classification or regression problems (Ibrahim et al., 2021; Schartner et al., 2023; Wang et al., 2020). The aim of this paper was to trace the geographical origin by using the deep learning classification of sparkling wines based on their ICP-MS and DOSY NMR spectra represented in the reduced space.…”
Section: Introduction
confidence: 99%
“…MADRL extends the functions of DRL with MARL. MADRL enables multiple agents to interact with an environment to solve complex problems that the traditional DRL technique is not able to handle [10], particularly with distributed learning systems.…”
Section: List of Abbreviations
confidence: 99%
“…MARL is a group of agents that interact with the operating environment and with each other to achieve their goals [10]. MADRL extends the functions of RL and MARL with deep learning [10].…”
Section: Multi-Agent Deep Reinforcement Learning
confidence: 99%
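To make the distinction drawn in the last two statements concrete (MARL as multiple agents interacting with the environment and each other, MADRL as the same setting with deep networks as function approximators), here is a hedged PyTorch sketch in which each agent owns its own small Q-network and learns from its own observations and rewards. The toy environment, network sizes, and one-step update rule are illustrative assumptions, not the models or algorithms surveyed in the paper.

```python
# Hedged sketch: MADRL = MARL + deep function approximation.
# Each agent owns a small Q-network and updates it from its own observation and
# reward; the toy environment, sizes, and one-step target are illustrative only.
import torch
import torch.nn as nn
import torch.optim as optim

OBS_DIM, N_ACTIONS, N_AGENTS, GAMMA = 4, 2, 2, 0.9

def make_qnet():
    # Per-agent Q-network, replacing the Q-table of tabular MARL.
    return nn.Sequential(nn.Linear(OBS_DIM, 32), nn.ReLU(), nn.Linear(32, N_ACTIONS))

agents = [make_qnet() for _ in range(N_AGENTS)]
optimizers = [optim.Adam(a.parameters(), lr=1e-3) for a in agents]

def toy_env_step(actions):
    # Placeholder environment: random next observations, shared cooperative reward.
    next_obs = [torch.randn(OBS_DIM) for _ in range(N_AGENTS)]
    reward = 1.0 if len(set(actions)) == 1 else 0.0   # reward joint coordination
    return next_obs, reward

obs = [torch.randn(OBS_DIM) for _ in range(N_AGENTS)]
for step in range(200):
    with torch.no_grad():
        actions = [int(net(o).argmax()) for net, o in zip(agents, obs)]
    next_obs, reward = toy_env_step(actions)
    for net, opt, o, o_next, a in zip(agents, optimizers, obs, next_obs, actions):
        q_pred = net(o)[a]
        with torch.no_grad():
            # One-step bootstrapped target from the agent's own next observation.
            target = reward + GAMMA * net(o_next).max()
        loss = (q_pred - target) ** 2
        opt.zero_grad()
        loss.backward()
        opt.step()
    obs = next_obs
```

Coordinated methods from the MADRL literature (for example, centralized training with decentralized execution) would replace the fully independent updates shown in this sketch.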