Distributed Deep Reinforcement Learning Resource Allocation Scheme For Industry 4.0 Device-To-Device Scenarios

Romero, Jesus Burgueno; Adeogun, Ramoni; Bruun, Rasmus Liborius; Morejon, Santiago; de-la-Bandera, Isabel; Barco, Raquel

doi:10.1109/vtc2021-fall52928.2021.9625582

Cited by 5 publications

(4 citation statements)

References 15 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Samples for updating the DDQN weights are then drawn randomly from the buffer thereby eliminating correlations between successive samples. The agents are trained using the reward function in (11) with ζ target = 0 bps/Hz.…”

Section: B Ddqn Design and Training Proceduresmentioning

confidence: 99%

“…To overcome this limitation, algorithms for resource allocation have been traditionally based on hard-coded heuristics [5] or using optimization techniques such as game theory [6], genetic algorithm [7] and geometric programming [8]. Over the last few years, the focus appears to have shifted towards machine learning-based algorithms [4] resulting in a large number of published works applying supervised [9], unsupervised [10] and reinforcement learning techniques [11] for resource allocation in different types of wireless systems.…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Distributed Channel Allocation for Mobile 6G Subnetworks via Multi-Agent Deep Q-Learning

Adeogun

Berardinelli

2023

2023 IEEE Wireless Communications and Networking Conference (WCNC)

Self Cite

View full text Add to dashboard Cite

Sixth generation (6G) in-X subnetworks are recently proposed as short-range low-power radio cells for supporting localized extreme wireless connectivity inside entities such as industrial robots, vehicles, and the human body. The deployment of in-X subnetworks in these entities may lead to fast changes in the interference level and hence, varying risks of communication failure. In this paper, we investigate fully distributed resource allocation for interference mitigation in dense deployments of 6G in-X subnetworks. Resource allocation is cast as a multiagent reinforcement learning problem and agents are trained in a simulated environment to perform channel selection with the goal of maximizing the per-subnetwork rate subject to a target rate constraint for each device. To overcome the slow convergence and performance degradation issues associated with fully distributed learning, we adopt a centralized training procedure involving local training of a deep Q-network (DQN) at a central location with measurements obtained at all subnetworks. The policy is implemented using Double Deep Q-Network (DDQN) due to its ability to enhance training stability and convergence. Performance evaluation results in an in-factory environment indicated that the proposed method can achieve up to 19% rate increase relative to random allocation and is only marginally worse than complex centralized benchmarks.

show abstract

Section: B Ddqn Design and Training Proceduresmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

Distributed Channel Allocation for Mobile 6G Subnetworks via Multi-Agent Deep Q-Learning

Adeogun

Berardinelli

2023

2023 IEEE Wireless Communications and Networking Conference (WCNC)

Self Cite

View full text Add to dashboard Cite

show abstract

“…Reinforcement learning (RL) is adopted to resolve the joint user association and channel assignment problem. The authors in [8] use RL in a factory setting for time-slot selection for packet transmission. However, in [6]- [8], the authors do not take into consideration power allocation, which is certainly beneficial to manage the interference.…”

Section: Introductionmentioning

confidence: 99%

Deep Reinforcement Learning for Joint User Association and Resource Allocation in Factory Automation

Farzanullah

Le‐Ngoc

2022

2022 IEEE Wireless Communications and Networking Conference (WCNC)

View full text Add to dashboard Cite

We propose joint user association, channel assignment and power allocation for mobile robot Ultra-Reliable and Low Latency Communications (URLLC) based on multiconnectivity and reinforcement learning. The mobile robots require control messages from the central guidance system at regular intervals. We use a two-phase communication scheme where robots can form multiple clusters. The robots in a cluster are close to each other and can have reliable Deviceto-Device (D2D) communications. In Phase I, the APs transmit the combined payload of a cluster to the cluster leader within a latency constraint. The cluster leader broadcasts this message to its members in Phase II. We develop a distributed Multi-Agent Reinforcement Learning (MARL) algorithm for joint user association and resource allocation (RA) for Phase I. The cluster leaders use their local Channel State Information (CSI) to decide the APs for connection along with the sub-band and power level. The cluster leaders utilize multi-connectivity to connect to multiple APs to increase their reliability. The objective is to maximize the successful payload delivery probability for all robots. Illustrative simulation results indicate that the proposed scheme can approach the performance of the centralized algorithm and offer a substantial gain in reliability as compared to single-connectivity (when cluster leaders are able to connect to 1 AP).

show abstract

“…To overcome these limitations, we conjecture that reinforcement learning (RL) methods [10], [11] can be developed to perform resource selection, with potential performance improvement over existing approaches even with only quantized information. Moreover, a RL based method will eliminate the offline data generation requirement for the method in [9].…”

Section: Introductionmentioning

confidence: 99%

Intelligent Multi-Agent Resource Allocation in 6G in-X Subnetworks with Limited Sensing Information

Adeogun¹,

Berardinelli²

2022

Preprint

Self Cite

View full text Add to dashboard Cite

In this letter, we investigate dynamic resource selection in dense deployments of a recent 6G mobile in-X subnetworks (inXSs). We cast resource selection in inXSs as a multi-objective optimization problem involving maximization of per inXS sum capacities. Since inXSs are expected to be autonomous, selection decisions are made by each inXS based on its local information without signalling from other inXSs. A multi-agent Q-learning (MAQL) method based on limited sensing information (SI) is then developed resulting in significant reduction in the overhead associated with intra-subnetwork SI exchanges. We perform simulations with focus on two similar but distinct resource allocation problems: joint channel and transmit power selection and channel selection with aggregation. The results indicate that: 1) appropriate settings of Q-learning parameters leads to fast convergence of the MAQL method even with 1-bit quantization of the SI; 2) the proposed MAQL approach offer similar performance and is more robust to sensing delays than the best baseline heuristic with full SI.

show abstract

Distributed Deep Reinforcement Learning Resource Allocation Scheme For Industry 4.0 Device-To-Device Scenarios

Cited by 5 publications

References 15 publications

Distributed Channel Allocation for Mobile 6G Subnetworks via Multi-Agent Deep Q-Learning

Distributed Channel Allocation for Mobile 6G Subnetworks via Multi-Agent Deep Q-Learning

Deep Reinforcement Learning for Joint User Association and Resource Allocation in Factory Automation

Intelligent Multi-Agent Resource Allocation in 6G in-X Subnetworks with Limited Sensing Information

Contact Info

Product

Resources

About