2019
DOI: 10.1016/j.future.2018.12.037
|View full text |Cite
|
Sign up to set email alerts
|

Multi-agent deep learning for simultaneous optimization for time and energy in distributed routing system

Abstract: Packet routing problem most commonly emerges in the context of computer networks, thus the majority of routing algorithms existing nowadays is designed specifically for routing in computer networks. However, in the logistics domain, many problems can be formulated in terms of packet routing, e.g. in automated traffic routing or material handling systems. In this paper, we propose an algorithm for packet routing in such heterogeneous environments. Our approach is based on deep reinforcement learning networks co… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
32
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 38 publications
(32 citation statements)
references
References 14 publications
0
32
0
Order By: Relevance
“…The literature [31] applies the DQN-routing algorithm in Deep Reinforcement Learning DRL to solve the routing problem, which combines the advantages of Q-routing and DQN. Each router is considered as an agent whose parameters are shared and updated simultaneously during the training process (centralized training), but it provides independent packet transmission instructions (decentralized execution).…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…The literature [31] applies the DQN-routing algorithm in Deep Reinforcement Learning DRL to solve the routing problem, which combines the advantages of Q-routing and DQN. Each router is considered as an agent whose parameters are shared and updated simultaneously during the training process (centralized training), but it provides independent packet transmission instructions (decentralized execution).…”
Section: Related Workmentioning
confidence: 99%
“…Energy is an important factor in UAV scenarios. To examine the energy consumption of the routing protocol, we counted the residual energy as a performance parameter, which is calculated as shown in (31). R is the number of rounds of UAV executing tasks, we set the amount of data to be distributed to execute one round of tasks to 1000 / bit round , the communication distance threshold 0 300…”
Section: Simulation Experimentsmentioning
confidence: 99%
“…This prevents the straightforward use of experience replay, which is crucial for stabilizing deep Q learning [34]. [35] combines the Q-routing and deep Q-learning to solve the routing problem. However, the training process of the algorithm proposed in [35] is in a centralized manner (all the routers need to share parameters), which might cause issues in real-world large-scale network environments.…”
Section: Reinforcement Learning For Routingmentioning
confidence: 99%
“…[35] combines the Q-routing and deep Q-learning to solve the routing problem. However, the training process of the algorithm proposed in [35] is in a centralized manner (all the routers need to share parameters), which might cause issues in real-world large-scale network environments. The authors in [36] propose to use a deep actor-critic reinforcement learning algorithm to optimize the performance of the communication network.…”
Section: Reinforcement Learning For Routingmentioning
confidence: 99%
“…In recent years, with the rapid development and successful application of machine learning technology in various fields, such as management operation research [10,16], medicine [4,22], computer science [11], etc., several technologies have been introduced to solve combinatorial optimization problems [26,29]. Vinyals et al [29] proposed a model consisting of two recurrent neural networks (RNNs) and an attention mechanism to solve combinatorial optimization problems.…”
Section: Introductionmentioning
confidence: 99%