Multi-agent deep learning for simultaneous optimization for time and energy in distributed routing system

Mukhutdinov, Dmitry; Filchenkov, Andrey; Shalyto, Anatoly; Vyatkin, Valeriy

doi:10.1016/j.future.2018.12.037

Cited by 38 publications

(32 citation statements)

References 14 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The literature [31] applies the DQN-routing algorithm in Deep Reinforcement Learning DRL to solve the routing problem, which combines the advantages of Q-routing and DQN. Each router is considered as an agent whose parameters are shared and updated simultaneously during the training process (centralized training), but it provides independent packet transmission instructions (decentralized execution).…”

Section: Related Workmentioning

confidence: 99%

“…Energy is an important factor in UAV scenarios. To examine the energy consumption of the routing protocol, we counted the residual energy as a performance parameter, which is calculated as shown in (31). R is the number of rounds of UAV executing tasks, we set the amount of data to be distributed to execute one round of tasks to 1000 / bit round , the communication distance threshold 0 300…”

Section: Simulation Experimentsmentioning

confidence: 99%

See 1 more Smart Citation

QLGR: A Q-learning-based Geographic FANET Routing Algorithm Based on Multi-agent Reinforcement Learning

2021

KSII TIIS

View full text Add to dashboard Cite

The utilization of UAVs in various fields has led to the development of flying ad hoc network (FANET) technology. In a network environment with highly dynamic topology and frequent link changes, the traditional routing technology of FANET cannot satisfy the new communication demands. Traditional routing algorithm, based on geographic location, can "fall" into a routing hole. In view of this problem, we propose a geolocation routing protocol based on multi-agent reinforcement learning, which decreases the packet loss rate and routing cost of the routing protocol. The protocol views each node as an intelligent agent and evaluates the value of its neighbor nodes through the local information. In the value function, nodes consider information such as link quality, residual energy and queue length, which reduces the possibility of a routing hole. The protocol uses global rewards to enable individual nodes to collaborate in transmitting data. The performance of the protocol is experimentally analyzed for UAVs under extreme conditions such as topology changes and energy constraints. Simulation results show that our proposed QLGR-S protocol has advantages in performance parameters such as throughput, end-to-end delay, and energy consumption compared with the traditional GPSR protocol. QLGR-S provides more reliable connectivity for UAV networking technology, safeguards the communication requirements between UAVs, and further promotes the development of UAV technology.

show abstract

Section: Related Workmentioning

confidence: 99%

Section: Simulation Experimentsmentioning

confidence: 99%

QLGR: A Q-learning-based Geographic FANET Routing Algorithm Based on Multi-agent Reinforcement Learning

2021

KSII TIIS

View full text Add to dashboard Cite

show abstract

“…This prevents the straightforward use of experience replay, which is crucial for stabilizing deep Q learning [34]. [35] combines the Q-routing and deep Q-learning to solve the routing problem. However, the training process of the algorithm proposed in [35] is in a centralized manner (all the routers need to share parameters), which might cause issues in real-world large-scale network environments.…”

Section: Reinforcement Learning For Routingmentioning

confidence: 99%

“…[35] combines the Q-routing and deep Q-learning to solve the routing problem. However, the training process of the algorithm proposed in [35] is in a centralized manner (all the routers need to share parameters), which might cause issues in real-world large-scale network environments. The authors in [36] propose to use a deep actor-critic reinforcement learning algorithm to optimize the performance of the communication network.…”

Section: Reinforcement Learning For Routingmentioning

confidence: 99%

MAMRL: Exploiting Multi-agent Meta Reinforcement Learning in WAN Traffic Engineering

Sun

Kiran²,

Ren

2021

Preprint

View full text Add to dashboard Cite

Traffic optimization challenges, such as load balancing, flow scheduling, and improving packet delivery time, are difficult online decision-making problems in wide area networks (WAN). Complex heuristics are needed for instance to find optimal paths that improve packet delivery time and minimize interruptions which may be caused by link failures or congestion. The recent success of reinforcement learning (RL) algorithms can provide useful solutions to build better robust systems that learn from experience in model-free settings. In this work, we consider a path optimization problem, specifically for packet routing, in large complex networks. We develop and evaluate a model-free approach, applying multi-agent meta reinforcement learning (MAMRL) that can determine the next-hop of each packet to get it delivered to its destination with minimum time overall. Specifically, we propose to leverage and compare deep policy optimization RL algorithms for enabling distributed model-free control in communication networks and present a novel meta-learning-based framework, MAMRL, for enabling quick adaptation to topology changes. To evaluate the proposed framework, we simulate with various WAN topologies.Our extensive packet-level simulation results show that compared to classical shortest path and traditional reinforcement learning approaches, MAMRL significantly reduces the average packet delivery time even when network demand increases; and compared to a non-meta deep policy optimization algorithm, our results show the reduction of packet loss in much fewer episodes when link failures occur while offering comparable average packet delivery time.

show abstract

“…In recent years, with the rapid development and successful application of machine learning technology in various fields, such as management operation research [10,16], medicine [4,22], computer science [11], etc., several technologies have been introduced to solve combinatorial optimization problems [26,29]. Vinyals et al [29] proposed a model consisting of two recurrent neural networks (RNNs) and an attention mechanism to solve combinatorial optimization problems.…”

Section: Introductionmentioning

confidence: 99%

A Pointer Neural Network for the Vehicle Routing Problem with Task Priority and Limited Resources

Sheng

Wei

2020

ITC

View full text Add to dashboard Cite

The vehicle routing problem with task priority and limited resources (VRPTPLR) is a generalized version of the vehicle routing problem (VRP) with multiple task priorities and insufficient vehicle capacities. The objective of this problem is to maximize the total benefits. Compared to the traditional mathematical analysis methods, the pointer neural network proposed in this paper continuously learns the mapping relationship between input nodes and output decision schemes based on the actual distribution conditions. In addition, a global attention mechanism is adopted in the neural network to improve the convergence rate and results. To verify the effectiveness of the method, we model the VRPTPLR and compare the results with those of a genetic algorithm. The parameter sensitivity of each algorithm is assessed using different datasets. Then, comparison experiments with the two algorithms employing optimal parameter configurations are performed for the validation sets, which are generated at different instance scales. It is found that the solution time of the pointer neural network is much shorter than that of the genetic algorithm and that the proposed method provides better solutions for large-scale instances.

show abstract

Multi-agent deep learning for simultaneous optimization for time and energy in distributed routing system

Cited by 38 publications

References 14 publications

QLGR: A Q-learning-based Geographic FANET Routing Algorithm Based on Multi-agent Reinforcement Learning

QLGR: A Q-learning-based Geographic FANET Routing Algorithm Based on Multi-agent Reinforcement Learning

MAMRL: Exploiting Multi-agent Meta Reinforcement Learning in WAN Traffic Engineering

A Pointer Neural Network for the Vehicle Routing Problem with Task Priority and Limited Resources

Contact Info

Product

Resources

About