2019
DOI: 10.1109/access.2019.2916314
|View full text |Cite
|
Sign up to set email alerts
|

Multi-Agent Reinforcement Learning Based Cooperative Content Caching for Mobile Edge Networks

Abstract: Designing clustered unmanned aerial vehicle (UAV) communication networks based on cognitive radio (CR) and reinforcement learning can significantly improve the intelligence level of clustered UAV communication networks and the robustness of the system in a time-varying environment. Among them, designing smarter systems for spectrum sensing and access is a key research issue in CR. Therefore, we focus on the dynamic cooperative spectrum sensing and channel access in clustered cognitive UAV (CUAV) communication … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
47
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 77 publications
(47 citation statements)
references
References 63 publications
0
47
0
Order By: Relevance
“…Then r k is constrained to r k ≤ R k so as to guarantee GT task U k executed before its completion deadline. Then one has (29) where the inequality is due to the fact that the UAV will execute each offloaded task quicker than the task being conducted by GT locally, i.e. a k F k…”
Section: Appendix Prove Theorem In (13)mentioning
confidence: 99%
“…Then r k is constrained to r k ≤ R k so as to guarantee GT task U k executed before its completion deadline. Then one has (29) where the inequality is due to the fact that the UAV will execute each offloaded task quicker than the task being conducted by GT locally, i.e. a k F k…”
Section: Appendix Prove Theorem In (13)mentioning
confidence: 99%
“…Typical examples in this sense are web caching systems that store web content in user browsers or in dedicated http proxies. Today, caching is proposed to improve the efficiency of novel mobile edge computing solutions [9] or to optimize content dissemination in opportunistic networks where device-to-device connection is time limited and unpredictable [48,49]. In information-centric networking, a recent paradigm that moves the focus of networking from host-to-host communication to content-oriented delivery, caching is an essential element of the network since all nodes can store content objects in an internal memory.…”
Section: In-network Cachingmentioning
confidence: 99%
“…Content update statistics evolved to become a fundamental component in the design and study of any communication network where caching and prefetching play a central role, such as content delivery networks (CDNs) and Information-centric networks (ICNs) [3][4][5], 5G networks [6][7][8][9], and wireless networks [10]. Indeed, accurate estimation of inter-update distribution is an important issue.…”
Section: Introductionmentioning
confidence: 99%
“…In addition, for caching, we need to consider task popularity, which is an important factor affecting the caching. Task popularity refers to the probability of a task being requested within a specific time [24]. Most articles believe that popularity is known and follows a Zipf distribution [25]- [28].…”
Section: Introductionmentioning
confidence: 99%
“…Most articles believe that popularity is known and follows a Zipf distribution [25]- [28]. However, in the actual scenario, due to the dynamics of the task and the mobility of the user, the popularity changes dynamically over time, so it can be considered as a time series [24]. Gated Recurrent Unit (GRU) algorithm is a method of deep learning, which can deal with vanishing gradient and efficiently capture long-term dependencies [29].…”
Section: Introductionmentioning
confidence: 99%