Optimal and Scalable Caching for 5G Using Reinforcement Learning of Space-Time Popularities

Sadeghi, Alireza; Sheikholeslami, Fatemeh; Giannakis, Georgios B.

doi:10.1109/jstsp.2017.2787979

Cited by 233 publications

(183 citation statements)

References 36 publications

Supporting

Mentioning

173

Contrasting

Order By: Relevance

“…The grouped linear model is introduced in [30] to obtain the predicted content requests, based on which the cache replacement is optimized by using RL with model-free acceleration. In [31], a RL framework is proposed to obtain the optimal caching strategy at SBSs taking into account the space-time dynamics of the content popularity. In [32], the probabilistic caching strategy, resource allocation, and computation offloading at fog nodes are jointly considered to minimize the average transmission delay exploiting deep RL.…”

Section: A Related Workmentioning

confidence: 99%

“…According to [44], the skewness of the Zipf-like distribution is dependent on the specific application. In brief, Zipf-like distribution is able to well depict the real user request distribution of various networks and thus is widely adopted in [18], [21], [25], [26], [31]. In this paper, we consider the mobile UDN and thus Zipf-like distribution can be safely used to model the user request distribution.…”

Section: B Content Popularity Profilementioning

confidence: 99%

See 1 more Smart Citation

Reinforcement Learning Based Cooperative Coded Caching Under Dynamic Popularities in Ultra-Dense Networks

Gao

Dong

Pan

et al. 2020

IEEE Trans. Veh. Technol.

View full text Add to dashboard Cite

For ultra-dense networks with wireless backhaul, caching strategy at small base stations (SBSs), usually with limited storage, is critical to meet massive high data rate requests. Since the content popularity profile varies with time in an unknown way, we exploit reinforcement learning (RL) to design a cooperative caching strategy with maximum-distance separable (MDS) coding. We model the MDS coding based cooperative caching as a Markov decision process to capture the popularity dynamics and maximize the long-term expected cumulative traffic load served directly by the SBSs without accessing the macro base station. For the formulated problem, we first find the optimal solution for a small-scale system by embedding the cooperative MDS coding into Q-learning. To cope with the large-scale case, we approximate the state-action value function heuristically. The approximated function includes only a small number of learnable parameters and enables us to propose a fast and efficient action-selection approach, which dramatically reduces the complexity. Numerical results verify the optimality/near-optimality of the proposed RL based algorithms and show the superiority compared with the baseline schemes. They also exhibit good robustness to different environments.

show abstract

Section: A Related Workmentioning

confidence: 99%

Section: B Content Popularity Profilementioning

confidence: 99%

Reinforcement Learning Based Cooperative Coded Caching Under Dynamic Popularities in Ultra-Dense Networks

Gao

Dong

Pan

et al. 2020

IEEE Trans. Veh. Technol.

View full text Add to dashboard Cite

show abstract

“…Instead of fitting models, another option is to learn the popularity without using prior assumptions [21], [22]. For instance, [22] models the popularity evolution as a Markov process and employs Q-learning to estimate the transition probabilities which are then used for proactive caching. Such model-free solutions work well if there are adequate data, but have substantial computation and memory requirements.…”

Section: A Reactive Policiesmentioning

confidence: 99%

Online Convex Optimization for Caching Networks

Paschos¹,

Destounis

Iosifidis

2020

IEEE/ACM Trans. Networking

View full text Add to dashboard Cite

We study the problem of wireless edge caching when file popularity is unknown and possibly non-stationary. A bank of J caches receives file requests and a utility is accrued for each request depending on the serving cache. The network decides dynamically which files to store at each cache and how to route them, in order to maximize total utility. The request sequence is assumed to be drawn from an arbitrary distribution, capturing time-variance, temporal and spatial locality of requests. For this challenging setting, we propose the Bipartite Supergradient Caching Algorithm (BSCA) which provably exhibits no regret (RT /T → 0). That is, as the time horizon T increases, BSCA achieves (at least) the same utility with the cache configuration that we would have chosen knowing all future requests. The learning rate of the algorithm is characterized by its regret expression RT = O( √ JT ), which is independent of the file library size. For the single-cache case, we prove that this is the lowest attainable bound. BSCA requires at each step J projections on intersections of boxes and simplices, for which we propose a tailored algorithm. Our model is the first that draws a connection between the network caching problem and Online Convex Optimization, and we demonstrate its generality by discussing various practical extensions and presenting a tracedriven comparison with state-of-the-art competitors.

show abstract

“…In particular, researchers are working on Mobile Edge Computing (MEC) [94] and on what, where, when, and how to log cache contents. In this context, coding [95] and machine learning [96] techniques can be applied. Examples of typical scenarios are vehicular communications [97] and low-latency applications [98].…”

Section: Cachingmentioning

confidence: 99%

Orchestration and Control in Software‐Defined 5G Networks: Research Challenges

Nencioni

Garroppo

Gonzalez

et al. 2018

Wireless Communications and Mobile Computing

View full text Add to dashboard Cite

The fifth generation (5G) of cellular networks promises to be a major step in the evolution of wireless technology. 5G is planned to be used in a very broad set of application scenarios. These scenarios have strict heterogeneous requirements that will be accomplished by enhancements on the radio access network and a collection of innovative wireless technologies. Softwarization technologies, such as Software-Defined Networking (SDN) and Network Function Virtualization (NFV), will play a key role in integrating these different technologies. Network slicing emerges as a cost-efficient solution for the implementation of the diverse 5G requirements and verticals. The 5G radio access and core networks will be based on a SDN/NFV infrastructure, which will be able to orchestrate the resources and control the network in order to efficiently and flexibly and with scalability provide network services. In this paper, we present the up-to-date status of the software-defined 5G radio access and core networks and a broad range of future research challenges on the orchestration and control aspects.

show abstract

Optimal and Scalable Caching for 5G Using Reinforcement Learning of Space-Time Popularities

Cited by 233 publications

References 36 publications

Reinforcement Learning Based Cooperative Coded Caching Under Dynamic Popularities in Ultra-Dense Networks

Reinforcement Learning Based Cooperative Coded Caching Under Dynamic Popularities in Ultra-Dense Networks

Online Convex Optimization for Caching Networks

Orchestration and Control in Software‐Defined 5G Networks: Research Challenges

Contact Info

Product

Resources

About