2017
DOI: 10.1109/tvt.2017.2751641
|View full text |Cite
|
Sign up to set email alerts
|

Deep-Reinforcement-Learning-Based Optimization for Cache-Enabled Opportunistic Interference Alignment Wireless Networks

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
120
0
1

Year Published

2018
2018
2023
2023

Publication Types

Select...
6
3

Relationship

0
9

Authors

Journals

citations
Cited by 278 publications
(121 citation statements)
references
References 34 publications
0
120
0
1
Order By: Relevance
“…Fortunately, reinforcement learning (RL) has been shown effective in addressing decision making under uncertainty [6]. In particular, recent success of deep RL in humanlevel video game play [7] and AlphaGo [8] has sparked a flurry of interest in applying RL techniques to solve problems from a wide variety of areas and remarkable progress has been made ever since [9], [10], [11]. It provides a robust and principled way to treat environment dynamics and perform sequential decision making under uncertainty, thus representing a promising method to handle the unique and challenging V2X dynamics.…”
Section: A Problem Statement and Motivationmentioning
confidence: 99%
“…Fortunately, reinforcement learning (RL) has been shown effective in addressing decision making under uncertainty [6]. In particular, recent success of deep RL in humanlevel video game play [7] and AlphaGo [8] has sparked a flurry of interest in applying RL techniques to solve problems from a wide variety of areas and remarkable progress has been made ever since [9], [10], [11]. It provides a robust and principled way to treat environment dynamics and perform sequential decision making under uncertainty, thus representing a promising method to handle the unique and challenging V2X dynamics.…”
Section: A Problem Statement and Motivationmentioning
confidence: 99%
“…For instance, an intelligent modulation and coding selection [67] has been developed for the primary transmission where a DRL agent is implemented at the primary transmitter to learn the interference pattern from secondary transmitters. Moreover, He et al have used DNN to learn the impact of user scheduling on the sum-rate in a wireless caching network [68]. It is noted that the perception and action is usually coupled, especially when we adopt reinforcement learning techniques.…”
Section: B Learning From Radio Environmentmentioning
confidence: 99%
“…Albeit reasonable for discrete states, these approaches cannot deal with large continuous state-action spaces. To cope with such spaces, DRL approaches have been considered for content caching in e.g., [19], [20], [21], [22], [6], [23].…”
Section: A Prior Art On Cachingmentioning
confidence: 99%