2014
DOI: 10.1109/twc.2014.022014.130840
|View full text |Cite
|
Sign up to set email alerts
|

TACT: A Transfer Actor-Critic Learning Framework for Energy Saving in Cellular Radio Access Networks

Abstract: Recent works have validated the possibility of improving energy efficiency in radio access networks (RANs), achieved by dynamically turning on/off some base stations (BSs). In this paper, we extend the research over BS switching operations, which should match up with traffic load variations. Instead of depending on the dynamic traffic loads which are still quite challenging to precisely forecast, we firstly formulate the traffic variations as a Markov decision process. Afterwards, in order to foresightedly min… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
115
0

Year Published

2014
2014
2020
2020

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 131 publications
(115 citation statements)
references
References 24 publications
0
115
0
Order By: Relevance
“…In addition to game theory, machine learning has been considered as an effective tool in solving different network problems in 5G [15], [16]. RL is one of the most powerful tools for policy control and intelligent decision making [5], which has been widely adopted in wireless communications [17]- [19]. Recently, a number of works have applied RL to solve the intelligent resource management and decision making problem in D2D underlay networks [20]- [27].…”
Section: Arxiv:191209302v1 [Csni] 18 Dec 2019mentioning
confidence: 99%
“…In addition to game theory, machine learning has been considered as an effective tool in solving different network problems in 5G [15], [16]. RL is one of the most powerful tools for policy control and intelligent decision making [5], which has been widely adopted in wireless communications [17]- [19]. Recently, a number of works have applied RL to solve the intelligent resource management and decision making problem in D2D underlay networks [20]- [27].…”
Section: Arxiv:191209302v1 [Csni] 18 Dec 2019mentioning
confidence: 99%
“…Therefore, the total number of flows in the system is equal to bBONLb1Lb, which is proportional to the expected delay. () Thus, with a low system load on average, the UEs experience less delay …”
Section: System Modelmentioning
confidence: 99%
“…, which is proportional to the expected delay. 17,18 Thus, with a low system load on average, the UEs experience less delay. 19…”
Section: Figurementioning
confidence: 99%
“…More specifically, different criteria (e.g., traffic load [11], user spatial distribution [12] or networkimpact [13]) are considered in order to identify the optimal BS switching off strategy, guaranteeing a certain level of user satisfaction in the network. In the same context, a very recent interesting approach is the application of reinforcement learning schemes that cope with the dynamic nature of the traffic load in current cellular networks [14] in order to overcome an important limitation of the existing works, which rely on past predefined traffic patterns based on the network history.…”
Section: Base Station Switching Off and Infrastructure Sharingmentioning
confidence: 99%