ICC 2021 - IEEE International Conference on Communications 2021
DOI: 10.1109/icc42927.2021.9500379
|View full text |Cite
|
Sign up to set email alerts
|

Hierarchical Policy Learning for Hybrid Communication Load Balancing

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 18 publications
(2 citation statements)
references
References 17 publications
0
2
0
Order By: Relevance
“…This avoids the need to engineer a sensible discretization scheme, and can potentially avoid undesired human bias in this process. An additional trend in recent papers is the integration of RL with other learning techniques such as clustering (i.e., Xu et al, 2019b), hierarchical learning (i.e., Kang et al, 2021), meta-learning (i.e., Feriani et al, 2022, and knowledge distillation (i.e., Li et al, 2022). These additional techniques complement the core RL method and address key challenges such as scalability, adaptability, and model generalization.…”
Section: Policy-gradient Methods For Load Balancingmentioning
confidence: 99%
See 1 more Smart Citation
“…This avoids the need to engineer a sensible discretization scheme, and can potentially avoid undesired human bias in this process. An additional trend in recent papers is the integration of RL with other learning techniques such as clustering (i.e., Xu et al, 2019b), hierarchical learning (i.e., Kang et al, 2021), meta-learning (i.e., Feriani et al, 2022, and knowledge distillation (i.e., Li et al, 2022). These additional techniques complement the core RL method and address key challenges such as scalability, adaptability, and model generalization.…”
Section: Policy-gradient Methods For Load Balancingmentioning
confidence: 99%
“…In Kang et al (2021), the authors propose a hierarchical policy learning framework for load balancing for both active users and idle users. Proximal policy optimization (PPO) of Schulman et al (2017) is the RL algorithm used for this work.…”
Section: Policy-gradient Methods For Load Balancingmentioning
confidence: 99%