2021 IEEE Global Communications Conference (GLOBECOM) 2021
DOI: 10.1109/globecom46510.2021.9685936
|View full text |Cite
|
Sign up to set email alerts
|

R-Learning-Based Admission Control for Service Federation in Multi-domain 5G Networks

Abstract: Network service federation in 5G/B5G networks enables service providers to extend service offering by collaborating with peering providers. Realizing this vision requires interoperability among providers towards end-to-end service orchestration across multiple administrative domains. Smart admission control is fundamental to make such extended offering profitable. Without prior knowledge of service requests, the admission controller (AC) either determines the domain to deploy the demand or rejects it to maximi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
4
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
4

Relationship

1
7

Authors

Journals

citations
Cited by 11 publications
(6 citation statements)
references
References 20 publications
0
4
0
Order By: Relevance
“…Bakhshi et al [41] propose a slice admission control in a federated environment formed by one consumer and provider domain. For a given slice request, the admission control decides whether to deploy the slice in the consumer or the provider domain or reject it.…”
Section: B Slice Admission Controlmentioning
confidence: 99%
“…Bakhshi et al [41] propose a slice admission control in a federated environment formed by one consumer and provider domain. For a given slice request, the admission control decides whether to deploy the slice in the consumer or the provider domain or reject it.…”
Section: B Slice Admission Controlmentioning
confidence: 99%
“…Policy Iteration [38] is one of the dynamic algorithms in RL that is used to learn the optimal policy in an MDP environment when we are aware of transition probabilities. Similar to [39], we have applied policy iteration for our goal, which tries to maximize the long-term reward of the InP. It follows three steps to find the best policy as illustrated in Algorithm 1.…”
Section: B Dynamic Algorithm: Policy Iterationmentioning
confidence: 99%
“…In [7], a reinforcement-learning (RL) based admission control is proposed to maximize profit. The RL algorithm improves revenue but will not perform optimally when the state space grows.…”
Section: Related Workmentioning
confidence: 99%