2022
DOI: 10.1007/s10922-022-09667-3
|View full text |Cite
|
Sign up to set email alerts
|

An Efficient and Decentralized Fuzzy Reinforcement Learning Bandwidth Controller for Multitenant Data Centers

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
0
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
2
1
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(2 citation statements)
references
References 32 publications
0
0
0
Order By: Relevance
“…Os algoritmos de aprendizado por reforço estimam a recompensa de uma ação a partir de um estado e escolhem as ações a serem tomadas de forma a maximizar a recompensa ao longo do tempo [Filho et al, 2022]. Diversos algoritmos podem ser utilizados para estimar essa recompensa.…”
Section: Non Real Time (Non-rt)unclassified
“…Os algoritmos de aprendizado por reforço estimam a recompensa de uma ação a partir de um estado e escolhem as ações a serem tomadas de forma a maximizar a recompensa ao longo do tempo [Filho et al, 2022]. Diversos algoritmos podem ser utilizados para estimar essa recompensa.…”
Section: Non Real Time (Non-rt)unclassified
“…To describe this cumulative reward from a mathematical model perspective, it is necessary to introduce a cumulative discount reward mechanism that distinguishes the rewards through a weight. The immediate reward weight is the highest, while the future reward is only an expected estimate, and the weight gradually decreases [31,38]. A complete reinforcement learning model must have four core components: model policy, reward function, value function, and exploration method.…”
mentioning
confidence: 99%