2019
DOI: 10.1109/access.2019.2941229
|View full text |Cite
|
Sign up to set email alerts
|

Q-Learning Algorithms: A Comprehensive Classification and Applications

Abstract: Q-learning is arguably one of the most applied representative reinforcement learning approaches and one of the off-policy strategies. Since the emergence of Q-learning, many studies have described its uses in reinforcement learning and artificial intelligence problems. However, there is an information gap as to how these powerful algorithms can be leveraged and incorporated into general artificial intelligence workflow. Early Q-learning algorithms were unsatisfactory in several aspects and covered a narrow ran… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
119
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
8
1

Relationship

0
9

Authors

Journals

citations
Cited by 285 publications
(121 citation statements)
references
References 73 publications
0
119
0
Order By: Relevance
“…In recent years, artificial intelligence techniques, which include machine learning, have attracted a significant amount of interest from researchers of various fields [ 8 ]. Among such techniques, reinforcement learning (RL) is being investigated in wireless systems because it provides a solution to optimize the system parameters by learning the surrounding area in a dynamic and complicated wireless environment [ 10 , 11 , 12 ]. Q-learning is a representative RL, and studies on using this approach to allocate routing policies in a dynamically changing network environment have been conducted.…”
Section: Related Studiesmentioning
confidence: 99%
“…In recent years, artificial intelligence techniques, which include machine learning, have attracted a significant amount of interest from researchers of various fields [ 8 ]. Among such techniques, reinforcement learning (RL) is being investigated in wireless systems because it provides a solution to optimize the system parameters by learning the surrounding area in a dynamic and complicated wireless environment [ 10 , 11 , 12 ]. Q-learning is a representative RL, and studies on using this approach to allocate routing policies in a dynamically changing network environment have been conducted.…”
Section: Related Studiesmentioning
confidence: 99%
“…The RML [35,36] is a vital branch of the machine learning. It has been widely applied in real-time decision-making problems such as autonomous driving and robot control.…”
Section: Overview Of the Q Reinforcement Machine Learning (Q-rml)mentioning
confidence: 99%
“…The proposed solution incorporates a dual RML approach for joint capacity and latency online optimization. We consider the model-free Q-reinforcement-machine-learning (Q-RML) algorithm [35] for its performance merits and low implementation complexity under a moderate state-action space size. As depicted by Fig.…”
Section: Proposed Rml Based Pattern Optimizationmentioning
confidence: 99%
“…Some studies [11,16,30,35] have indicated that one of the restrictions of dynamic scheduling using machine learning classification approaches is the lack of a revised mechanism in the dynamic scheduling KB. Recent advances in information and communication technology (ICT) using reinforcement learning (RL) [36] has been widely used [37] in robot motion control [38,39], industrial process control [40,41], and production scheduling [15,[42][43][44][45][46], and internet of things (IoT) [47]. ICT learns the optimal policy through interaction with the environment without the model of the environment.…”
Section: Figure 1 the Role Of The Mdsr Mechanismmentioning
confidence: 99%