2019 27th Signal Processing and Communications Applications Conference (SIU) 2019
DOI: 10.1109/siu.2019.8806389
|View full text |Cite
|
Sign up to set email alerts
|

Model-Free Reinforcement Learning Algorithms: A Survey

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
5
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 17 publications
(10 citation statements)
references
References 5 publications
0
5
0
Order By: Relevance
“…RL studies the way natural and artificial systems can learn to predict the consequences of and optimize their behavior in environments where actions lead them from one state or situation to the next and can lead to rewards and punishments. A common model for RL is the standard Markov Decision Process. , RL can be divided into model-based RL and model-free RL, , as well as active RL and passive RL . DL models can also be used in RL to form deep RL (DRL). , These methods have been applied to diverse fields in biology and chemistry, including drug discovery, , protein design, , and chemical engineering. , They have also been used in image recognition , and financial markets .…”
Section: Methods For Small Molecular Data Challengesmentioning
confidence: 99%
See 1 more Smart Citation
“…RL studies the way natural and artificial systems can learn to predict the consequences of and optimize their behavior in environments where actions lead them from one state or situation to the next and can lead to rewards and punishments. A common model for RL is the standard Markov Decision Process. , RL can be divided into model-based RL and model-free RL, , as well as active RL and passive RL . DL models can also be used in RL to form deep RL (DRL). , These methods have been applied to diverse fields in biology and chemistry, including drug discovery, , protein design, , and chemical engineering. , They have also been used in image recognition , and financial markets .…”
Section: Methods For Small Molecular Data Challengesmentioning
confidence: 99%
“…A common model for RL is the standard Markov Decision Process. 354,355 RL can be divided into model-based RL 356 and model-free RL, 357,358 as well as active RL 359 and passive RL. 360 DL models can also be used in RL to form deep RL (DRL).…”
Section: Reinforcement Learningmentioning
confidence: 99%
“…They directly learn from experience by estimating the optimal policy or value function. Model-free algorithms such as Q-learning, Monte Carlo Control, SARSA (State-Action-Reward-State-Action), Deep Q Network, etc., are widely used in RL [24].…”
Section: Reinforcement Learningmentioning
confidence: 99%
“…But, it needs to improve the routing efficiency. A secure and efficient privacy-preserving unspecified authentication approach offers data security and privacy [18]. A trust-based formal model depicts the fault identification procedure and confirms faults [19].…”
Section: Introductionmentioning
confidence: 99%