2012 IEEE International Conference on Computer Science and Automation Engineering (CSAE) 2012
DOI: 10.1109/csae.2012.6272601
|View full text |Cite
|
Sign up to set email alerts
|

The apprentice modeling through reinforcement with a temporal analysis using the Q-learning algorithm

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2020
2020
2020
2020

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(2 citation statements)
references
References 1 publication
0
2
0
Order By: Relevance
“…Meanwhile, some literature has analysed the training process for RL based on the parameters α, c, and some other parameters which are also important for the convergence rate [8]. e influence to the convergence in RL from the major parameters, algorithmic complexity, the reward designing, and the training data is analysed in [9].…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Meanwhile, some literature has analysed the training process for RL based on the parameters α, c, and some other parameters which are also important for the convergence rate [8]. e influence to the convergence in RL from the major parameters, algorithmic complexity, the reward designing, and the training data is analysed in [9].…”
Section: Introductionmentioning
confidence: 99%
“…It has been successfully applied in many fields such as information processing, artificial intelligence, and statistics [12]. In the field of MDPs [8], entropy has been used to optimize the decision results.…”
Section: Introductionmentioning
confidence: 99%