Proceedings of the 11th Annual Conference on Genetic and Evolutionary Computation 2009
DOI: 10.1145/1569901.1570064
|View full text |Cite
|
Sign up to set email alerts
|

Uncertainty handling CMA-ES for reinforcement learning

Abstract: The covariance matrix adaptation evolution strategy (CMA-ES) has proven to be a powerful method for reinforcement learning (RL). Recently, the CMA-ES has been augmented with an adaptive uncertainty handling mechanism. Because uncertainty is a typical property of RL problems this new algorithm, termed UH-CMA-ES, is promising for RL. The UH-CMA-ES dynamically adjusts the number of episodes considered in each evaluation of a policy. It controls the signal to noise ratio such that it is just high enough for a suff… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2009
2009
2024
2024

Publication Types

Select...
3
2
1

Relationship

1
5

Authors

Journals

citations
Cited by 10 publications
(2 citation statements)
references
References 26 publications
(52 reference statements)
0
2
0
Order By: Relevance
“…It considers a different definition of convergence (more on this later).R-EDA was proposed as a typical noisy optimization algorithm, easy to analyze and making a good approximation of real-world approaches [17]. The noisy optimization framework is described in Algorithm 1 and R-EDA is defined in Algorithm 2.…”
Section: Frameworkmentioning
confidence: 99%
“…It considers a different definition of convergence (more on this later).R-EDA was proposed as a typical noisy optimization algorithm, easy to analyze and making a good approximation of real-world approaches [17]. The noisy optimization framework is described in Algorithm 1 and R-EDA is defined in Algorithm 2.…”
Section: Frameworkmentioning
confidence: 99%
“…Evolutionary tuning of reinforcement learning parameters has also been done by several researchers [51]- [55]. Eriksson et al [53] used evolutionary algorithms to tune the meta-parameters in a single-agent reinforcement learning algorithm.…”
Section: Introductionmentioning
confidence: 99%