2022
DOI: 10.1016/j.knosys.2022.108624
|View full text |Cite
|
Sign up to set email alerts
|

Adaptive evolution strategy with ensemble of mutations for Reinforcement Learning

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
7
2

Relationship

1
8

Authors

Journals

citations
Cited by 17 publications
(4 citation statements)
references
References 18 publications
0
4
0
Order By: Relevance
“…Ajani and Mallipeddi [111] discussed the advantages and disadvantages of using Cauchy or Gaussian distribution for mutation. The authors stated that the Cauchy distribution is preferred when the search is far away from the optimal solution, and Gaussian is better near the global optimum.…”
Section: B Explorationmentioning
confidence: 99%
“…Ajani and Mallipeddi [111] discussed the advantages and disadvantages of using Cauchy or Gaussian distribution for mutation. The authors stated that the Cauchy distribution is preferred when the search is far away from the optimal solution, and Gaussian is better near the global optimum.…”
Section: B Explorationmentioning
confidence: 99%
“…Consequently, in comparison to backpropagationbased algorithms, ES algorithms often necessitate more rollouts and are susceptible to local optima [7]. Previous efforts to enhance the performance of ES algorithms have included adding regularization terms to encourage exploration [8] or employing various noise perturbations [1]. While ES algorithms are generally less sensitive to hyperparameter settings than traditional DRL [30], prior research has shown that hyperparameter fine-tuning can still be effective for them.…”
Section: Related Workmentioning
confidence: 99%
“…Several bio-inspired robotic applications and learning tasks often require the agents to adapt to the uncertainties in their environment. In simulated environments, Deep Reinforcement Learning (DRL) has proven to be successful at learning tasks across a wide range of domains such as games [1], rehabilitation [2], locomotion [3], production optimization [4], control [5][6][7], etc. However, in real-world environments, the deployment of DRL is often limited due to changing environmental conditions, an intrinsic feature mostly associated with real-world applications.…”
Section: Introductionmentioning
confidence: 99%