Proceedings of the 7th Annual Workshop on Genetic and Evolutionary Computation 2005
DOI: 10.1145/1102256.1102280
|View full text |Cite
|
Sign up to set email alerts
|

An autonomous explore/exploit strategy

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
7
0

Year Published

2007
2007
2012
2012

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 10 publications
(7 citation statements)
references
References 4 publications
0
7
0
Order By: Relevance
“…Similar to affect is the notion of comfort or safety, which has also been proposed to influence exploration behavior in robots (Likhachev & Arkin, 2000). Affect has been used in evolutionary algorithms to develop exploration/exploitation strategies in dynamic choice trials (McMahon, Scott, Baxter, & Browne, 2006), and affect has been embedded into the reinforcement-learning algorithm where reward is based on the happiness and sadness of the agent (Salichs & Malfaz, 2006).…”
Section: Related Workmentioning
confidence: 99%
“…Similar to affect is the notion of comfort or safety, which has also been proposed to influence exploration behavior in robots (Likhachev & Arkin, 2000). Affect has been used in evolutionary algorithms to develop exploration/exploitation strategies in dynamic choice trials (McMahon, Scott, Baxter, & Browne, 2006), and affect has been embedded into the reinforcement-learning algorithm where reward is based on the happiness and sadness of the agent (Salichs & Malfaz, 2006).…”
Section: Related Workmentioning
confidence: 99%
“…Strongly related to our approach to affect-modulated exploration is the research by McMahon, Scott, Baxter, and Browne (2006). The authors show how the discrete choice between exploration and exploitation trials can be controlled by a probability value that is derived from measures inspired by affect.…”
Section: Related Workmentioning
confidence: 99%
“…Selection mechanisms like -greedy have been applied to classifier systems to manage the exploration/exploitation trade-off (e.g. [11,14,28]). However, such mechanisms are typically used, as in TD methods, to select among individual actions, not to allocate evaluations among an entire population.…”
Section: Related and Future Workmentioning
confidence: 99%