2014
DOI: 10.1016/j.jtbi.2013.08.036
|View full text |Cite
|
Sign up to set email alerts
|

The application of temporal difference learning in optimal diet models

Abstract: This is the unspecified version of the paper.This version of the publication may differ from the final published version. An experience-based aversive learning model of foraging behaviour in uncertain environments is presented. We use Q-learning as a model-free implementation of Temporal Difference learning motivated by growing evidence for neural correlates in natural reinforcement settings. The predator has the choice of including an aposematic prey in its diet or to forage on alternative food sources. We sh… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2015
2015
2020
2020

Publication Types

Select...
2
1

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(1 citation statement)
references
References 47 publications
0
1
0
Order By: Relevance
“…Typically, optimal behaviour has been assumed to result from natural selection of genetically determined behaviour strategies [4], yet in many species behaviour is crucially shaped by individual experiences and learning [5][6][7]. Existing work has considered how learning can optimize single responses [8][9][10][11][12][13] or specific sequences of two or three behaviours [14,15]. However, the question of how, and how much, learning contributes to optimal behaviour is still largely open.…”
Section: Introductionmentioning
confidence: 99%
“…Typically, optimal behaviour has been assumed to result from natural selection of genetically determined behaviour strategies [4], yet in many species behaviour is crucially shaped by individual experiences and learning [5][6][7]. Existing work has considered how learning can optimize single responses [8][9][10][11][12][13] or specific sequences of two or three behaviours [14,15]. However, the question of how, and how much, learning contributes to optimal behaviour is still largely open.…”
Section: Introductionmentioning
confidence: 99%