2009
DOI: 10.1007/s00199-009-0446-0
|View full text |Cite
|
Sign up to set email alerts
|

Rage against the machines: how subjects play against learning algorithms

Abstract: We use a large-scale internet experiment to explore how subjects learn to play against computers that are programmed to follow one of a number of standard learning algorithms. The learning theories are (unbeknown to subjects) a best response process, fictitious play, imitation, reinforcement learning, and a trial & error process. We explore how subjects' performances depend on their opponents' learning algorithm. Furthermore, we test whether subjects try to influence those algorithms to their advantage in a fo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
21
0

Year Published

2009
2009
2024
2024

Publication Types

Select...
8
1

Relationship

1
8

Authors

Journals

citations
Cited by 37 publications
(21 citation statements)
references
References 24 publications
0
21
0
Order By: Relevance
“…However, this also seems to be driven by incentives as variances in lab-np and lab-f are not significantly different from those in 7 See our companion paper for details about the learning processes (Duersch et al 2009). 8 In a different context, Shavit et al (2001) find that bids in a lottery evaluation task are significantly higher on the internet than in a classroom experiment.…”
Section: Confirming Findings By Anderhub Et Al (2001) and Shavit Et mentioning
confidence: 95%
“…However, this also seems to be driven by incentives as variances in lab-np and lab-f are not significantly different from those in 7 See our companion paper for details about the learning processes (Duersch et al 2009). 8 In a different context, Shavit et al (2001) find that bids in a lottery evaluation task are significantly higher on the internet than in a classroom experiment.…”
Section: Confirming Findings By Anderhub Et Al (2001) and Shavit Et mentioning
confidence: 95%
“…To gain some more confidence that our Learning Condition really accurately picks up learning dynamics - 14 The typical argument claims that, since a rational selfish player chooses the lowest possible contribution (i.e. zero), any confusion would lead to a positive and therefore higher contribution.…”
Section: Learning Versus Conditional Cooperationmentioning
confidence: 99%
“…Hence, subjects' choice of the dominated action Soft and in alternation with Tough, is a strong indication that they are incurring costs to teach the other to take turns. 2 1 Duersch et al (2010) study how subjects learn to play against computers that are programmed to follow one of a number of standard learning algorithms. They find that teaching occurs frequently and that all learning algorithms are subject to exploitation with the notable exception of imitation.…”
mentioning
confidence: 99%