2019
DOI: 10.48550/arxiv.1903.06754
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Atari-HEAD: Atari Human Eye-Tracking and Demonstration Dataset

Abstract: Large-scale public datasets have been shown to benefit research in multiple areas of modern artificial intelligence. For decision-making research that requires human data, highquality datasets serve as important benchmarks to facilitate the development of new methods by providing a common reproducible standard. Many human decision-making tasks require visual attention to obtain high levels of performance. Therefore, measuring eye movements can provide a rich source of information about the strategies that huma… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
21
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 9 publications
(22 citation statements)
references
References 38 publications
1
21
0
Order By: Relevance
“…Average return represents the long-term rewards of two policies, while behavior matching characterizes the behavioural similarity of two policies. We note that similar metrics are also used for attention-guided learning in recent work (Zhang et al, 2019).…”
Section: Evaluationsmentioning
confidence: 99%
“…Average return represents the long-term rewards of two policies, while behavior matching characterizes the behavioural similarity of two policies. We note that similar metrics are also used for attention-guided learning in recent work (Zhang et al, 2019).…”
Section: Evaluationsmentioning
confidence: 99%
“…In addition, if we gather BC data from human players, the recorded actions are subject to delays from human-reflexes: if something surprising happens in an image, average humans react to this with a split-second delay. This action was supposed to be associated with the surprising event, but instead it will be recorded few frames later, associated with possibly a wrong observation and leading to state-action mismatch [14].…”
Section: B Challenges Of End-to-end Control Of Video Gamesmentioning
confidence: 99%
“…[15], and "how the state-action mismatch from human reflexes affects the results?" [14]. The former sheds light on if we should gather data from only few experts, or should we use data from many different players.…”
Section: Research Questions and Experimental Setupmentioning
confidence: 99%
See 1 more Smart Citation
“…Learning an attention mechanism that limits these choices could potentially result in much better performance. Humans have a remarkable ability to selectively pay attention to certain parts of the visual input (Judd et al, 2009;Borji et al, 2012), gathering relevant information, and sequentially combining their observations to build representations across different timescales (Hayhoe and Ballard, 2005;Zhang et al, 2019), which plays an important role in guiding further perception and action (Nobre and Stokes, 2019;Badman et al, 2020). In this paper, we explore ideas for endowing reinforcement learning (RL) agents with these type of capabilities.…”
Section: Introductionmentioning
confidence: 99%