Human Gaze Assisted Artificial Intelligence: A Review

Zhang, Ruohan; Saran, Akanksha; Liu, Bo; Zhu, Yifeng; Guo, Sihang; Niekum, Scott; Ballard, Dana H.; Hayhoe, Mary

doi:10.24963/ijcai.2020/689

Cited by 33 publications

(20 citation statements)

References 23 publications


“…The computational efficiency of the method shows promise for application in robotics, markedly in social robotics, where active vision plays an important role and where social robot's sensitivity to environmental information and the ability to localize the people around itself is crucial (Admoni and Scassellati, 2017; Wiese et al, 2017; Zhang et al, 2020). Social robots need to gather information about their human fellows to facilitate mutual understanding and coordination (Zhang et al, 2020). Designing robot gaze itself is challenging and difficult to standardize due to the variations in physical robots and human participants, while burdened with architectural constraints.…”

Section: Discussion (mentioning)

confidence: 99%

“…Designing robot gaze itself is challenging and difficult to standardize due to the variations in physical robots and human participants, while burdened with architectural constraints. Early research efforts (Breazeal et al, 2001) relied on simple saliency-based schemes (Itti et al, 1998) inherited from computer vision (Shic and Scassellati, 2007; Ferreira and Dias, 2014); in the last decade these have been reshaped in the form of deep neural nets, such as convolutional networks (Zhang et al, 2020). Yet, the aptness of accounting for task, value and context in the visuo-motor loop is crucial.…”

Section: Discussion (mentioning)

confidence: 99%

“…In our setting, no specific external task or goal is given (free-viewing condition). Then, if the ultimate objective of an active perceiver is total reward maximization (Zhang et al, 2020), reward can be related to the “internal” value (Berridge and Robinson, 2003). The latter has different psychological facets including affect (implicit “liking” and conscious pleasure) and motivation (implicit incentive salience, “wanting”).…”

Section: Methods (mentioning)

confidence: 99%

“…Yet, even when limiting to the unimodal case of visual stimuli, gaze dynamics has been by and large overlooked in computer vision in spite of the pioneering work of Aloimonos et al (1988), Ballard (1991), and Bajcsy and Campos (1992). The current state of affairs is that effort is mostly spent to model salience (Borji and Itti, 2013; Borji, 2021) as a tool for predicting where/what to look at (for a critical discussion, see Tatler et al, 2011; Le Meur and Liu, 2015; Foulsham, 2019; Boccignone et al, 2020; Zhang et al, 2020).…”

Section: Introduction (mentioning)

confidence: 99%


Gazing at Social Interactions Between Foraging and Decision Theory

D’Amelio, Boccignone, 2021, Front. Neurorobot.


Finding the underlying principles of social attention in humans seems to be essential for the design of the interaction between natural and artificial agents. Here, we focus on the computational modeling of gaze dynamics as exhibited by humans when perceiving socially relevant multimodal information. The audio-visual landscape of social interactions is distilled into a number of multimodal patches that convey different social value, and we work under the general frame of foraging as a tradeoff between local patch exploitation and landscape exploration. We show that the spatio-temporal dynamics of gaze shifts can be parsimoniously described by Langevin-type stochastic differential equations triggering a decision equation over time. In particular, value-based patch choice and handling is reduced to a simple multi-alternative perceptual decision making that relies on a race-to-threshold between independent continuous-time perceptual evidence integrators, each integrator being associated with a patch.
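The race-to-threshold idea mentioned in this abstract can be illustrated with a minimal sketch. This is not the authors' model: it assumes each patch has a constant drift rate standing in for its value, uses plain Euler-Maruyama integration of independent diffusion integrators, and all names and parameter values (`race_to_threshold`, `threshold`, `noise`, `dt`) are hypothetical choices made for illustration only.

```python
import math
import random


def race_to_threshold(values, threshold=1.0, noise=0.3, dt=0.01,
                      max_steps=100_000, seed=0):
    """Simulate independent evidence integrators racing to a threshold.

    values: one drift rate per patch (assumed stand-in for patch value).
    Returns (winner_index, decision_time), or (None, elapsed_time) if no
    integrator reaches the threshold within max_steps.
    """
    rng = random.Random(seed)
    evidence = [0.0] * len(values)
    sqrt_dt = math.sqrt(dt)
    for step in range(1, max_steps + 1):
        # Euler-Maruyama update: deterministic drift plus Gaussian noise.
        for i, drift in enumerate(values):
            evidence[i] += drift * dt + noise * sqrt_dt * rng.gauss(0.0, 1.0)
        # The first integrator to cross the threshold determines the choice.
        leader = max(range(len(values)), key=lambda i: evidence[i])
        if evidence[leader] >= threshold:
            return leader, step * dt
    return None, max_steps * dt


if __name__ == "__main__":
    winner, t = race_to_threshold([0.5, 2.0, 1.0])
    print(f"chosen patch: {winner}, decision time: {t:.2f}")
```

Under these assumptions, patches with larger drift tend to win the race more often and sooner, while the noise term leaves room for occasional choices of lower-valued patches.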


“…This allows humans to move their foveae to the right place at the right time, in order to attend to important visual features (Diaz et al 2013). Therefore, human expert's gaze serves as a good standard in many vision-related tasks for evaluating machine attention, or as learning target for training machine attention (Qiuxia et al 2020; Zhang et al 2020a). This approach is widely used in the computer vision research, see Nguyen et al (Nguyen, Zhao, and Yan 2018) for a review.…”

Section: Related Work (mentioning)

confidence: 99%

Leveraging Human Guidance for Deep Reinforcement Learning Tasks

Zhang, Torabi, Guan, et al., 2019, Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence

Self Cite


Reinforcement learning agents can learn to solve sequential decision tasks by interacting with the environment. Human knowledge of how to solve these tasks can be incorporated using imitation learning, where the agent learns to imitate human demonstrated decisions. However, human guidance is not limited to the demonstrations. Other types of guidance could be more suitable for certain tasks and require less human effort. This survey provides a high-level overview of five recent learning frameworks that primarily rely on human guidance other than conventional, step-by-step action demonstrations. We review the motivation, assumption, and implementation of each framework. We then discuss possible future research directions.


Interactive Reinforcement Learning for Autonomous Behavior Design

Cruz, Igarashi, 2021, Human–Computer Interaction Series
