“…Yet, even when limiting to the unimodal case of visual stimuli, gaze dynamics has been by and large overlooked in computer vision in spite of the pioneering work of Aloimonos et al ( 1988 ), Ballard ( 1991 ), and Bajcsy and Campos ( 1992 ). The current state of affairs is that effort is mostly spent to model salience (Borji and Itti, 2013 ; Borji, 2021 ) as a tool for predicting where/what to look at (for a critical discussion, see Tatler et al, 2011 ; Le Meur and Liu, 2015 ; Foulsham, 2019 ; Boccignone et al, 2020 ; Zhang et al, 2020 ).…”