2018
DOI: 10.3758/s13428-018-1144-2

1D CNN with BLSTM for automated classification of fixations, saccades, and smooth pursuits

Abstract: Deep learning approaches have achieved breakthrough performance in various domains. However, the segmentation of raw eye-movement data into discrete events is still done predominantly either by hand or by algorithms that use handpicked parameters and thresholds. We propose and make publicly available a small 1D-CNN in conjunction with a bidirectional long short-term memory network that classifies gaze samples as fixations, saccades, smooth pursuit, or noise, simultaneously assigning labels in windows of up to …
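As a concrete illustration of the architecture the abstract describes, here is a minimal sketch of a small 1D CNN feeding a bidirectional LSTM that labels every gaze sample in a window. It assumes PyTorch; the layer counts, channel widths, window length, and (x, y) feature set are illustrative placeholders rather than the published configuration.

```python
import torch
import torch.nn as nn

class CNNBLSTM(nn.Module):
    """Per-sample gaze-event classifier: 1D convolutions extract local
    temporal features, a bidirectional LSTM adds context from both
    directions, and a linear head emits one label per time step."""

    def __init__(self, n_features=2, n_classes=4, hidden=64):
        super().__init__()
        # Temporal feature extractor over the raw gaze signal.
        self.conv = nn.Sequential(
            nn.Conv1d(n_features, 32, kernel_size=3, padding=1),
            nn.ReLU(),
            nn.Conv1d(32, 32, kernel_size=3, padding=1),
            nn.ReLU(),
        )
        self.blstm = nn.LSTM(32, hidden, batch_first=True, bidirectional=True)
        self.head = nn.Linear(2 * hidden, n_classes)

    def forward(self, x):
        # x: (batch, time, features); Conv1d expects (batch, channels, time).
        h = self.conv(x.transpose(1, 2)).transpose(1, 2)
        h, _ = self.blstm(h)
        return self.head(h)  # (batch, time, n_classes) logits

# Example: 8 one-second windows at 250 Hz with (x, y) gaze coordinates.
model = CNNBLSTM()
logits = model(torch.randn(8, 250, 2))
labels = logits.argmax(dim=-1)  # e.g., 0=fixation, 1=saccade, 2=pursuit, 3=noise
```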

Cited by 82 publications (81 citation statements)
References 32 publications

“…The proposed algorithm is rule-based and hence can be applied to data without prior training, apart from the adaptive estimation of velocity thresholds. This aspect distinguishes it from other recent developments based on deep neural networks (Startsev et al., 2018) and machine learning in general (Zemblys et al., 2018). Some statistical learning algorithms require (labeled) training data, which can be a limitation in the context of a research study.…”
Section: Results (mentioning)
confidence: 99%
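The adaptive velocity-threshold estimation this excerpt mentions is commonly implemented as an iterative, data-driven cutoff. The sketch below shows one such scheme, iterating mean + k·std of the sub-threshold samples until it stabilizes; the function name, initial cutoff, multiplier, and tolerance are assumptions for illustration, not values from the cited papers.

```python
import numpy as np

def adaptive_velocity_threshold(velocity, init=100.0, k=6.0, tol=1.0, max_iter=100):
    """Iteratively estimate a saccade velocity threshold from the data.

    Starts from a deliberately high cutoff, treats everything below it as
    non-saccadic, and re-estimates the cutoff as mean + k * std of those
    sub-threshold velocities until the change falls below `tol`.
    Units follow the input (e.g., deg/s). All defaults are illustrative.
    """
    threshold = init
    for _ in range(max_iter):
        below = velocity[velocity < threshold]
        new_threshold = below.mean() + k * below.std()
        if abs(new_threshold - threshold) < tol:
            return new_threshold
        threshold = new_threshold
    return threshold

# Example on synthetic speeds: many slow samples plus a few saccades.
rng = np.random.default_rng(0)
v = np.concatenate([rng.gamma(2.0, 5.0, 5000), rng.uniform(200, 400, 50)])
print(adaptive_velocity_threshold(v))
```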
“…The validation analyses presented here are based on three different datasets: a manually annotated dataset (Andersson et al., 2017) and two datasets with prolonged recordings using movie stimuli (Hanke et al., 2016). Beyond our own validation, a recent evaluation of nine different smooth pursuit algorithms by Startsev, Agtzidis, and Dorr as part of their recent paper (Startsev et al., 2018) also provides metrics for REMoDNaV. In their analysis, algorithm performance was evaluated against a partially hand-labelled eye movement annotation of the Hollywood2 dataset (Mathe and Sminchisescu, 2012).…”
Section: Results (mentioning)
confidence: 99%
“…It was modified in recent works: in Zemblys et al. (2019), the events that have the largest intersection are matched (rather than the temporally first intersecting event being treated as a match, as in the original matching scheme of I. T. C. Hooge et al., 2018), and the event-level Cohen's kappa scores are computed accordingly. In Startsev, Agtzidis, and Dorr (2019), a threshold for the “quality” of the intersection was recommended, which results in no more than one potential match for each of the “true” episodes. In Startsev, Göb, and Dorr (2019) we additionally proposed a new event-level Cohen's kappa-based statistic, which we developed after analyzing the evaluation strategies in the literature in the context of eye movement classification baselines.…”
Section: Sample- and Event-level Evaluation (mentioning)
confidence: 99%
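The largest-intersection matching this excerpt describes can be made concrete in a few lines. A minimal sketch, assuming events are (start, end, label) tuples in sample indices with exclusive ends; the function name and data layout are illustrative, not the cited authors' reference implementation.

```python
def largest_intersection_match(true_events, detected_events):
    """Match each ground-truth episode to the detected episode sharing the
    largest temporal intersection with it (rather than the first one that
    intersects). Returns (true_label, matched_label_or_None) pairs, from
    which an event-level confusion matrix or Cohen's kappa can be computed.
    """
    pairs = []
    for t_start, t_end, t_label in true_events:
        best_label, best_overlap = None, 0
        for d_start, d_end, d_label in detected_events:
            overlap = min(t_end, d_end) - max(t_start, d_start)
            if overlap > best_overlap:
                best_label, best_overlap = d_label, overlap
        pairs.append((t_label, best_label))
    return pairs

# Example: a fixation and a saccade versus a detector's episodes.
truth = [(0, 100, "FIX"), (100, 130, "SAC")]
detections = [(0, 95, "FIX"), (95, 135, "SAC")]
print(largest_intersection_match(truth, detections))
# [('FIX', 'FIX'), ('SAC', 'SAC')]
```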
“…To put the performance of our detector in context, we compare it with three other methods that detect SP: the algorithms of Berg et al. (2009, implemented in Walther & Koch, 2006) and Larsson et al. (2015, reimplemented by our group and available for download on the data repository page), as well as I-VMP (San Agustin, 2010, implemented by Komogortsev, 2014). I-VMP, among others, was optimized in Startsev, Agtzidis, and Dorr (2019) via an exhaustive grid search of its parameters in order to deliver optimal performance on the full GazeCom data set, so its results represent an optimistic scenario. These three models (plus the approach described here) were the best non-deep-learning detectors tested in Startsev, Agtzidis, and Dorr (2019), when ranked by the average per-class sample- and event-level F1 scores.…”
Section: Algorithm Evaluation (mentioning)
confidence: 99%
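The ranking criterion named in this excerpt, the average of per-class F1 scores, is simple to reproduce at the sample level. A minimal sketch assuming scikit-learn and per-sample label sequences; the class names are placeholders, and an event-level variant would score matched episodes instead of raw samples.

```python
from sklearn.metrics import f1_score

def mean_per_class_f1(y_true, y_pred, classes=("FIX", "SAC", "SP")):
    """Average of per-class F1 scores (macro-F1 over the listed classes),
    computed from one label per gaze sample."""
    return f1_score(y_true, y_pred, labels=list(classes), average="macro")

# Example with per-sample labels:
truth = ["FIX"] * 8 + ["SAC"] * 2 + ["SP"] * 5
preds = ["FIX"] * 7 + ["SAC"] * 3 + ["SP"] * 5
print(round(mean_per_class_f1(truth, preds), 3))  # 0.911
```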