“…They studied the reliability of experienced oculomotor researchers at a single point in time, without any rater training or consensus discussions. There is a very large literature on the beneficial effects of rater training (Abaza & Ross, 2009;Abbo, Okello, & Nakku, 2013;Alcott, Swann, & Grafham, 1999;Angkaw, Tran, & Haaga, 2006;Bank et al, 2002;Buijze, Guitton, van Dijk, Ring, & the Science of Variation Group, 2012;Chan & Yiu, 2002;Chapman et al, 2016;Cusick, Vasquez, Knowles, & Wallen, 2005;Degenhardt, Snider, Snider, & Johnson, 2005;Haj-Ali & Feil, 2006;Istriana et al, 2013;Iwarsson & Reinholt Petersen, 2012;Lievens, 2001;Lou et al, 2014;Lundh, Kowalski, Sundberg, & Landen, 2012;Magnan & Maklebust, 2009;Mist, Ritenbaugh, & Aickin, 2009;Rosen et al, 2008;Sattler, McKnight, Naney, & Mathis, 2015;Schredl, Burchert, & Gabatin, 2004;Solah et al, 2015;Staelens et al, 2014;Store-Valen et al, 2015;Taninishi et al, 2016). There is also a literature on the beneficial effects of having consensus discussions and employing consensus guidelines (Beerbaum et al, 2009;Degenhardt et al, 2005;Foppen, van der Schaaf, Beek, Verkooijen, & Fischer, 2016;Iwarsson & Reinholt Petersen, 2012;Meade et al, 2000;Weinstock et al, 2001).…”