“…A great deal of this work has focused on accounting for systematic variability in rater scoring. Research on rater training has so far suggested that training is useful in increasing rater consistency (Jang, Wagner & Park, 2014;McNamara, 1996;Plakans & Gebril, 2013 ;Weigle, 1999Weigle, , 2002Weir, 2005), but there continues to be unexplained variability that resists training (Crossley, Kyle & MacNamara, 2016;Hoyt & Kerns, 1999;Plakans & Gebril, 2013;Shrestha & Coffin, 2012).…”