“…The challenges of determining the construct representation, developing practical, valid, and reliable tests (e.g., Roever, 2008Roever, , 2011Roever, Fraser, & Elder, 2014;Walters, 2007Walters, , 2009, and performing classroom assessment (Ishihara, 2013;Ishihara & Cohen, 2010) have been recently addressed in the literature, but one aspect that has received little attention is rater behaviour (for rare exceptions, see Alemi & Tajeddin, 2013;Brown & Ahn, 2011;Liu, 2007;Liu & Xie, 2014;Roever, 2008;Taguchi, 2006Taguchi, , 2011Tajeddin & Alemi, 2014;Walters, 2007;Youn, 2007).…”