“…Previous research has shown that test adaptation can result in significant score incomparability (Ercikan, 1998(Ercikan, , 2003Gierl and Khaliq, 2001;Maldonado and Geisinger, 2005;Yildirim and Berberoĝlu, 2009;Ercikan et al, 2010;Oliveri and von Davier, 2011;Wetzel and Carstensen, 2013;Kreiner and Christensen, 2014). Although research has also demonstrated that psychometric differences between language versions may be attributable to multiple factors (Ercikan and McCreith, 2002;Ercikan et al, 2004a;Sireci et al, 2005;Wu and Ercikan, 2006;Elosua and López-Jaúregui, 2007;Solano-Flores et al, 2009;Arffman, 2010), evidence demonstrates that some differences across groups are attributable to a lack of equivalence across language versions due to translation errors (Oliveri and von Davier, 2011;Ercikan and Lyons-Thomas, 2013;Zhao et al, 2018). This lack of equivalence is, in part, due to test translation procedures (Gierl and Khaliq, 2001; Maldonado and Geisinger, Differences in words, expressions, and structure of sentence inherent to a language or culture.…”