“…By comparison, SIBTEST is particularly advantageous in situations when examinees' prior knowledge of content (impact) is present in the data (Klockars & Lee, 2008), and its statistics could be compared against the well-established criteria of DIF classification. Moreover, SIBTEST has been found capable for adaptations to the multilevel data structures (French & Finch, 2015), and its DIF statistics have been found robust, compared with other nonparametric DIF detection procedures, when sample sizes for reference and focal groups are small (Klockars & Lee, 2008;Roussos & Stout, 1996b). Nevertheless, both EIRM and SIBTEST could account for differences in ability between the focal and reference groups, have a well-established statistical foundation, they are robust to different sample sizes, and could be used to evaluate items and item-bundles (Briggs, 2008;De Boeck et al, 2011;French & Finch, 2015;Lee, Cohen, & Toro, 2009;Lei & Li, 2013;Roussos & Stout, 1996a).…”