2011
DOI: 10.1080/08957347.2011.607063

Do Different Approaches to Examining Construct Comparability in Multilanguage Assessments Lead to Similar Conclusions?

Abstract: In this study, we examine the degree of construct comparability and possible sources of incomparability of the English and French versions of the Programme for International Student Assessment (PISA) 2003 problem-solving measure administered in Canada. Several approaches were used to examine construct comparability at the test level (examination of test data structure, reliability comparisons, and test characteristic curves) and the item level (differential item functioning, item parameter correlations, and linguistic c…
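
The test characteristic curve comparison named in the abstract can be made concrete with the standard IRT quantities involved. Below is a generic sketch under a two-parameter logistic model; this is a textbook formulation, not necessarily the exact model calibrated in the paper.

```latex
% Item response function under a generic 2PL model (assumption: the
% paper's actual calibration model is not shown in the excerpt above)
P_i(\theta) = \frac{1}{1 + \exp\!\left[-a_i(\theta - b_i)\right]}
% Test characteristic curve: expected number-correct score at ability \theta
T(\theta) = \sum_{i=1}^{n} P_i(\theta)
% Test-level comparability check: estimate T^{E}(\theta) and T^{F}(\theta)
% from the English and French calibrations and compare the two curves.
```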

Cited by 36 publications (24 citation statements, published 2013–2024)
References 27 publications

“…It might also contribute to challenges in identifying sources of DIF documented in previous research (Ercikan et al., 2010; Oliveri & Ercikan, 2011). Multiple studies have been conducted with gender, ethnic, and language comparison groups to improve DIF detection and the identification of its sources by creating more homogeneous groups in measurement comparability research.…”
mentioning, confidence: 95%

“…The identification of differential item functioning (DIF) may signal that an item is measuring construct-irrelevant factors such as differential familiarity with item types, formats, or vocabulary knowledge for one or more of the comparison groups (Ercikan, Gierl, McCreith, Puhan, & Koh, 2004; Ercikan & Lyons-Thomas, 2013; Oliveri & Ercikan, 2011). Accurate DIF detection is central to making claims regarding whether an item should be used in an assessment or whether modification is required in order to reduce or eliminate construct-irrelevant variance across comparison groups.…”
mentioning, confidence: 98%

“…Items are typically flagged for DIF if response probabilities for examinees at the same ability levels depend on group membership. As different methods for identifying DIF may not give identical results, the use of more than one method is recommended, to allow for the corroboration of DIF status for the items analyzed (Ercikan and McCreith, 2002; Oliveri and Ercikan, 2011). In this research, an IRT-based approach and logistic/ordinal logistic regression approaches were used.…”
Section: Differential Item Functioning Analysis
mentioning, confidence: 99%
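
The excerpt above names the two method families used to corroborate DIF status. As a concrete illustration of the logistic-regression family (the nested-model scheme of Swaminathan and Rogers, 1990), here is a minimal sketch in Python; the simulated responses and the column names total, group, and item are hypothetical and are not drawn from the paper's PISA data.

```python
# Hedged sketch: logistic-regression DIF screening via nested models.
# Assumptions (not from the paper): a 0/1 item response, a matching
# score as the ability proxy (in practice, a rest score excluding the
# studied item), and a 0/1 language-group indicator.
import numpy as np
import pandas as pd
import statsmodels.formula.api as smf
from scipy import stats

rng = np.random.default_rng(0)
n = 1000
df = pd.DataFrame({
    "total": rng.normal(0.0, 1.0, n),   # standardized matching score
    "group": rng.integers(0, 2, n),     # 0/1 language group (illustrative)
})
# Simulate a uniform-DIF item: same slope, group shifts difficulty.
eta = 0.8 * df["total"] - 0.5 * df["group"]
df["item"] = rng.binomial(1, 1.0 / (1.0 + np.exp(-eta)))

# Nested logits: ability only; + group (uniform DIF);
# + interaction (non-uniform DIF).
m1 = smf.logit("item ~ total", df).fit(disp=0)
m2 = smf.logit("item ~ total + group", df).fit(disp=0)
m3 = smf.logit("item ~ total * group", df).fit(disp=0)

# Likelihood-ratio tests between adjacent models flag each DIF type.
lr_u = 2 * (m2.llf - m1.llf)  # uniform DIF
lr_n = 2 * (m3.llf - m2.llf)  # non-uniform DIF
print(f"uniform DIF:     LR = {lr_u:.2f}, p = {stats.chi2.sf(lr_u, 1):.4f}")
print(f"non-uniform DIF: LR = {lr_n:.2f}, p = {stats.chi2.sf(lr_n, 1):.4f}")
```

Items flagged this way would then be cross-checked against the IRT-based results and inspected substantively (e.g., linguistic review), which is how the corroboration across methods described in the excerpt typically proceeds.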
“…It is incumbent on countries with multiple official languages to take reasonable steps to ensure that linguistic groups are given the opportunity to perceive and respond to tests in the same way (Fairbairn and Fox, 2009; Rogers et al., 2010; Marotta et al., 2015). Yet, research conducted in Canada comparing the French and English versions of LSAs has found that 18–60% of items function differentially for the two groups (Gierl et al., 1999; Gierl, 2000; Ercikan and McCreith, 2002; Ercikan et al., 2004b; Oliveri and Ercikan, 2011; Marotta et al., 2015).…”
mentioning, confidence: 99%

“…To illustrate, reading competencies may develop at different speeds and in different ways across languages with differing alphabets (Spielberger, Moscoso, & Brunner, 2005). Additional factors that might lead to the development of tests with limited comparability include natural variation in difficulty, commonality, or contextual meaning of vocabulary and differential sentence length or complexity (Hambleton, Merenda, & Spielberger, 2005; Oliveri & Ercikan, 2011; Solano-Flores, Backhoff, & Contreras-Niño, 2009). Several guidelines have been developed to address comparability issues arising in test adaptation.…”
mentioning, confidence: 98%