2011
DOI: 10.1177/0013164410390032

Applying Item Response Theory Methods to Examine the Impact of Different Response Formats

Abstract: In aptitude and achievement tests, different response formats are commonly used. A fundamental distinction can be drawn between the class of multiple-choice formats and the class of constructed-response formats. Previous studies have examined the impact of different response formats using traditional statistical approaches, but these influences can also be studied with item response theory methods that handle incomplete data. Response formats can influence item attributes in two ways: different response formats …
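The abstract refers to item response theory methods that handle incomplete data, as arises when different groups of examinees answer overlapping item sets in different response formats. The following is a minimal sketch, not the authors' implementation, assuming a persons-by-items 0/1 matrix in which NaN marks items a person was never administered; all function and variable names are illustrative. The missing-by-design cells are skipped in the likelihood rather than scored as incorrect.

```python
# Minimal sketch: joint maximum likelihood for a Rasch model on an incomplete
# design. NaN marks items that were not administered (missing by design) and
# such cells are masked out of the likelihood, not scored as incorrect.
import numpy as np

def fit_rasch_incomplete(X, n_iter=500, lr=0.02):
    """Fit a Rasch model to a persons x items matrix X with NaN allowed."""
    n_persons, n_items = X.shape
    theta = np.zeros(n_persons)            # person abilities
    beta = np.zeros(n_items)               # item difficulties
    observed = ~np.isnan(X)                # which cells were actually answered
    X0 = np.nan_to_num(X, nan=0.0)         # placeholder values, masked out below
    for _ in range(n_iter):
        p = 1.0 / (1.0 + np.exp(-(theta[:, None] - beta[None, :])))
        resid = (X0 - p) * observed        # gradient terms only for observed cells
        theta += lr * resid.sum(axis=1)    # gradient ascent on person parameters
        beta -= lr * resid.sum(axis=0)     # gradient ascent on item parameters
        beta -= beta.mean()                # fix the scale (mean item difficulty = 0)
    return theta, beta

# Toy linked design: four anchor items answered by everyone, plus four
# format-specific items per booklet that the other group never sees.
rng = np.random.default_rng(1)
theta_true = rng.normal(size=200)
beta_true = rng.normal(size=12)
p_true = 1.0 / (1.0 + np.exp(-(theta_true[:, None] - beta_true[None, :])))
X = (rng.random((200, 12)) < p_true).astype(float)
X[:100, 8:] = np.nan    # booklet A omits the last four items
X[100:, 4:8] = np.nan   # booklet B omits the middle four items
theta_hat, beta_hat = fit_rasch_incomplete(X)
```

In practice one would rely on an established IRT package and marginal rather than joint estimation; the point of the sketch is only that unadministered items need not be scored as wrong.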

Cited by 40 publications (27 citation statements)
References 23 publications

Citation statements (ordered by relevance):
“…As Table 13 shows, no significant differences were found. Similar to earlier findings by Hohensinn and Kubinger (2011) and In'nami and Koizumi (2009), the CR items seemed slightly more difficult.…”
Section: Discussion (supporting, confidence: 88%)
“…Given this correlation, both the CR and the MC formats appear to measure the same latent trait to a great extent, but the formats may elicit different skills and/or assess skills differently (cf. Hohensinn & Kubinger, 2011; Lissitz & Hou, 2012).…”
Section: Discussion (mentioning, confidence: 99%)
“…Current practices in low-stakes educational large-scale achievement tests involve treating unplanned missing values as incorrect or fractionally correct responses or ignoring them in the scaling (see, e.g., PISA, Adams & Wu, 2002; TIMSS [Third International Mathematics and Science Study], Martin, Gregory, & Stemler, 2000; NAEP [National Assessment of Educational Progress], Allen, Donoghue, & Schoeps, 2001; NEPS [National Educational Panel Study], Pohl & Carstensen, 2012). Research on these types of missing data approaches showed bias in item and person parameter estimates when missing values were scored as incorrect (Culbertson, 2011; De Ayala, Plake, & Impara, 2001; Finch, 2008; Hohensinn & Kubinger, 2011; Holman & Glas, 2005; Pohl, Gräfe, & Rose, 2014; Rose et al., 2010). The method of fractionally correct scoring performed slightly better but also resulted in bias, especially when missing values were MNAR (De Ayala et al., 2001; Finch, 2008).…”
Mentioning (confidence: 99%)
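For readers unfamiliar with the three treatments contrasted in the statement above, the hedged sketch below shows how each one transforms a response matrix before scaling. The fractional credit of 1/k (k = number of answer options) is one common convention and is an assumption here, not necessarily the value used in any of the cited studies; the function name is illustrative.

```python
# Sketch of three treatments of omitted responses in a persons x items matrix,
# where NaN marks an omitted response.
import numpy as np

def score_missing(X, treatment="ignore", n_options=4):
    X = X.astype(float).copy()
    missing = np.isnan(X)
    if treatment == "incorrect":
        X[missing] = 0.0               # omitted response scored as wrong
    elif treatment == "fractional":
        X[missing] = 1.0 / n_options   # expected score under random guessing
    elif treatment == "ignore":
        pass                           # keep NaN; skip these cells in scaling
    return X

X = np.array([[1.0, np.nan, 0.0],
              [np.nan, 1.0, 1.0]])
print(score_missing(X, "incorrect"))
print(score_missing(X, "fractional"))
```

Note that fractional scores require a scaling model that accepts non-binary responses, and ignoring requires an estimator that skips the NaN cells.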
“…The Rasch testlet model lends itself very well to modeling LID due to its common method. Instead of forming group factors based on common passages, we build them based on common test formats (Hohensinn & Kubinger, 2011). That is, each item is forced to load on a target ability dimension and its pertinent method dimension.…”
Mentioning (confidence: 99%)
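As a point of reference for the statement above, one common way to write the Rasch testlet model with testlets defined by response format is given below; the notation here is mine, not necessarily that of the citing paper.

$$
P\bigl(X_{ij} = 1 \mid \theta_i, \gamma_{i f(j)}\bigr) \;=\; \frac{\exp\bigl(\theta_i + \gamma_{i f(j)} - \beta_j\bigr)}{1 + \exp\bigl(\theta_i + \gamma_{i f(j)} - \beta_j\bigr)},
$$

where $\theta_i$ is person $i$'s ability on the target dimension, $\beta_j$ is the difficulty of item $j$, and $\gamma_{i f(j)}$ is a person-specific effect of the response format $f(j)$ (e.g., multiple choice or constructed response) containing item $j$, assumed independent of $\theta_i$. Forcing each item to load on the target dimension plus its format dimension is the structure described in the quoted statement.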