2020
DOI: 10.35542/osf.io/m379h
Preprint

Parameter Estimation Accuracy of the Effort-Moderated IRT Model Under Multiple Assumption Violations

Abstract: As low-stakes testing contexts increase, low test-taking effort may serve as a serious validity threat. One common solution to this problem is to identify noneffortful responses and treat them as missing during parameter estimation via the Effort-Moderated IRT (EM-IRT) model. Although this model has been shown to outperform traditional IRT models (e.g., 2PL) in parameter estimation under simulated conditions, prior research has failed to examine its performance under violations to the model’s assumptions. Ther…
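The idea of treating noneffortful responses as missing can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' implementation: the response-time threshold rule, the function names (`flag_effortful`, `em_irt_log_likelihood`), and all data values are hypothetical. It assumes a response counts as effortful when its response time meets a per-item threshold, and that effortful responses follow a 2PL model.

```python
import math


def flag_effortful(response_times, thresholds):
    """Return True where the response time meets the item's threshold.

    Assumption: a simple per-item time threshold separates rapid guesses
    from effortful responses; the threshold values are hypothetical.
    """
    return [rt >= th for rt, th in zip(response_times, thresholds)]


def p_correct_2pl(theta, a, b):
    """2PL probability of a correct response (discrimination a, difficulty b)."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))


def em_irt_log_likelihood(theta, responses, effortful, a_params, b_params):
    """Log-likelihood of one examinee's responses at ability theta.

    Responses flagged as noneffortful are skipped, i.e. treated as missing,
    so they contribute nothing to the likelihood.
    """
    total = 0.0
    for x, eff, a, b in zip(responses, effortful, a_params, b_params):
        if not eff:
            continue  # noneffortful response: treated as missing
        p = p_correct_2pl(theta, a, b)
        total += math.log(p) if x == 1 else math.log(1.0 - p)
    return total


# Illustrative (made-up) data for one examinee on four items.
responses = [1, 0, 1, 0]
response_times = [12.0, 1.5, 20.0, 9.0]   # seconds
thresholds = [3.0, 3.0, 3.0, 3.0]         # hypothetical cutoffs
a_params = [1.0, 1.2, 0.8, 1.5]
b_params = [0.0, -0.5, 0.5, 1.0]

effortful = flag_effortful(response_times, thresholds)
log_lik = em_irt_log_likelihood(0.0, responses, effortful, a_params, b_params)
print(effortful, log_lik)
```

In this sketch the second response falls below the threshold, so changing its scored value leaves the log-likelihood unchanged. That is the sense in which the flagged response is "missing" for estimation purposes.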

Cited by 4 publications (14 citation statements)
“…RG has been shown to bias measurement properties, such as reliability estimates (e.g., Wise & DeMars, 2009), measurement invariance (e.g., DeMars & Wise, 2010), linking coefficients (e.g., Mittelhaëuser et al., 2015), and item and person parameter estimates (e.g., Rios & Soland, 2020; van Barneveld, 2007). Furthermore, as RG is generally associated with underestimation of examinee ability (Silm et al., 2020), it has been documented to bias treatment effects (e.g., Osborne & Blanchard, 2011; Liu et al., 2015), achievement gains (e.g., Wise & DeMars, 2010), value-added estimates of teacher effectiveness (Wise et al., 2013), and subgroup comparisons (e.g., Debeer et al., 2014).…”
Section: Presence Of Rapid Guessing Misclassifications
Citation type: mentioning
confidence: 99%
“…Concerning the former assumption, prior research has suggested that it may be untenable in practice given that RG has been found to be associated with item position, length, difficulty, and depth of knowledge required (e.g., Wise, 2020). However, recent evidence indicates that aggregate-level ability estimates may be largely robust to such a violation (Rios & Soland, 2020). In terms of the latter assumption, aggregate-level ability estimates will be biased either positively or negatively if noneffortful responders' underlying ability is consistently higher or lower than the average ability of effortful responders.…”
Section: Scoring Approach Incorporating Response Times
Citation type: mentioning
confidence: 99%