The validity of inferences based on achievement test scores depends on the effort examinees put forth while taking the test. This problem is particularly prevalent with low-stakes tests, creating a need for psychometric models that can account for differing levels of examinee effort. This article introduces the effort-moderated IRT model, which incorporates item response time into both proficiency estimation and item parameter estimation. In two studies of the effort-moderated model in which rapid guessing (i.e., behavior reflecting low examinee effort) was present, one based on real data and the other on simulated data, the effort-moderated model performed better than the standard 3PL model. Specifically, the effort-moderated model (a) showed better model fit, (b) yielded more accurate item parameter estimates, (c) estimated test information more accurately, and (d) yielded proficiency estimates with higher convergent validity.
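A minimal sketch of the idea behind an effort-moderated item response function, under the common formulation in which a response-time threshold separates solution behavior (modeled by the 3PL) from rapid guessing (modeled as chance-level success). The parameter values and the threshold below are illustrative assumptions, not values from the studies described:

```python
import math

def p_3pl(theta, a, b, c):
    """Standard 3PL probability of a correct response."""
    return c + (1 - c) / (1 + math.exp(-a * (theta - b)))

def p_effort_moderated(theta, a, b, c, response_time, rt_threshold, n_options=4):
    """Effort-moderated probability: responses faster than the item's
    response-time threshold are treated as rapid guesses (chance success
    on n_options choices); otherwise the standard 3PL applies."""
    if response_time < rt_threshold:
        return 1.0 / n_options  # rapid guess: proficiency carries no information
    return p_3pl(theta, a, b, c)

# Same examinee and item, answered in 2 seconds vs. 20 seconds
fast = p_effort_moderated(theta=0.5, a=1.2, b=0.0, c=0.2,
                          response_time=2, rt_threshold=5)
slow = p_effort_moderated(theta=0.5, a=1.2, b=0.0, c=0.2,
                          response_time=20, rt_threshold=5)
```

Because rapid-guess responses contribute only a constant chance term, they carry no information about proficiency, which is one way to see why ignoring them (as the standard 3PL does) distorts item parameter and test information estimates.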
Four item response theory (IRT) models were compared using data from tests in which multiple items were grouped into testlets based on a common stimulus. In the bifactor model, each item was treated as a function of a primary trait plus a nuisance trait due to the testlet; in the testlet-effects model, the slopes in the direction of the testlet traits were constrained within each testlet to be proportional to the slope in the direction of the primary trait; in the polytomous model, the item scores were summed into a single score for each testlet; and in the independent-items model, the testlet structure was ignored. With simulated data, the independent-items model somewhat overestimated reliability when the items were not independent within testlets. Under these nonindependent conditions, the independent-items model also yielded greater root mean square error (RMSE) for item difficulty and underestimated the item slopes. When the items within testlets were instead generated to be independent, the bifactor model yielded somewhat higher RMSE in difficulty and slope. Similar differences between the models were illustrated with real data.
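The relationship between the bifactor and testlet-effects parameterizations described above can be sketched in a 2PL-style form: the testlet-effects model is the bifactor model with the testlet slope constrained to be proportional to the primary slope. The function names and parameter values below are illustrative assumptions, not the authors' estimation code:

```python
import math

def p_bifactor(theta_g, theta_t, a_g, a_t, d):
    """Bifactor item response: primary trait theta_g plus a testlet
    (nuisance) trait theta_t, each with its own slope."""
    return 1.0 / (1.0 + math.exp(-(a_g * theta_g + a_t * theta_t + d)))

def p_testlet_effects(theta_g, theta_t, a_g, lam, d):
    """Testlet-effects model: within a testlet the nuisance slope is
    constrained to lam * a_g, i.e., proportional to the primary slope."""
    return p_bifactor(theta_g, theta_t, a_g, lam * a_g, d)

# With the proportionality constant made explicit, the two models agree:
p_constrained = p_testlet_effects(theta_g=0.5, theta_t=0.3, a_g=1.0, lam=0.8, d=-0.2)
p_free = p_bifactor(theta_g=0.5, theta_t=0.3, a_g=1.0, a_t=0.8, d=-0.2)
```

Setting `lam = 0` recovers the independent-items case for that testlet, which is one way to see the models as a nested family.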