This study examined the effectiveness of the three-parameter IRT model in vertically equating five overlapping levels of a mathematics computation test. One to four test levels were administered within intact classrooms to randomly equivalent groups of third- through eighth-grade students. Test characteristic curves were derived for each grade/test level combination. It was generally found that an examinee would receive a higher ability estimate if the test level administered had been calibrated on less able examinees. Practical implications for "out-of-level" and adaptive testing are discussed.

It is often considered desirable to test a student in a given subject matter area periodically throughout his/her formal schooling, and to compare the scores obtained across the various testings. Because knowledge in many subject areas is closely linked to school curricula, standardized achievement tests are usually developed in levels that attempt to mirror "typical" curriculum placement of different aspects of a subject area. This usually results in a standardized test battery with levels corresponding, at least roughly, to grades in school. In order to compare test scores across these levels, a scale must be developed that allows comparisons of raw scores obtained on tests differing in content and difficulty.
This is the problem that vertical equating attempts to solve: how to develop a score scale across test levels which (1) differ in difficulty and (2) are designed for groups of examinees who differ in average ability level. This study was designed to examine the effectiveness of the three-parameter item response theory (IRT) model in vertically equating the mathematics computation test of the Iowa Tests of Basic Skills (Hieronymus, Lindquist, & Hoover, 1977).

IRT methods are frequently suggested as the preferred vertical equating approach for two reasons: (1) it is recognized that problems exist with the classical test theory methods (see, e.g., Lord, 197?; Lord & Wingersky, 1984), and (2) IRT methods are usually conceived of as having "person-free" calibration and "item-free" measurement. These properties imply that the item parameters which are estimated are invariant for all subgroups of examinees, and that, once the items are calibrated, the same θ estimate would be obtained (except for errors of measurement) for an individual regardless of the subset of items he/she was administered. These properties, if they held, would essentially solve the problem of vertical equating.

The two IRT models that have been most prominent in the vertical equating literature are the one-parameter (Rasch) model and the three-parameter model (see, e.g., Hambleton & Swaminathan, 1984). Although the Rasch model possesses certain desirable properties, such as simplicity and a monotonic relationship between raw score and estimated examinee ability, there are indications that the model does not perform well in practice in vertical equating.
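For readers unfamiliar with the models being compared, the standard form of the three-parameter logistic (3PL) model gives the probability of a correct response to item $i$ as a function of examinee ability $\theta$. The formula below is the conventional parameterization from the IRT literature, not one quoted from this article:

```latex
P_i(\theta) = c_i + (1 - c_i)\,\frac{1}{1 + e^{-D a_i(\theta - b_i)}}
```

Here $a_i$ is the item discrimination, $b_i$ the item difficulty, $c_i$ the lower asymptote ("pseudo-guessing") parameter, and $D \approx 1.7$ a scaling constant. Constraining all $a_i$ to a common value and setting $c_i = 0$ reduces this to the one-parameter (Rasch) model discussed above, which is what yields its monotonic relationship between raw score and estimated ability.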