1985
DOI: 10.1177/014662168500900105
|View full text |Cite
|
Sign up to set email alerts
|

A Comparison of Several Goodness-of-Fit Statistics

Abstract: A study was conducted to evaluate four goodnessof-fit procedures using data simulation techniques. The procedures were evaluated using data generated according to three different item response theory models and a factor analytic model. Three different distributions of ability were used, as were three different sample sizes. It was concluded that the likelihood ratio chi-square procedure yielded the fewest erroneous rejections of the hypothesis of fit, whereas Bock's chisquare procedure yielded the fewest erron… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

3
104
1
1

Year Published

1986
1986
2018
2018

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 114 publications
(109 citation statements)
references
References 7 publications
3
104
1
1
Order By: Relevance
“…A number of studies have compared the 3PL model with 2PL and/or 1PL models in terms of model-data fit (Hambleton & Murray, 1983;McKinley & Mills, 1985;Swaminathan & Gifford, 1979;Yen, 1981). In general, results of these studies suggest that the 3PL model will provide better fit at the item level than the 2PL or 1PL models, unless the data are simulated to fit these latter models.…”
Section: Description Of Toefl Equating Designmentioning
confidence: 99%
See 1 more Smart Citation
“…A number of studies have compared the 3PL model with 2PL and/or 1PL models in terms of model-data fit (Hambleton & Murray, 1983;McKinley & Mills, 1985;Swaminathan & Gifford, 1979;Yen, 1981). In general, results of these studies suggest that the 3PL model will provide better fit at the item level than the 2PL or 1PL models, unless the data are simulated to fit these latter models.…”
Section: Description Of Toefl Equating Designmentioning
confidence: 99%
“…In general, results of these studies suggest that the 3PL model will provide better fit at the item level than the 2PL or 1PL models, unless the data are simulated to fit these latter models. McKinley and Mills (1985) reported that when data were generated with the 3PL model, the 2PL model showed considerably more misfit than the 3PL model in terms of the proportions of items identified as misfitting by several goodness-of-fit statistics. However, under similar conditions, Yen (1981) found that the 2PL model fit the data almost as well as the 3PL model.…”
Section: Description Of Toefl Equating Designmentioning
confidence: 99%
“…McKinley ve Mills (1985), tarafından geliştirilen bu indeks χ 2 olabilirlik oranı olarak adlandırılır. G 2 indeksi Yen'in Q1 indeksine benzemektedir.…”
Section: G 2 İndeksiunclassified
“…Also, as the number of categories of each item was four (={0, 1, 2, 3}), there were four parameters for both models, i.e., one slope and three location parameters Two different numbers of individuals at each generation were prepared (G = 16, 32), and two different sample sizes were arranged (N = 1000, 2000). In addition, the likelihood-ratio chi-square (χ 2 ) statistic (McKinley & Mills, 1985) with degrees of freedom (Q − 1) × j (C j − 1) − 4n, where Q is the number of quadrature points on the latent scale, and four is the number of parameters, was adopted for the fitness evaluation function. There were no substantial differences between the comparison with the information criteria and that with the χ 2 statistic as long as the number of GRM parameters was equal to that for GPCM.…”
Section: Setup For Simulationmentioning
confidence: 99%