Assessing Approximate Fit in Categorical Data Analysis

Maydeu-Olivares, Alberto; Joe, Harry

doi:10.1080/00273171.2014.911075

Cited by 185 publications

(145 citation statements)

References 41 publications

(58 reference statements)

Supporting

Mentioning

135

Contrasting

Unclassified


“…A 90% confidence interval for the RMSEA2 yields [0.03, 0.04]. Maydeu-Olivares and Joe (2014) suggest that IRT models with an RMSEA2 less than or equal to 0.05 provide a close approximation to the data-generating model and that those with an RMSEA2 less than or equal to 0.05 / (K − 1) provide an excellent approximation. Because K = 5, their criterion for an excellent approximation is RMSEA2 ≤ 0.0125.…”

Section: Promis Depression Short Form (mentioning)

confidence: 95%
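The cutoffs quoted in this statement can be checked with a few lines of arithmetic. The following is a minimal Python sketch, not code from the cited paper: the function names `rmsea2_cutoffs` and `classify_rmsea2` and the label strings are hypothetical, and only the thresholds 0.05 and 0.05 / (K − 1) come from the quoted text.

```python
def rmsea2_cutoffs(n_categories: int) -> dict:
    """Return the RMSEA2 thresholds quoted above for items with K categories.

    Hypothetical helper: only the two threshold values come from the
    quoted statement; the function itself is an illustration.
    """
    if n_categories < 2:
        raise ValueError("an item needs at least 2 response categories")
    close = 0.05                               # close approximation
    excellent = close / (n_categories - 1)     # excellent approximation
    return {"close": close, "excellent": excellent}


def classify_rmsea2(rmsea2: float, n_categories: int) -> str:
    """Label an RMSEA2 value against the quoted thresholds."""
    cutoffs = rmsea2_cutoffs(n_categories)
    if rmsea2 <= cutoffs["excellent"]:
        return "excellent"
    if rmsea2 <= cutoffs["close"]:
        return "close"
    return "not close"


if __name__ == "__main__":
    k = 5  # number of response categories in the quoted application
    print(rmsea2_cutoffs(k))
    # Both endpoints of the quoted 90% confidence interval [0.03, 0.04]
    for bound in (0.03, 0.04):
        print(bound, classify_rmsea2(bound, k))
```

With K = 5 the excellent-fit threshold is 0.05 / 4 = 0.0125, matching the quoted value, and both endpoints of the interval [0.03, 0.04] fall between that threshold and 0.05.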

“…Our advice in this application is to attempt to find a better fitting model. Failing to do so, the fitted model may be used as it provides a close fit to the data using the criteria of Maydeu-Olivares and Joe (2014). Here is a word of caution: a piecewise assessment shall be performed regardless of the value of the RMSEA2 (or similar overall measure of fit).…”

Section: Promis Depression Short Form (mentioning)

confidence: 98%


Identifying the Source of Misfit in Item Response Theory Models

Liu

Maydeu-Olivares

2014

Multivariate Behavioral Research

Self Cite


When an item response theory model fails to fit adequately, the items for which the model provides a good fit and those for which it does not must be determined. To this end, we compare the performance of several fit statistics for item pairs with known asymptotic distributions under maximum likelihood estimation of the item parameters: (a) a mean and variance adjustment to bivariate Pearson's X², (b) a bivariate subtable analog to Reiser's (1996) overall goodness-of-fit test, (c) a z statistic for the bivariate residual cross product, and (d) Maydeu-Olivares and Joe's (2006) M2 statistic applied to bivariate subtables. The unadjusted Pearson's X² with heuristically determined degrees of freedom is also included in the comparison. For binary and ordinal data, our simulation results suggest that the z statistic has the best Type I error and power behavior among all the statistics under investigation when the observed information matrix is used in its computation. However, if one has to use the cross-product information, the mean and variance adjusted X² is recommended. We illustrate the use of pairwise fit statistics in 2 real-data examples and discuss possible extensions of the current research in various directions.

“…For example, Maydeu-Olivares (2013) developed a rationale for constructing an M2-based RMSEA. More recently, Maydeu-Olivares & Joe (2014) expanded on this line of research and proposed some cutoff criteria for approximate fit. Another example is provided by Lee and Cai (2012), which proposed an M2-based Tucker-Lewis Index (Tucker & Lewis, 1973).…”

Section: Introduction (mentioning)

confidence: 99%

Evaluating Structural Equation Models for Categorical Outcomes: A New Test Statistic and a Practical Challenge of Interpretation

Monroe

Cai

2015

Multivariate Behavioral Research


This research is concerned with two topics in assessing model fit for categorical data analysis. The first topic involves the application of a limited-information overall test, introduced in the item response theory literature, to Structural Equation Modeling (SEM) of categorical outcome variables. Most popular SEM test statistics assess how well the model reproduces estimated polychoric correlations. In contrast, limited-information test statistics assess how well the underlying categorical data are reproduced. Here, the recently introduced C2 statistic of Cai and Monroe (2014) is applied. The second topic concerns how the Root Mean Square Error of Approximation (RMSEA) fit index can be affected by the number of categories in the outcome variable. This relationship creates challenges for interpreting RMSEA. While the two topics initially appear unrelated, they may conveniently be studied in tandem since RMSEA is based on an overall test statistic, such as C2. The results are illustrated with an empirical application to data from a large-scale educational survey.


“…Given the conceptual partial ordering and the test option response functions shown in Figure , we decided to fit both ordered and nominal models for polytomous data. The following decision rule was adopted regarding fit: on the basis of the AIC and BIC, select the model that performs best across all forms and grades from among those with acceptable RMSEAs (RMSEA ≤ 0.09; Browne & Cudeck; Hu & Bentler; Maydeu‐Olivares & Joe). Also, we preferred to use the same model for all forms.…”

Section: Methods (mentioning)

confidence: 99%

Can We Learn From Student Mistakes in a Formative, Reading Comprehension Assessment?

Liu

Kennedy

Seipel

et al. 2019

J Educational Measurement


This article describes an ongoing project to develop a formative, inferential reading comprehension assessment of causal story comprehension. It has three features to enhance classroom use: equated scale scores for progress monitoring within and across grades, a scale score to distinguish among low‐scoring students based on patterns of mistakes, and a reading efficiency index. Instead of two response types for each multiple‐choice item, correct and incorrect, each item has three response types: correct and two incorrect response types. Prior results on reliability, convergent and discriminant validity, and predictive utility of mistake subscores are briefly described. The three‐response‐type structure of items required rethinking the item response theory (IRT) modeling. IRT‐modeling results are presented, and implications for formative assessments and instructional use are discussed.
