This discussion paper argues that the use of Cronbach's alpha, both as a reliability estimate and as a measure of internal consistency, suffers from major problems. First, given the interitem covariance matrix and the usual assumptions about measurement error, alpha is a lower bound to the test score's reliability and in general cannot equal it. Second, in practice alpha is used more often as a measure of the test's internal consistency than as an estimate of reliability; however, it can easily be shown that alpha is unrelated to the internal structure of the test. It is further argued that statistics based on a single test administration convey little information about the accuracy of individuals' test performance. The paper ends with a list of conclusions about the usefulness of alpha.
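As a point of reference for the first argument, this is how alpha is computed from raw item scores; a minimal NumPy sketch (the function name is illustrative, not from the paper):

```python
import numpy as np

def cronbach_alpha(scores: np.ndarray) -> float:
    """Cronbach's alpha for an n_persons x k_items matrix of item scores."""
    k = scores.shape[1]
    item_vars = scores.var(axis=0, ddof=1)      # variance of each item
    total_var = scores.sum(axis=1).var(ddof=1)  # variance of the total test score
    return (k / (k - 1)) * (1.0 - item_vars.sum() / total_var)
```

Under classical test theory the resulting value equals reliability only in the special case of essentially tau-equivalent items; otherwise it underestimates it.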
Some usability and interpretability issues for single-strategy cognitive assessment models are considered. These models posit a stochastic conjunctive relationship between a set of cognitive attributes to be assessed and performance on particular items/tasks in the assessment. The models considered make few assumptions about the relationship between latent attributes and task performance beyond a simple conjunctive structure. An example shows that these models can be sensitive to cognitive attributes even in data designed to fit the Rasch model well. Several stochastic ordering and monotonicity properties that enhance the interpretability of the models are considered. Simple data summaries are identified that inform about the presence or absence of cognitive attributes when the full computational power needed to estimate the models is not available.
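One widely studied model with exactly this conjunctive structure is the DINA model, in which an item is likely to be answered correctly only if all attributes the item requires are mastered. A minimal sketch under that assumption (function name and parameterization are illustrative):

```python
import numpy as np

def dina_prob(attributes: np.ndarray, q: np.ndarray, slip: float, guess: float) -> float:
    """P(correct response) under a conjunctive DINA-type model.

    attributes : 0/1 vector of the attributes a person has mastered
    q          : 0/1 vector marking the attributes the item requires
    slip/guess : per-item error probabilities
    """
    # eta = 1 iff every required attribute is mastered (the conjunctive part)
    eta = int(np.all(attributes[q == 1] == 1))
    return (1.0 - slip) ** eta * guess ** (1 - eta)
```

The stochastic element enters only through the slip and guess probabilities; beyond that, the link between attributes and performance is the deterministic conjunction eta.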
Person-fit methods based on classical test theory and item response theory (IRT), as well as methods investigating particular types of response behavior on tests, are examined. Similarities and differences among person-fit methods and their advantages and disadvantages are discussed. Sound person-fit methods have been derived for the Rasch model. For other IRT models, the empirical and theoretical distributions of most person-fit statistics differ for tests of short and moderate length. The detection rate of person-fit statistics depends on the type of misfitting item-score patterns, test length, and trait levels. The usefulness of person-fit statistics for improving measurement depends on the application.
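To make the distributional issue concrete, the sketch below computes the widely used standardized log-likelihood person-fit statistic l_z, whose null distribution is only approximately standard normal for short tests. It assumes the model-implied response probabilities are already available (the function name is illustrative):

```python
import numpy as np

def lz_statistic(x: np.ndarray, p: np.ndarray) -> float:
    """Standardized log-likelihood person-fit statistic l_z.

    x : 0/1 item-score pattern of one person
    p : IRT model-implied probabilities of a correct response at the
        person's trait level (assumed given, strictly between 0 and 1)
    """
    l0 = np.sum(x * np.log(p) + (1 - x) * np.log(1 - p))    # observed log-likelihood
    mean = np.sum(p * np.log(p) + (1 - p) * np.log(1 - p))  # its expectation
    var = np.sum(p * (1 - p) * np.log(p / (1 - p)) ** 2)    # its variance
    return (l0 - mean) / np.sqrt(var)
```

Large negative values flag item-score patterns that are unlikely under the model, but cut-offs based on the normal approximation can be misleading for short and moderate-length tests.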
Investigating an invariant item ordering for polytomously scored items
Ligtvoet, R.; van der Ark, L. A.; Te Marvelde, J. M.; Sijtsma, K.
Over the past decade, Mokken scale analysis (MSA) has rapidly grown in popularity among researchers from many different research areas. This tutorial provides researchers with a set of techniques and a procedure for applying them, so that they can take full advantage of MSA's properties when constructing scales with superior measurement properties. First, we define the conceptual context of MSA, discuss the two item response theory (IRT) models that constitute the basis of MSA, and discuss how these models differ from other IRT models. Second, we discuss dos and don'ts for MSA; the don'ts include misunderstandings we have frequently encountered among researchers in our three decades of experience with real-data MSA. Third, we discuss a methodology for MSA on real data consisting of a sample of persons who have provided scores on a set of items that, depending on the composition of the item set, constitute the basis for one or more scales, and we use the methodology to analyse an example real-data set.
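Both IRT models underlying MSA assume monotone nondecreasing item response functions, which in practice is checked against observable data via manifest monotonicity: the mean item score should not decrease across groups with increasing rest score. A minimal sketch of that check (illustrative; MSA software additionally pools sparse rest-score groups):

```python
import numpy as np

def item_rest_regression(scores: np.ndarray, j: int):
    """Mean score on item j per rest-score group (manifest monotonicity check).

    scores : n_persons x k_items matrix; rest score = total score minus item j.
    Under monotone homogeneity the returned means should be nondecreasing.
    """
    rest = scores.sum(axis=1) - scores[:, j]
    groups = np.unique(rest)
    means = np.array([scores[rest == r, j].mean() for r in groups])
    return groups, means
```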
An automated item selection procedure for selecting unidimensional scales of polytomous items from multidimensional datasets is developed for use in the context of the Mokken item response theory model of monotone homogeneity (Mokken & Lewis, 1982). The selection procedure is directly based on the selection procedure proposed by Mokken (1971, p. 187) and relies heavily on the scalability coefficient H (Loevinger, 1948; Molenaar, 1991). New theoretical results relating the latent model structure to H are provided. The item selection procedure requires selection of a lower bound for H. A simulation study determined ranges of H for which the unidimensional item sets were retrieved from multidimensional datasets. If multidimensionality is suspected in an empirical dataset, well-chosen lower bound values can be used effectively to detect the unidimensional scales.
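For orientation, the sketch below computes the total scalability coefficient H in the simpler dichotomous case, as the ratio of the summed interitem covariances to their maxima given the item marginals; the polytomous version used in the paper generalizes this by weighting Guttman errors (the function name is illustrative):

```python
import numpy as np

def scalability_H(scores: np.ndarray) -> float:
    """Loevinger/Mokken total scalability coefficient H for 0/1 item scores."""
    p = scores.mean(axis=0)                     # item popularities
    cov = np.cov(scores, rowvar=False, ddof=0)  # interitem covariances
    num = den = 0.0
    k = scores.shape[1]
    for j in range(k):
        for m in range(j + 1, k):
            lo, hi = sorted((p[j], p[m]))
            num += cov[j, m]
            den += lo * (1.0 - hi)              # maximum covariance given marginals
    return num / den
```

Item selection with a lower bound c then amounts to requiring the scalability coefficients of an admitted scale to exceed c, commonly set at 0.3.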