At the Educational Testing Service, the Mantel‐Haenszel procedure is used for differential item functioning (DIF) detection and the standardization procedure is used to describe DIF. This report describes these procedures. First, an important distinction is made between DIF and impact, pointing to the need to compare the comparable. Then these two contingency‐table DIF procedures are described in some detail, first in terms of their own origins as DIF procedures and then from a common framework that points out their similarities and differences. The relationship between the Mantel‐Haenszel procedure and IRT models in general, and the Rasch model in particular, is discussed. The utility of the standardization approach for assessing differential distractor functioning is described. Several issues in applied DIF analyses are discussed, including inclusion of the studied item in the matching variable and refinement of the matching variable. Future research topics dealing with the matching variable, the studied variable, and the group variable are also discussed.
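To make the two contingency‐table indices concrete, here is a minimal Python sketch. The Mantel‐Haenszel D‐DIF statistic rescales the common odds ratio onto the ETS delta metric via −2.35 ln(α), and the standardization index is a weighted difference in proportions correct, typically weighted by the focal group's score distribution. The function and variable names and the array layout are assumptions for illustration, not code from the report.

```python
import numpy as np

def mantel_haenszel_d_dif(a, b, c, d):
    """Mantel-Haenszel D-DIF for one studied item.

    a, b, c, d are arrays indexed by matching-score level k:
      a[k]: reference-group examinees answering correctly
      b[k]: reference-group examinees answering incorrectly
      c[k]: focal-group examinees answering correctly
      d[k]: focal-group examinees answering incorrectly
    """
    t = a + b + c + d                                  # total examinees at level k
    alpha_mh = np.sum(a * d / t) / np.sum(b * c / t)   # MH common odds ratio
    return -2.35 * np.log(alpha_mh)                    # delta-scale D-DIF

def std_p_dif(p_focal, p_ref, n_focal):
    """Standardization index: weighted difference in proportions correct
    at each matching-score level, weighted here (as is typical) by the
    focal group's counts."""
    w = n_focal / n_focal.sum()
    return np.sum(w * (p_focal - p_ref))
```

Under the usual reading, D‐DIF values near zero indicate negligible DIF, with negative values indicating that the item disadvantages the focal group after matching.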
How does the fact that two tests should not be equated manifest itself? This paper addresses this question by studying the degree to which equating functions fail to exhibit population invariance across subpopulations. Equating functions are supposed to be population invariant by definition. But when two tests are not equatable, it is possible that the linking functions used to connect the scores of one to the scores of the other are not invariant across different populations of examinees. While no acceptable equating function is ever completely population invariant, in the situations where equating is usually performed we believe that the dependence of the equating function on the population used to compute it is usually small enough to be ignored. We introduce two root‐mean‐square difference measures of the degree to which the functions used to link two tests, computed on different subpopulations, differ from the linking function computed for the whole population. We also introduce the system of “parallel‐linear” linking functions for multiple subpopulations and show that, for this system, our measure of population invariance can be computed easily from the standardized mean differences between the scores of the subpopulations on the two tests. For the parallel‐linear case, we develop a correlation‐based upper bound on our measure that holds for all systems of subpopulations. We illustrate these ideas using data from the SAT I and from a concordance study of several combinations of ACT and SAT I scores. In the appendices, we give some theoretical results bearing on the other equating “requirements” of “same construct” and “same reliability,” and on one aspect of Lord's concept of equity.
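The abstract does not give the formulas, but a root‐mean‐square difference measure of this kind can be sketched as follows: average the squared gap between each subpopulation's linking function and the whole‐population linking function over the score distribution, weight across subpopulations, take the square root, and standardize. A minimal sketch; the specific weighting choices and the standardization by the target‐score standard deviation are assumptions for illustration, not the published definitions.

```python
import numpy as np

def remsd(link_sub, link_pop, weights, x_scores, x_density, sigma_y):
    """Root-mean-square difference between subpopulation linking
    functions and the whole-population linking function.

    link_sub:  list of callables e_j(x), one per subpopulation j
    link_pop:  callable e_P(x) for the whole population
    weights:   subpopulation weights w_j (summing to 1)
    x_scores:  grid of X scores; x_density: P(X = x) on that grid
    sigma_y:   SD of Y in the whole population (standardizes the result)
    """
    e_pop = np.array([link_pop(x) for x in x_scores])
    sq = 0.0
    for w, e_j in zip(weights, link_sub):
        diff2 = (np.array([e_j(x) for x in x_scores]) - e_pop) ** 2
        sq += w * np.sum(x_density * diff2)   # expected squared gap for group j
    return np.sqrt(sq) / sigma_y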
The standardization method for assessing unexpected differential item performance or differential item functioning (DIF) is introduced. The principal findings of the first five studies that have used this approach on the Scholastic Aptitude Test are presented.
Equating functions are supposed to be population invariant by definition. But when two tests are not equatable, it is possible that the linking functions used to connect the scores of one to the scores of the other are not invariant across different populations of examinees. We introduce two root-mean-square difference measures of the degree to which linking functions differ across subpopulations. We also introduce the system of "parallel-linear" linking functions for multiple subpopulations and show that, for this system, our measure of population invariance can be easily computed from the standardized mean differences between the scores of the subpopulations on the two tests. For the parallel-linear case, we develop a correlation-based upper bound on our measure.
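In the parallel‐linear system the subpopulation linking functions share a common slope, so each group's gap from the whole‐population function is a constant driven by that group's standardized mean differences on the two tests. A hedged sketch, assuming the measure reduces to a weighted root mean square of the gaps between those standardized mean differences, with a correlation‐based bound of the form sqrt(2(1 − ρ)); the exact normalization in the published measure may differ.

```python
import numpy as np

def remsd_parallel_linear(w, mu_x, mu_y, mu_x_pop, mu_y_pop, sd_x, sd_y):
    """Parallel-linear case: weighted RMS of the gaps between each
    subpopulation's standardized mean differences on tests X and Y."""
    d_x = (np.asarray(mu_x) - mu_x_pop) / sd_x   # standardized mean diffs on X
    d_y = (np.asarray(mu_y) - mu_y_pop) / sd_y   # standardized mean diffs on Y
    return np.sqrt(np.sum(np.asarray(w) * (d_x - d_y) ** 2))

def correlation_bound(rho_xy):
    """Upper bound of the assumed form sqrt(2 * (1 - rho)): the closer the
    two tests' whole-population correlation is to 1, the less room the
    linking functions have to diverge across subpopulations."""
    return np.sqrt(2.0 * (1.0 - rho_xy))
```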
To the extent that the outcomes of health assessment instruments are to be used interchangeably, the summary scores based on those outcomes need to be equated or otherwise made comparable. If the summary scores of different health assessment instruments are not equated, inferences based on them could be flawed. Ideally, summary scores would be comparable because of careful instrument design; in practice, that rarely happens, and statistical intervention is usually needed. This article addresses key questions associated with the linking of summary scores of health outcomes. What is meant by outcome linking and equating? How does equating differ from other types of linking? What common data collection designs are used to capture data for outcomes linking? What are some of the standard statistical procedures used to link outcomes directly, and what assumptions do they make? What role does item response theory (IRT) play in linking outcomes, and what assumptions do IRT methods make? The article makes a distinction between direct statistical adjustments of summary score distributions and indirect procedures based on psychometric models of items or questions.
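One standard direct adjustment of summary score distributions is equipercentile linking: map each score on one instrument to the score with the same percentile rank on the other. A minimal Python sketch, assuming simple empirical percentile ranks (real applications add presmoothing and continuization); the function and variable names are illustrative, not from the article.

```python
import numpy as np

def equipercentile_link(scores_x, scores_y):
    """Return a function mapping an X summary score to the Y summary
    score at the same empirical percentile rank."""
    x_sorted = np.sort(scores_x)
    n = len(x_sorted)

    def link(x):
        pr = np.searchsorted(x_sorted, x, side="right") / n  # percentile rank of x
        return np.quantile(scores_y, min(max(pr, 0.0), 1.0)) # matching Y quantile

    return link
```

Usage: fit `link = equipercentile_link(x_sample, y_sample)` on examinees who took both instruments (a single-group design), then apply `link(x)` to translate new X scores onto the Y scale.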