2003
DOI: 10.1177/0146621603259902
|View full text |Cite
|
Sign up to set email alerts
|

Effects of Anchor Item Methods on Differential Item Functioning Detection with the Likelihood Ratio Test

Abstract: Through simulations, this study investigates the effects of anchor item methods on Type I error and power of detecting differential item functioning (DIF) using the likelihood ratio test within the framework of item response theory. Four anchor item methods were compared: the all-other, 1-item, 4-item, and 10-item methods. The results showed that it is the average signed area between the reference and focal groups rather than the percentage of DIF items in a test that determines the Type I error of the all-oth… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

20
169
0

Year Published

2006
2006
2024
2024

Publication Types

Select...
8
1

Relationship

1
8

Authors

Journals

citations
Cited by 110 publications
(189 citation statements)
references
References 29 publications
(35 reference statements)
20
169
0
Order By: Relevance
“…This procedure has been implemented on several popular computer programs, such as BILOG-MG [37], WINSTEPS [38], and ConQuest. There are other procedures to solve scale indeterminacy [39], for example, if a set of items are believed to have no DIF, they can served as the anchors so that the other items can be detected for the evidence of DIF. Unfortunately, this procedure was not applicable in this study, because no prior knowledge was available to claim which items were indeed DIF-free.…”
Section: Resultsmentioning
confidence: 99%
“…This procedure has been implemented on several popular computer programs, such as BILOG-MG [37], WINSTEPS [38], and ConQuest. There are other procedures to solve scale indeterminacy [39], for example, if a set of items are believed to have no DIF, they can served as the anchors so that the other items can be detected for the evidence of DIF. Unfortunately, this procedure was not applicable in this study, because no prior knowledge was available to claim which items were indeed DIF-free.…”
Section: Resultsmentioning
confidence: 99%
“…Several commonly used models and associated test statistics have been developed to detect DIF as a function of membership in observed groups (like gender or ethnicity), including the Mantel-Hansel test (Holland & Thayer, 1988), the standardization method (Dorans & Kulick, 1986), the logistic regression model (for example, Swaminathan & Rogers, 1990), the IRT-based chi-square test (Lord, 1980;Wright & Stone, 1979), the likelihood ratio test (IRT-LRT; Thissen, Steinberg, & Wainer, 1988;Wang & Yeh, 2003), and the multiple indicators multiple causes (MIMIC : Muthén, 1985;1989) model.…”
Section: Chapter 1: Introductionmentioning
confidence: 99%
“…In addition, as Wang and Yeh (2003) stated, if LR DIF can detect non-uniform DIF better than the MH DIF method, and is as powerful at detecting uniform DIF as the MH DIF method, then the inclusion of an effect size would make LR DIF a very attractive choice as a DIF detection method.…”
Section: Discussionmentioning
confidence: 99%