This study aims to show how the number of DIF items, and their distribution across the forms being equated, affect equating error. The mean-mean, mean-standard deviation, Haebara, and Stocking-Lord methods were used as equating methods under a common-item design with equivalent groups. The study included six simulation conditions, which were compared according to the number of DIF items and the distribution of DIF items across the tests. The results showed that adding DIF items to the tests being equated increased the errors produced by the equating methods. Across these conditions, the change in error was smallest for the characteristic curve transformation methods and largest for the moment methods.
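The two moment methods named in the abstract above can be sketched in a few lines. This is a minimal illustration of the textbook mean-sigma and mean-mean linking coefficients for the transformation b_to = A * b_from + B, not the simulation code used in the study. The function names and all item parameter values are hypothetical, chosen only to show how a single common item with DIF can shift the coefficients.

```python
from statistics import mean, pstdev


def mean_sigma(b_from, b_to):
    """Mean-sigma linking: A and B from common-item difficulties only."""
    A = pstdev(b_to) / pstdev(b_from)
    B = mean(b_to) - A * mean(b_from)
    return A, B


def mean_mean(a_from, a_to, b_from, b_to):
    """Mean-mean linking: A from mean discriminations, B from mean difficulties."""
    A = mean(a_from) / mean(a_to)
    B = mean(b_to) - A * mean(b_from)
    return A, B


# Hypothetical common-item parameters on the "from" scale.
a_from = [1.2, 0.6, 1.8]
b_from = [-1.0, 0.0, 1.0]

# Build a DIF-free "to" scale with a known transformation (A = 1.2, B = 0.5).
a_to = [a / 1.2 for a in a_from]
b_to = [1.2 * b + 0.5 for b in b_from]

clean_ms = mean_sigma(b_from, b_to)
clean_mm = mean_mean(a_from, a_to, b_from, b_to)

# Assumed uniform DIF: one common item becomes 0.6 harder on the "to" form.
b_to_dif = b_to[:]
b_to_dif[2] += 0.6
dif_ms = mean_sigma(b_from, b_to_dif)

print("mean-sigma, no DIF:", clean_ms)
print("mean-mean,  no DIF:", clean_mm)
print("mean-sigma, one DIF item:", dif_ms)
```

Without DIF, both methods recover the transformation used to build the data. Once one common item is shifted, the mean-sigma coefficients move away from it, which is the kind of error the simulation conditions are designed to measure.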
Presenting items suited to each test taker's ability level by computer, rather than through paper-and-pencil tests, may help students reach more accurate results. Computer adaptive tests (CAT), which are developed on certain assumptions to this end, aim to create an optimal test for every person taking the exam. It is therefore essential to examine how such important exams have developed and to track which studies contributed to that development, and when. CiteSpace is a program developed to map knowledge domains, explain relationships between disciplines, examine the studies of a given period, uncover the latest studies, and predict emerging trends by analyzing the bibliographic records of related publications. This study aims to find out in which areas and time periods articles about CAT were produced, and which articles had a significant effect in those periods. CiteSpace was used to run a document (article) co-citation analysis. Articles on CAT published between 1946 and 2016 were retrieved using the "or" connector. A total of 637 articles were used, and the analyses were finalized according to the resulting networks. As a result, clusters were determined from the citation relationships, and the most cited and most influential articles on CAT were presented.
Validity is one of the psychometric properties of achievement tests. One way to examine validity is through item bias studies, which are based on Differential Item Functioning (DIF) analyses and field experts’ opinions. In this study, field experts were asked to estimate the DIF levels of the items so that their estimates could be compared with those obtained from different statistical techniques. First, the experts examined the questions and estimated DIF levels with respect to gender, and the agreement among the experts was examined. Second, DIF levels were calculated using logistic regression and the Mantel-Haenszel test. Third, the experts’ estimates and the results of the statistical analyses were compared. In conclusion, the experts agreed among themselves, as did the statistical techniques; the experts’ estimates and the statistical results partially differed for the Sciences test and were the same for the Social Sciences test.
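For readers unfamiliar with the Mantel-Haenszel approach mentioned above, the following is a minimal sketch of the standard common odds ratio and its conversion to the ETS delta scale. It is not the analysis code from the study. The function name, the tuple layout of each stratum, and the counts are assumptions made for illustration; each stratum stands for one total-score level, with the reference and focal groups defined by gender.

```python
import math


def mantel_haenszel_dif(strata):
    """Return (alpha_MH, delta_MH) for one item.

    Each stratum is a tuple of counts at one matched score level:
    (ref_right, ref_wrong, focal_right, focal_wrong).
    """
    num = 0.0
    den = 0.0
    for ref_right, ref_wrong, foc_right, foc_wrong in strata:
        n = ref_right + ref_wrong + foc_right + foc_wrong
        if n == 0:
            continue  # skip empty score levels
        num += ref_right * foc_wrong / n
        den += ref_wrong * foc_right / n
    alpha = num / den                 # common odds ratio across strata
    delta = -2.35 * math.log(alpha)   # ETS delta metric; negative favors the reference group
    return alpha, delta


# Hypothetical counts: equal odds in both groups at every score level.
no_dif = [(20, 20, 10, 10), (30, 10, 15, 5)]
# Hypothetical counts: the reference group has higher odds of success.
favors_reference = [(30, 10, 10, 10)]

alpha_none, delta_none = mantel_haenszel_dif(no_dif)
alpha_ref, delta_ref = mantel_haenszel_dif(favors_reference)
print(alpha_none, delta_none)
print(alpha_ref, delta_ref)
```

An odds ratio near 1 (delta near 0) indicates no DIF after matching on total score. Values of delta further from 0 indicate larger DIF, which is the statistical quantity the experts' judgments were compared against.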
This study is based on Meltem Yurtçu's doctoral thesis titled "The Comparison of the Equated Tests Scores by Using Various Covariates using Bayesian Nonparametric Model".