Performance of Polytomous IRT Models With Rating Scale Data: An Investigation Over Sample Size, Instrument Length, and Missing Data

Dai, Shenghai; Vo, Thao T.; Kehinde, Olasunkanmi James; He, Haibo; Xue, Yu; Demir, Cihan; Wang, Xiaolin

doi:10.3389/feduc.2021.721963

Cited by 25 publications

(20 citation statements)

References 38 publications


“…In practice, the chances of obtaining accurate item parameter estimation based on the assumption that items are tightly restricted to perfect simple structure rather than multiple latent traits are uncertain (Finch, 2011). According to Dai et al (2021), the choice of polytomous IRT models (e.g., GRM and generalized partial credit model [GPCM]) is beyond the model fit indices, especially when the sample size is less than 300 and the test length is less than 5. Similarly, fitting a complex structure of multidimensionality to a simple structure may be inappropriate in practice.…”

Section: Discussion (mentioning)

confidence: 99%

“…Where $P^*_{jk}(\theta)$ is the probability that observed scores for item j and examinee i given the ability or latent trait $\theta$ to obtain a score greater than or equal to category k, D = 1 or 1.7, $a_{jm}$ is the vector of item discrimination parameters for item j on each latent trait m, $b_{jk}$ is the vector of item difficulty parameters for each category k within item j, $\theta_m$ is the vector of the latent traits on the $m$th dimension. However, the number of latent traits and category responses influence the dynamical feature of MGRM to GRM, and other multidimensional IRT models (e.g., multidimensional two-parameter logistic model; De Ayala, 1994; Embretson and Reise, 2000; Penfield, 2014; Dai et al., 2021).…”

Section: Background and Literature (mentioning)

confidence: 99%
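
The citation statement above defines the parameters of the cumulative category probability but does not reproduce the formula itself. The sketch below assumes the commonly used logistic form, in which the probability of scoring in category k or higher depends on D times the sum over dimensions of a_jm(θ_m − b_jk). That functional form, the function names `grm_cumulative` and `grm_category_probs`, and all numeric values are illustrative assumptions, not the citing paper's implementation.

```python
import math


def grm_cumulative(theta, a, b_k, D=1.0):
    """P*(score >= k | theta) under an ASSUMED logistic form.

    theta: latent trait values, one per dimension
    a:     item discriminations, one per dimension
    b_k:   threshold (difficulty) for category k
    D:     scaling constant (1 or 1.7)
    """
    z = D * sum(a_m * (t_m - b_k) for a_m, t_m in zip(a, theta))
    return 1.0 / (1.0 + math.exp(-z))


def grm_category_probs(theta, a, b, D=1.0):
    """Probability of each response category for one item.

    b holds the ordered thresholds; an item with len(b) thresholds
    has len(b) + 1 response categories.
    """
    # Cumulative probabilities, padded with 1 (lowest) and 0 (beyond highest).
    cum = [1.0] + [grm_cumulative(theta, a, b_k, D) for b_k in b] + [0.0]
    # Adjacent differences give the probability of each single category.
    return [cum[k] - cum[k + 1] for k in range(len(b) + 1)]


# Hypothetical two-dimensional item with four response categories.
probs = grm_category_probs(theta=[0.5, -0.2], a=[1.2, 0.8], b=[-1.0, 0.0, 1.0])
```

Because the category probabilities are differences of adjacent cumulative probabilities, they sum to one whenever the thresholds are ordered and the discriminations are positive.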


Item parameter estimations for multidimensional graded response model under complex structures

2022

Self Cite


Item parameter recovery in the compensatory multidimensional graded response model (MGRM) under simple and complex structures with rating-scale item response data was examined. A simulation study investigated factors that influence the precision of item parameter estimation, including sample size, intercorrelation between the dimensions, and test length for the MGRM under balanced and unbalanced complex structures, as well as the simple structure. The item responses for the MGRM were generated and analyzed across conditions using the R package mirt. Bias and root mean square error (RMSE) were used to evaluate item parameter recovery. Results suggested that item parameter estimation was more accurate in balanced complex structure conditions than in unbalanced or simple structures, especially when the test length was 40 items and the sample size was large. Further, the mean bias and RMSE in the recovery of item threshold estimates along the two dimensions for both balanced and unbalanced complex structures were consistent across all conditions.
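
The abstract names bias and RMSE as the recovery criteria without giving formulas. A minimal sketch, assuming the standard definitions (mean signed error and root of the mean squared error across replications) and using made-up estimates, might look like this. The function names and the numbers are hypothetical.

```python
import math


def bias(estimates, true_value):
    """Mean signed difference between the estimates and the true parameter."""
    return sum(e - true_value for e in estimates) / len(estimates)


def rmse(estimates, true_value):
    """Root mean square error of the estimates around the true parameter."""
    squared_errors = [(e - true_value) ** 2 for e in estimates]
    return math.sqrt(sum(squared_errors) / len(estimates))


# Hypothetical discrimination estimates from four replications; true value 1.0.
replications = [1.1, 0.9, 1.3, 1.0]
item_bias = bias(replications, 1.0)
item_rmse = rmse(replications, 1.0)
```

Bias shows whether estimates are systematically too high or too low, while RMSE also reflects their variability, which is why simulation studies commonly report both.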


“…The multidimensional graded response model, specifically, has been recommended for survey assessment especially because of its ability to be used with lower sample sizes. [21] While very large sample sizes are not required to ensure generalizability, a large enough sample size is required to ensure accurate model fit. [22] It has been recommended that sample sizes of at least 300 and instrument length of at least 5 be required for both GRM and GPCM models. [22] To determine which model resulted in the best fit for the data, the M2 and SRMSR statistics were compared.…”

Section: Financial Self-Efficacy Scale (FSES) (mentioning)

confidence: 99%

“…[21] While very large sample sizes are not required to ensure generalizability, a large enough sample size is required to ensure accurate model fit. [22] It has been recommended that sample sizes of at least 300 and instrument length of at least 5 be required for both GRM and GPCM models. [22] To determine which model resulted in the best fit for the data, the M2 and SRMSR statistics were compared. [21,23] We also evaluated the models using Akaike and Bayesian Information Criterion (AIC and BIC).…”

Section: Financial Self-Efficacy Scale (FSES) (mentioning)

confidence: 99%

Financial anxiety, financial self-efficacy, and general social supports: Reliability of assessments

Dickson

Mulligan

2023

Preprint


Background: Educational debt continues to increase across the health professions. Financial self-efficacy and generalized social supports are suggested as possible ways to mitigate the financial anxiety that results from high levels of debt. Assessment tools have not been evaluated for reliability among any group of health professions students. The purpose of this study was to assess the reliability of tools measuring financial anxiety, financial self-efficacy, and general social support in a graduate health profession student population. Methods: The Financial Anxiety Scale, Financial Self-Efficacy Scale, and General Social Support Scale were completed by 510 physical therapist students. Item response theory was used to assess reliability and item fit for each assessment. Results: The Financial Anxiety Scale, Financial Self-Efficacy Scale, and General Social Support Scale are reliable measures and demonstrate good item fit among the population of physical therapist students in the United States. Conclusions: Because the results of an item response theory analysis are not dependent on the population studied, the assessments may be reliable among other health professions students. The Financial Anxiety Scale and Financial Self-Efficacy Scale provide a large amount of test information for physical therapist students. The General Social Support Scale, by contrast, may be best utilized as a screening tool for those who have very low levels of general social supports.


“…According to the parameters estimated, IRT models include the one-parameter model (Rasch model; Rasch, 1960), the two-parameter model (Birnbaum, 1958a, 1958b; Lord, 1952) … (Embretson & Reise, 2000). In contrast, the method of successive intervals does not depend heavily on statistical assumptions (Rozeboom & Jones, 1956) … (Dai et al., 2021; Reise & Yu, 1990 … 1952). Early studies (Guilford, 1954; Hevner, 1930; Saffir, 1937) … (Dane, 1985; Dhami, 2008; Han & Park, 2017).…”

Section: In the method of successive intervals, stimuli and response category boundaries … (unclassified)

Introduction and an application of the method of successive intervals: Focusing on scaling blameworthiness of behaviors

Han

Lee

2021

KCPA
