In social sciences, the study of group differences concerning latent constructs is ubiquitous. These constructs are generally measured by means of scales composed of ordinal items. In order to compare these constructs across groups, one crucial requirement is that they are measured equivalently or, in technical jargon, that measurement invariance (MI) holds across the groups. This study compared the performance of scale- and item-level approaches based on multiple group categorical confirmatory factor analysis (MG-CCFA) and multiple group item response theory (MG-IRT) in testing MI with ordinal data. In general, the results of the simulation studies showed that MG-CCFA-based approaches outperformed MG-IRT-based approaches when testing MI at the scale level, whereas, at the item level, the best-performing approach depends on the tested parameter (i.e., loadings or thresholds). That is, when testing the equivalence of loadings, the likelihood ratio test provided the best trade-off between true-positive rate and false-positive rate, whereas, when testing the equivalence of thresholds, the χ² test outperformed the other testing strategies. In addition, the performance of MG-CCFA's fit measures, such as RMSEA and CFI, seemed to depend largely on the length of the scale, especially when MI was tested at the item level. General caution is recommended when using these measures, especially when MI is tested for each item individually.
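To make the likelihood ratio test concrete, here is a minimal Python sketch: the constrained model fixes a parameter (e.g., a loading) equal across groups, the free model does not, and twice the log-likelihood difference is referred to a χ² distribution. The log-likelihood values and degrees of freedom below are hypothetical placeholders, not results from the study.

```python
# Minimal sketch of a likelihood ratio (chi-square difference) test between
# a constrained model (parameter equal across groups) and a free model.
from scipy.stats import chi2

def lr_test(loglik_constrained, loglik_free, df_diff):
    """LRT statistic: 2 * (LL_free - LL_constrained), referred to chi2(df_diff)."""
    lr_stat = 2.0 * (loglik_free - loglik_constrained)
    p_value = chi2.sf(lr_stat, df_diff)
    return lr_stat, p_value

# Hypothetical fitted log-likelihoods for one item's loading constraint:
stat, p = lr_test(loglik_constrained=-10234.7, loglik_free=-10230.1, df_diff=1)
print(f"LRT = {stat:.2f}, p = {p:.4f}")  # flag non-invariance if p < alpha
```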
In psychological science, self-report scales are widely used to compare means in targeted latent constructs across time points, groups, or experimental conditions. For these scale mean comparisons (SMCs) to be meaningful and unbiased, the scales should be measurement invariant across the compared time points or (experimental) groups. Measurement invariance (MI) testing checks whether the latent constructs are measured equivalently across groups or time points. Since MI is essential for meaningful comparisons, we conducted a systematic review to check whether MI is taken seriously in psychological research. Specifically, we sampled 426 psychology articles with openly available data, involving a total of 918 SMCs, to (1) investigate common practices in conducting and reporting MI tests, (2) check whether reported MI test results can be reproduced, and (3) conduct MI tests for the SMCs that enabled sufficiently powerful MI testing with the shared data. Our results indicate that (1) only 4% of the 918 scales underwent MI testing across groups or time and that these tests were generally poorly reported, (2) none of the reported MI tests could be successfully reproduced, and (3) of 161 newly performed MI tests, a mere 46 (29%) reached sufficient MI (scalar invariance), and MI often failed completely (89; 55%). Thus, MI tests were rarely done and poorly reported in psychological studies, and the frequent violations of MI indicate that reported group differences cannot be solely attributed to group differences in the latent constructs. We offer recommendations on reporting MI tests and improving computational reproducibility practices.
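For readers unfamiliar with the MI hierarchy referenced above (configural → metric → scalar), the following hedged Python sketch shows how a sequence of nested χ²-difference tests determines the highest invariance level attained. All fit statistics are hypothetical; in practice they come from fitted multiple-group CFA models.

```python
# Illustrative sketch of the standard nested MI sequence, tested via
# chi-square difference tests between successively more constrained models.
from scipy.stats import chi2

models = [  # (label, chi2, df), ordered from least to most constrained
    ("configural", 312.4, 240),
    ("metric",     330.9, 252),  # loadings constrained equal across groups
    ("scalar",     398.6, 264),  # loadings + intercepts/thresholds equal
]

attained = models[0][0]
for (_, x0, df0), (name1, x1, df1) in zip(models, models[1:]):
    p = chi2.sf(x1 - x0, df1 - df0)
    if p < 0.05:  # added constraints significantly worsen fit -> stop
        break
    attained = name1

print(f"Highest invariance level attained: {attained}")
```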
Assessing the measurement model (MM) of self-report scales is crucial to obtain valid measurement of individuals' latent psychological constructs. This entails evaluating the number of measured constructs and determining which construct is measured by which item. Exploratory factor analysis (EFA) is the most-used method to evaluate these psychometric properties, where the number of measured constructs (i.e., factors) is assessed, and, afterwards, rotational freedom is resolved to interpret these factors. This study assessed the effects of an acquiescence response style (ARS) on EFA for unidimensional and multidimensional (un)balanced scales. Specifically, we evaluated (i) whether ARS is captured as an additional factor, (ii) the effect of different rotation approaches on the recovery of the content and ARS factors, and (iii) the effect of extracting the additional ARS factor on the recovery of factor loadings. ARS was often captured as an additional factor in balanced scales when it was strong. For these scales, ignoring (i.e., not extracting) this additional ARS factor, or rotating to simple structure when extracting it, harmed the recovery of the original MM by introducing bias in loadings and cross-loadings. These issues were avoided by using informed rotation approaches (i.e., target rotation), where (part of) the MM is specified a priori. Not extracting the additional ARS factor did not affect the loading recovery in unbalanced scales. Researchers should consider the potential presence of an additional ARS factor when assessing the psychometric properties of balanced scales, and use informed rotation approaches when suspecting that an additional factor is an ARS factor.
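As an illustration of an informed rotation approach, the following Python sketch implements orthogonal Procrustes target rotation, rotating an unrotated loading matrix toward a target that encodes a priori expectations about the MM. This is only one variant of target rotation (the study may use partially specified or oblique targets), and all loading values are hypothetical.

```python
# Hedged sketch of orthogonal Procrustes target rotation: rotate an
# unrotated EFA loading matrix A toward an a-priori target T.
import numpy as np

def target_rotate(A, T):
    """Find the orthogonal Q minimizing ||A @ Q - T||_F and return A @ Q."""
    U, _, Vt = np.linalg.svd(A.T @ T)
    return A @ (U @ Vt)

# Hypothetical 6-item, 2-factor solution; the target expects items 1-3 on
# factor 1 and items 4-6 on factor 2. (For an ARS factor, the target column
# would instead have nonzero entries on all items.)
A = np.array([[0.55, 0.31], [0.60, 0.28], [0.58, 0.35],
              [0.40, -0.45], [0.42, -0.50], [0.38, -0.48]])
T = np.array([[0.6, 0.0], [0.6, 0.0], [0.6, 0.0],
              [0.0, 0.6], [0.0, 0.6], [0.0, 0.6]])
print(np.round(target_rotate(A, T), 2))
```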
In social sciences, the study of group differences concerning latent constructs is ubiquitous. These constructs are generally measured by means of scales composed of ordinal items. In order to compare these constructs across groups, one crucial requirement is that they are measured equivalently or, in technical jargon, that measurement invariance (MI) holds across the groups. This study compared the performance of multiple group categorical confirmatory factor analysis (MG-CCFA) and multiple group item response theory (MG-IRT) in testing MI with ordinal data. A simulation study was conducted to compare the true-positive rate (TPR) and false-positive rate (FPR), both at the scale and at the item level, for these two approaches under an invariance and a non-invariance scenario. The results showed that the performance, in terms of TPR, of MG-CCFA- and MG-IRT-based approaches mostly depends on scale length. In fact, for long scales, the likelihood ratio test (LRT) approach for MG-IRT outperformed the other approaches, while, for short scales, MG-CCFA seemed generally preferable. In addition, the performance of MG-CCFA's fit measures, such as RMSEA and CFI, seemed to depend largely on the length of the scale, especially when MI was tested at the item level. General caution is recommended when using these measures, especially when MI is tested for each item individually. A decision flowchart, based on the simulation results, is provided to help summarize the findings and indicate which approach performed best in which setting.
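Since the fit measures named above are central to the results, here is a back-of-envelope Python sketch of one common formulation of RMSEA and CFI from model and baseline χ² statistics; software packages differ in details (e.g., N versus N − 1), and all numbers below are hypothetical.

```python
# One common formulation of RMSEA and CFI from chi-square statistics.
import math

def rmsea(chisq, df, n):
    # RMSEA = sqrt(max(chi2 - df, 0) / (df * (N - 1)))
    return math.sqrt(max(chisq - df, 0) / (df * (n - 1)))

def cfi(chisq_m, df_m, chisq_b, df_b):
    # CFI = 1 - noncentrality(model) / noncentrality(baseline)
    d_m = max(chisq_m - df_m, 0)
    d_b = max(chisq_b - df_b, d_m)
    return 1.0 - d_m / d_b if d_b > 0 else 1.0

print(rmsea(398.6, 264, 500))         # hypothetical model fit, N = 500
print(cfi(398.6, 264, 2310.5, 300))   # hypothetical baseline chi2/df
```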
We studied the psychometric properties of the Amsterdam Sexual Pleasure Inventory (ASPI, Vol. 1.0) to present evidence regarding its intended interpretation and use. The ASPI is a theory-based, revised self-report battery that aims to assess different domains of state and trait sexual pleasure in survey research in gender-diverse, sex-diverse, and relationship-diverse populations. We collected quantitative (n = 1371) and qualitative data (n = 637) using a cross-sectional multi-method design targeting the general (German-speaking) population. The theory-based 5-factor ESEM showed good structural validity evidence, and the PCA of the two general exploratory index scales showed acceptable structural validity evidence. Measurement invariance of the 5-factor models held for male and female (assigned-at-birth) participants and for sexually functional-scoring and dysfunctional-scoring people. Coefficient omega indicated that all scales, except those of one facet, showed acceptable to very good internal consistency reliability. The ASPI's convergent and discriminant associations with sexological and psychological constructs demonstrated good overall construct validity evidence, and the scales showed differential utility in differentiating known groups. Participants understood the items as intended and felt that the ASPI covers relevant facets of sexual pleasure. The ASPI might help researchers understand how individuals differ in experiencing sexual pleasure and how different contexts enable some people to experience pleasure while disadvantaging others.
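Coefficient omega, used above for reliability, has a simple closed form for a congeneric one-factor scale: ω = (Σλ)² / ((Σλ)² + Σθ), where λ are the factor loadings and θ the residual variances. A minimal Python sketch with hypothetical standardized loadings:

```python
# Coefficient omega for a congeneric one-factor scale (hypothetical values).
import numpy as np

def omega(loadings, residual_vars):
    s = np.sum(loadings) ** 2          # squared sum of loadings
    return s / (s + np.sum(residual_vars))

lam = np.array([0.7, 0.6, 0.8, 0.5])   # standardized loadings
theta = 1 - lam ** 2                   # uniquenesses under standardization
print(round(omega(lam, theta), 3))     # ~0.75 for these values
```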
Measurement invariance (MI) is required for validly comparing latent constructs measured by multiple ordinal self-report items. Non-invariance may occur when disregarding (group differences in) an acquiescence response style (ARS; an agreeing tendency regardless of item content). If non-invariance results solely from neglecting ARS, one should not worry about scale inequivalences but model the ARS instead. In a simulation study, we investigated the effect of ARS on MI testing, both when including ARS as a factor in the measurement model and when not. For (semi-)balanced scales, disregarding a large ARS resulted in non-invariance already at the configural level. This was resolved by including an ARS factor for all groups. For unbalanced scales, disregarding ARS did not affect testing, and including an ARS factor often resulted in non-convergence. Implications and recommendations for applied research are discussed.
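To clarify the kind of data-generating mechanism behind the simulation described above, here is an illustrative Python sketch (not the authors' actual design) of how ordinal responses on a balanced scale can be generated with an ARS factor loading positively on all items regardless of keying; all parameter values are hypothetical.

```python
# Illustrative simulation: balanced scale with a content factor whose
# loadings follow item keying, plus an ARS factor loading +0.3 everywhere.
import numpy as np

rng = np.random.default_rng(1)
n, p = 1000, 6
keying = np.array([1, -1, 1, -1, 1, -1])   # balanced: half reverse-keyed
lam_content = 0.6 * keying                 # content loadings follow keying
lam_ars = 0.3 * np.ones(p)                 # ARS ignores keying entirely

content = rng.normal(size=(n, 1))
ars = rng.normal(size=(n, 1))
noise = rng.normal(size=(n, p))
latent = content * lam_content + ars * lam_ars + noise

# Discretize the latent responses into 5 ordinal categories.
thresholds = np.array([-1.5, -0.5, 0.5, 1.5])
responses = np.digitize(latent, thresholds)  # integer categories 0..4
print(responses[:3])
```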