Comparative evaluation of three situational judgment test response formats in terms of construct-related validity, subgroup differences, and susceptibility to response distortion.

Arthur, Winfred; Glaze, Ryan M.; Jarrett, Steven; White, Craig D; Schurig, Ira; Taylor, Jason E.

doi:10.1037/a0035788

Cited by 45 publications

(32 citation statements)

References 38 publications

(81 reference statements)

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…In support of the SJTs-as-methods perspective, several meta-analyses have found that SJT scores indeed relate to general mental ability and personality variables (Arthur et al, 2014;McDaniel & Nguyen, 2001;McDaniel et al, , 2007. However, under this perspective, the internal measurement structure specific to SJTs is essentially sidestepped and, as a result, there is no way of knowing what it is about SJTs that is reliable and, thus, leads to observed correlations with externally measured constructs.…”

Section: Practitioner Pointsmentioning

confidence: 99%

The internal structure of situational judgement tests reflects candidate main effects: Not dimensions or situations

Jackson

LoPilato

Hughes

et al. 2016

J Occupat & Organ Psyc

View full text Add to dashboard Cite

Despite their popularity and capacity to predict performance, there is no clear consensus on the internal measurement characteristics of situational judgement tests (SJTs). Contemporary propositions in the literature focus on treating SJTs as methods, as measures of dimensions, or as measures of situational responses. However, empirical evidence relating to the internal structure of SJT scores is lacking. Using generalizability theory, we decomposed multiple sources of variance for three different SJTs used with different samples of job candidates (N1 = 2,320; N2 = 989; N3 = 7,934). Results consistently indicated that (1) the vast majority of reliable observed score variance reflected SJT‐specific candidate main effects, analogous to a general judgement factor, and that (2) the contribution of dimensions and situations to reliable SJT variance was, in relative terms, negligible. These findings do not align neatly with any of the proposals in the contemporary literature; however, they do suggest an internal structure for SJTs. Practitioner points To help optimize reliable variance, overall‐level aggregation should be used when scoring SJTs. The majority of reliable variance in SJTs reflects a general performance factor, relative to variance pertaining to specific dimensions or situations. SJT‐based developmental feedback should be delivered in terms of general SJT performance rather than on performance relating to specific dimensions or situations. Generalizability theory, although underutilized in organizational multifaceted measurement, offers an approach to informing on the psychometric properties of SJTs that is well suited to the complexities of SJT measurement designs.

show abstract

Section: Practitioner Pointsmentioning

confidence: 99%

The internal structure of situational judgement tests reflects candidate main effects: Not dimensions or situations

Jackson

LoPilato

Hughes

et al. 2016

J Occupat & Organ Psyc

View full text Add to dashboard Cite

show abstract

“…As these results indicate, it took on average 8.55 more minutes to complete the personality measure on a mobile device (d = −0.49, p < .05). Whereas it is impossible to specify the exact reasons for this difference, the pattern of results is consonant with the fact that longer response latencies are associated with activities and tasks that have higher cognitive demands (Arthur et al, 2014;Bassili & Scott, 1996;Yan & Tourangeau, 2008). The higher cognitive demands arise from the structural differences (and challenges) such as increased scrolling time, interface manipulation, and comprehension time associated with using a small screen mobile device.…”

Section: Because They Have Smaller Screens and Interfaces Does It Tamentioning

confidence: 99%

The Use of Mobile Devices in High‐stakes Remotely Delivered Assessments and Testing

Arthur

Doverspike

Muñoz

et al. 2014

Int J Selection Assessment

Self Cite

View full text Add to dashboard Cite

With Internet access no longer restricted to desktop and laptop computers, job applicants now have the opportunity to complete remotely delivered assessments on mobile, handheld small screen devices such as smartphones, and personal digital assistants. In this study, a large dataset is used to investigate demographic and score differences between job applicants who completed a remotely delivered high-stakes assessment on a mobile device and those who completed it on a nonmobile device. Based on a sample of 3,575,207 job applicants who completed an unproctored Internet-based assessment between January 2011 and April 2012, the percentage of applicants completing the assessment on a mobile device was small, 1.93%, but nevertheless represented more than 69,000 people. Overall, there were small test-taker demographic differences in the use of mobile devices versus nonmobile devices in that mobile devices were slightly more likely to be used by women, AfricanAmericans and Hispanics, and younger applicants. Scores on a personality measure were similar for mobile and nonmobile devices but scores on a general mental ability test were substantially lower for mobile devices. Tests of measurement invariance also indicated equivalence across the mobile and nonmobile samples. Test taker and organizational implications for completing remotely delivered high-stakes noncognitive and cognitive assessments on mobile versus nonmobile devices are discussed.

show abstract

“…However, past research demonstrated that design variations in SJTs appear to influence subgroup differences. For example, knowledge‐based response instructions showed slightly higher ethnic and sex group differences (Whetzel et al, ) which seems to be due to the higher cognitive load of knowledge‐based response instructions (Whetzel et al, ; see also McDaniel et al, ) (for other examples, see Arthur et al, ; McDaniel, Psotka, Legree, Yost, & Weekley, ; Weng, Yang, Lievens, & McDaniel, ).…”

Section: Discussionmentioning

confidence: 99%

Subgroup differences in situational judgment test scores: Evidence from large applicant samples

Herde

Lievens

Jackson

et al. 2019

Int J Selection Assessment

View full text Add to dashboard Cite

To promote diversity in organizations it is important to have accurate knowledge about subgroup differences associated with selection procedures. However, current estimates of subgroup differences in situational judgment tests (SJTs) are overwhelmingly based on range‐restricted incumbent samples that are downwardly biased. This study provides much‐needed applicant level estimates of SJT subgroup differences (N = 37,530). As a key finding, Black‐White differences (d = 0.66) were higher than in incumbent samples (d = 0.38). Overall, sex differences were small. Females scored higher for management jobs (d = −0.13) and males scored higher for administrative jobs (d = 0.15). By analyzing applicant samples that do not suffer from range restriction, this study adds knowledge about subgroup differences in SJTs.

show abstract

Comparative evaluation of three situational judgment test response formats in terms of construct-related validity, subgroup differences, and susceptibility to response distortion.

Cited by 45 publications

References 38 publications

The internal structure of situational judgement tests reflects candidate main effects: Not dimensions or situations

The internal structure of situational judgement tests reflects candidate main effects: Not dimensions or situations

The Use of Mobile Devices in High‐stakes Remotely Delivered Assessments and Testing

Subgroup differences in situational judgment test scores: Evidence from large applicant samples

Contact Info

Product

Resources

About