2021
DOI: 10.1186/s40536-021-00110-8
|View full text |Cite
|
Sign up to set email alerts
|

Does the choice of response time threshold procedure substantially affect inferences concerning the identification and exclusion of rapid guessing responses? A meta-analysis

Abstract: Background In testing contexts that are predominately concerned with power, rapid guessing (RG) has the potential to undermine the validity of inferences made from educational assessments, as such responses are unreflective of the knowledge, skills, and abilities assessed. Given this concern, practitioners/researchers have utilized a multitude of response time threshold procedures that classify RG responses in these contexts based on either the use of no empirical data (e.g., an arbitrary time … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
16
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
7
1

Relationship

3
5

Authors

Journals

citations
Cited by 17 publications
(19 citation statements)
references
References 49 publications
1
16
0
Order By: Relevance
“…For instance, if primary studies employed response time thresholds that were too strict, RG may have been underidentified (Wise & Kuhfeld, 2020), leading to potentially understated score distortion. This may be reflected in our descriptive results in which large differences in the percentage of RG responses identified by response time threshold procedures were observed, supporting prior research (Rios & Deng, 2021). Although researchers have utilized multiple proxies to identify RG (e.g., Harmes & Wise, 2016;Sahin & Colvin, 2020), they are all indirect indicators of examinee behaviors that make strong assumptions to infer the occurrence of RG behavior.…”
Section: Limitations and Directions For Future Researchsupporting
confidence: 86%
See 1 more Smart Citation
“…For instance, if primary studies employed response time thresholds that were too strict, RG may have been underidentified (Wise & Kuhfeld, 2020), leading to potentially understated score distortion. This may be reflected in our descriptive results in which large differences in the percentage of RG responses identified by response time threshold procedures were observed, supporting prior research (Rios & Deng, 2021). Although researchers have utilized multiple proxies to identify RG (e.g., Harmes & Wise, 2016;Sahin & Colvin, 2020), they are all indirect indicators of examinee behaviors that make strong assumptions to infer the occurrence of RG behavior.…”
Section: Limitations and Directions For Future Researchsupporting
confidence: 86%
“…Response times have been the most popular source of information utilized to identify RG in both the literature (e.g., Rios et al, 2017) and operational testing contexts (e.g., Wise & Kuhfeld, 2020). In this approach, a RT threshold is defined in which any response provided in less time is considered to be RG (for details, see Rios & Deng, 2021;Wise, 2017). The extensive adoption of RT as a proxy for RG is associated with its ability to be unobtrusive (i.e., examinees are unaware that their behavior is tracked) and identify RG on an item-by-item basis for each examinee (e.g., Silm et al, 2020).…”
Section: Identification Of Rapid Guessing In Practicementioning
confidence: 99%
“…In this case, the latent ignorability assumption is unlikely to hold, and scaling models that rely on it (see [ 4 , 12 ]) will result in biased and unfair country comparisons. We are skeptical that the decision of whether a missing item response is scored as wrong should be based on a particular response time threshold [ 166 , 172 , 173 ]. Students can also be simply instructed to quickly skip items that they are not probably able to solve.…”
Section: Discussionmentioning
confidence: 99%
“…Within the response time threshold framework, it is presumed that response times monotonically increase based on examinee effort, and thus, response time distributions between RG and solution behavior (SB) are anticipated to be distinct (i.e., bimodal). Given this assumption, identification of RG requires that a threshold is identified to differentiate between these distributions, which can be accomplished by using heuristic rules (e.g., any response provided is less than three seconds is a rapid guess), inspecting response time distributions, and combining response time and accuracy information (see Rios & Deng, 2021;Soland et al, 2019).…”
Section: Identifying Rgmentioning
confidence: 99%