“…Many other cognitive tasks that use difference scores to measure a specific cognitive component also produce robust effects at the group level, but fail to show reliability as a measure of individual differences. For example, the commonly used difference scores reflecting the ability to resist interference in the Stroop and flanker tasks show low reliability (Paap & Sawi, 2016; Paap, Anders-Jefferson, Zimiga, Mason, & Mikulinsky, 2020; Siegrist, 1997; Von Bastian et al., 2016). These measures are also only weakly correlated with each other (Prior et al., 2017; Rey-Mermet, Gade, & Oberauer, 2018; Rouder & Haaf, 2019) even though they are thought to rely on similar processes (Draheim et al., 2020).…”