“…To test whether the difference in the SR across emotional categories was related to the acoustic features of the target speech, we conducted a linear regression, with the SR accuracy (across 4 SNRs) as the dependent variable and the emotion category recognition (ECR) accuracy (30) and 11 acoustic parameters (duration, F0 mean, F0 SD, F0 max, F0 min, jitter [local], shimmer [local], root mean square (RMS) amplitude, harmonics-to-noise ratio, the spectral center of gravity, and spectral spread) [31] as independent variables (intensity and F0 range were removed from the model because there was high collinearity between intensity and RMS amplitude and between F0 range and F0 standard deviation). The results showed that for both HPs and SCHs, a better SR was positively associated with ECR accuracy and negatively associated with speech duration and local shimmer (for SCHs, adjusted R 2 = 0.187, F = 6.5, p < 0.001; for HPs, adjusted R 2 = 0.193, F = 6.7, p < 0.001), while holding the values of all other independent variables constant (Fig.…”