The Mandarin Chinese auditory emotions stimulus database: A validated set of Chinese pseudo-sentences

Gong, Bingyan; Li, Na; Li, Qiuhong; Yan, Xinyuan; Chen, Jing; Li, Liang; Wu, Xihong; Wu, Chao

doi:10.3758/s13428-022-01868-7

Cited by 7 publications

(6 citation statements)

References 66 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…This value is also comparable to the existing audio-only emotional speech databases that do not consider the intensity factor: 84.24% for the Arabic KSUEmotions database [14], 75.5% for the Bangla SUBESCO database [16], 67.3% for the Danish DES database [19], 85% for the German EMO-DB database [24], and 80% for the Italian EMOVO database [25]. The unbiased hit rates were also computed to account for response biases, which are commonly used to assess the validity of emotional speech databases [16], [17], [22], [61]. The CADKES obtained overall unbiased hit rates of 69% across intensities, which were comparable to the RAVDESS stimuli in the audio modality with overall unbiased hit rates of 46.5% across intensities.…”

Section: Discussionmentioning

confidence: 99%

“…In the study of speech emotion recognition, validated emotional speech databases constitute a crucial building block for developing and evaluating speech emotion recognizers [13]. To date, numerous emotional speech databases have been created in many languages, including Arabic [14], [15], Bangla [16], Mandarin Chinese [17], [18], Danish [19], English [20]- [22], German [23], [24], Italian [25], and Persian [26]. However, the majority of the databases come from highresource languages such as English, Mandarin Chinese, and German [27].…”

Section: Introductionmentioning

confidence: 99%

“…Recently, there has been a growing need to address the role of emotional intensity in constructing emotional speech databases, given that emotional expressions with higher intensity tend to be more accurately recognized than their counterparts with lower intensity [31]- [34]. However, only a handful of databases have been explicitly designed to manipulate the levels of intensity of vocal emotional expressions [17], [18]. Specifically, there exists no such resource for the Korean language.…”

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

Chung-Ang Auditory Database of Korean Emotional Speech: A Validated Set of Vocal Expressions With Different Intensities

Nam

Lee

2022

IEEE Access

View full text Add to dashboard Cite

A growing body of evidence indicates that intensity plays a role in emotion perception. However, only a few databases have been explicitly designed to provide emotional stimuli that are expressed at varying intensities. We developed and validated a Korean audio-only database of emotional expressions. Eighteen actors were recorded using twenty-five sentences with strong and moderate intensities for "neutral," "happiness," "sadness," "anger," "fear," and "boredom" emotions. Twenty-five native Korean-speaking adults completed the emotion identification and naturalness rating tasks. All listeners were presented with the full set of 5400 recordings in a six-alternative forced-choice paradigm, yielding 135000 judgements for identification and naturalness, respectively. Raw and unbiased hit rates were calculated, with identification responses significantly above chance level for every emotion at both intensities. The overall raw hit rates reached 87% and 78% for the strong and moderate stimuli, respectively, indicating that strong emotional expressions were more accurately identified than their moderate counterparts. Similarly, a recognition advantage for strong intensity over moderate intensity was observed for each emotion at both intensities. High inter-and intra-rater reliabilities were found in listeners' identifying emotion categories and assigning naturalness ratings, respectively. Further, there was a strong association between identification accuracy and the degree of naturalness; more natural variants of an emotion were more accurately identified than its less natural counterparts. These results confirm that the proposed database will serve as a valuable source for emotion research.

show abstract

Section: Discussionmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Chung-Ang Auditory Database of Korean Emotional Speech: A Validated Set of Vocal Expressions With Different Intensities

Nam

Lee

2022

IEEE Access

View full text Add to dashboard Cite

show abstract

“…The target speech stimuli were selected from the Mandarin Chinese auditory emotions stimulus database [31]. We selected 288 sounds (auditory sentences) from 5 speakers (Z.Y.L., Z.Q.J., C.L., T.S.S., and Z.Y.F.…”

Section: Stimulimentioning

confidence: 99%

“…To test whether the difference in the SR across emotional categories was related to the acoustic features of the target speech, we conducted a linear regression, with the SR accuracy (across 4 SNRs) as the dependent variable and the emotion category recognition (ECR) accuracy (30) and 11 acoustic parameters (duration, F0 mean, F0 SD, F0 max, F0 min, jitter [local], shimmer [local], root mean square (RMS) amplitude, harmonics-to-noise ratio, the spectral center of gravity, and spectral spread) [31] as independent variables (intensity and F0 range were removed from the model because there was high collinearity between intensity and RMS amplitude and between F0 range and F0 standard deviation). The results showed that for both HPs and SCHs, a better SR was positively associated with ECR accuracy and negatively associated with speech duration and local shimmer (for SCHs, adjusted R 2 = 0.187, F = 6.5, p < 0.001; for HPs, adjusted R 2 = 0.193, F = 6.7, p < 0.001), while holding the values of all other independent variables constant (Fig.…”

Section: Associations Between the Sr And The Acoustic Characteristics...mentioning

confidence: 99%

Deficits in Prosodic Speech-in-Noise Recognition in Schizophrenia Patients and Its Association with Psychiatric Symptoms

She,

Gong,

et al. 2024

Preprint

Self Cite

View full text Add to dashboard Cite

Background Uncertainty in speech perception and emotional disturbances are intertwined with psychiatric symptoms. How prosody embedded in target speech affects speech-in-noise recognition (SR) and is related to psychiatric symptoms in patients with schizophrenia remains unclear. This study aimed to examine the neural substrates of prosodic SR deficits and their associations with psychiatric symptom dimensions in patients with schizophrenia. Methods Fifty-four schizophrenia patients (SCHs) and 59 healthy control participants (HPs) completed the SR task (the target pseudosentences were uttered in neutral, happy, sad, angry, fear, and disgust prosody), positive and negative syndrome scale (PANSS) assessment, and magnetic resonance imaging scanning. We examined the deficits of the six prosodic SRs in schizophrenia patients and examined their associations with brain gray matter volume (GMV) reduction and psychiatric symptoms. Results Negative prosody worsened SR and reduced SR change rates across groups. SCHs had lower rates of change in prosodic SR and SR than HPs. Prosodic SR was associated with acoustic features. The GMV PLS component (covering 47 brain regions with group differences) was related to group differences in the six prosodic SRs. A happy SR was associated with the PANSS total, negative, and general scores after adjusting for covariates. Conclusions A better prosodic SR was related to better emotional salience, shorter duration, and lower shimmer (local) of the target sentences. The prosodic SR abnormalities in SCHs were associated with brain GMV reductions in the regions involved in sensorimotor, speech, and emotion processing. These findings suggest the possibility of improving negative symptoms by improving a happy SR in schizophrenia patients based on neuroplasticity.

show abstract

Poverty of the stimulus and its role in learning the Chinese language according to syntactic content

Xiaowan

2024

Curr Psychol

View full text Add to dashboard Cite

The Mandarin Chinese auditory emotions stimulus database: A validated set of Chinese pseudo-sentences

Cited by 7 publications

References 66 publications

Chung-Ang Auditory Database of Korean Emotional Speech: A Validated Set of Vocal Expressions With Different Intensities

Chung-Ang Auditory Database of Korean Emotional Speech: A Validated Set of Vocal Expressions With Different Intensities

Deficits in Prosodic Speech-in-Noise Recognition in Schizophrenia Patients and Its Association with Psychiatric Symptoms

Poverty of the stimulus and its role in learning the Chinese language according to syntactic content

Contact Info

Product

Resources

About