2014
DOI: 10.1016/j.csl.2012.12.002
|View full text |Cite
|
Sign up to set email alerts
|

Medium-term speaker states—A review on intoxication, sleepiness and the first challenge

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
40
0
1

Year Published

2014
2014
2017
2017

Publication Types

Select...
7
2

Relationship

3
6

Authors

Journals

citations
Cited by 50 publications
(42 citation statements)
references
References 36 publications
1
40
0
1
Order By: Relevance
“…In particular, the bulk of studies found that the rate of speech and the overall amplitude decrease after alcohol consumption, whereas the pitch variability, the mean fundamental frequency (F0), and the sentence duration increase [1]. Likewise, sleepy speech manifests changes in prosody (e. g., monotonic intonation, shifted speech rate, reduced syllable duration due to retarded cognitive speech planning), articulation (e. g., slurred pronunciation, speech errors, disfluency), and voice quality (e. g., nasal or breathy speech) [15]. Specifically, in accordance with the findings of alcohol intoxication, the sleepiness-induced changes of speech parameters include a general decrease in speech rate [16] and an increased average absolute deviation of intensity [17].…”
Section: Introductionmentioning
confidence: 99%
“…In particular, the bulk of studies found that the rate of speech and the overall amplitude decrease after alcohol consumption, whereas the pitch variability, the mean fundamental frequency (F0), and the sentence duration increase [1]. Likewise, sleepy speech manifests changes in prosody (e. g., monotonic intonation, shifted speech rate, reduced syllable duration due to retarded cognitive speech planning), articulation (e. g., slurred pronunciation, speech errors, disfluency), and voice quality (e. g., nasal or breathy speech) [15]. Specifically, in accordance with the findings of alcohol intoxication, the sleepiness-induced changes of speech parameters include a general decrease in speech rate [16] and an increased average absolute deviation of intensity [17].…”
Section: Introductionmentioning
confidence: 99%
“…This feature obtained a very competitive RMSE of 10.17, well below the Challenge test audio baseline of 14.12. Given the consistent performance of this feature in system development and in other paralinguistic tasks [24], [25], this result is not surprising. The second audio system chosen was another KL-mean system.…”
Section: Test Set Resultsmentioning
confidence: 53%
“…Mel Frequency Cepstral Features (MFCC) are one of the strongest performing spectral features, when combined with Gaussian Mixture Models (GMM), for classifying either low/high levels of depression [22], [23], or the presence/absence of depression [7], [8]. Classification using MFCCs in combination with GMM-UBM (universal background model) supervectors has recently gained popularity for many paralinguistic tasks: many entrants to similar Interspeech Challenges on speaker affect, intoxication and sleepiness have used this style of system to obtain competitive results [24], [25]. Motivated by recent results showing a decrease in energy variability with increasing levels of depression, due in part to a decrease in the motor action associated with speech production [17], [20], and by results suggesting that this decrease in variability can be captured in a GMM [22], we explore a range of GMM-UBM supervector systems in combination with Support Vector Regression (SVR) for the task of predicting depression.…”
Section: Non Linguistic Audio Cuesmentioning
confidence: 99%
“…For a more detailed description of ALC, see Schiel et al (2012). A subset of ALC was used for the INTERSPEECH 2011 Speaker State Challenge (Schuller et al, 2012).…”
Section: Speech Data Used In Experimentsmentioning
confidence: 99%