2013
DOI: 10.1016/j.csl.2012.06.001
|View full text |Cite
|
Sign up to set email alerts
|

Investigating fuzzy-input fuzzy-output support vector machines for robust voice quality classification

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

1
23
0
2

Year Published

2013
2013
2016
2016

Publication Types

Select...
5
4

Relationship

3
6

Authors

Journals

citations
Cited by 60 publications
(26 citation statements)
references
References 17 publications
1
23
0
2
Order By: Relevance
“…We used various voice quality descriptors including normalized amplitude quotient (NAQ), parabolic spectral parameter (PSP), maxima dispersion quotient (MDQ), quasi-open quotient (QOQ), difference between the first two harmonics (H1-H2), and peak-slope. For more details, readers are referred to [16,17,37].…”
Section: Acoustic Descriptorsmentioning
confidence: 99%
“…We used various voice quality descriptors including normalized amplitude quotient (NAQ), parabolic spectral parameter (PSP), maxima dispersion quotient (MDQ), quasi-open quotient (QOQ), difference between the first two harmonics (H1-H2), and peak-slope. For more details, readers are referred to [16,17,37].…”
Section: Acoustic Descriptorsmentioning
confidence: 99%
“…The selection and choice of features is motivated mainly by related work and previous research [32,7,33,34]. Further, they have proven to be robust representatives of the targeted prosodic phenomena.…”
Section: Audio Featuresmentioning
confidence: 99%
“…In [24] and [7] the importance of voice qualities for emotion recognition are investigated and reported. We chose the peak slope parameter for the representation of breathy to tense voice qualities as it has proven to be very robust and successful in voice quality classification tasks [34]. Lastly, the spectral stationarity measure is used as an indicator for monotonicity in speech which is associated with low activity and negative valence [32].…”
Section: Audio Featuresmentioning
confidence: 99%
“…The abbreviation std indicates that the standard deviation of the observed measure was chosen. [27], depression [28] as well as the features' relevance for characterizing voice qualities on a breathy to tense dimension [26,18]. The first three features are derived from the glottal source signal estimated by iterative adaptive inverse filtering (IAIF, [1]).…”
Section: Acoustic Descriptorsmentioning
confidence: 99%