2017
DOI: 10.1007/s10044-017-0612-0
|View full text |Cite
|
Sign up to set email alerts
|

A feature selection-based speaker clustering method for paralinguistic tasks

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2018
2018
2021
2021

Publication Types

Select...
4
1
1

Relationship

1
5

Authors

Journals

citations
Cited by 6 publications
(5 citation statements)
references
References 35 publications
0
5
0
Order By: Relevance
“…By reviewing the health based Computational Paralinguistics Challenge (ComParE) series and Audio/Visual Emotion Challenge (AVEC) workshops we observed a steady increase in the percentage of deep learning based entries from none in 2011 through to two-thirds in ComParE-2017 However, it is clear from this review that deep learning systems are still not yet a dominant force in speech and health. While two challenges have been won using a deep learning based approach: 2015's Parkinson's condition sub-challenge [118] and 2017's cold and flu sub-challenge [140], it is debatable given the results in [126] if the winning approach in 2015 was due to the deep learning approach or the postprocessing speaker clustering method employed [118]. Furthermore, the majority of challenge participants still use very conventional feature extraction and classification paradigms.…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…By reviewing the health based Computational Paralinguistics Challenge (ComParE) series and Audio/Visual Emotion Challenge (AVEC) workshops we observed a steady increase in the percentage of deep learning based entries from none in 2011 through to two-thirds in ComParE-2017 However, it is clear from this review that deep learning systems are still not yet a dominant force in speech and health. While two challenges have been won using a deep learning based approach: 2015's Parkinson's condition sub-challenge [118] and 2017's cold and flu sub-challenge [140], it is debatable given the results in [126] if the winning approach in 2015 was due to the deep learning approach or the postprocessing speaker clustering method employed [118]. Furthermore, the majority of challenge participants still use very conventional feature extraction and classification paradigms.…”
Section: Resultsmentioning
confidence: 99%
“…Kaya et al [125] approach, used a novel framework based on Fisher vector encoding of extracted features and cascaded normalisation to account for variability due to differing speakers characteristics and spoken content. Recently, DRN approaches were shown to produce equivalent performance as conventional linear SVM and AdaBoost systems for the eating task [126]. However, this comparison was not the main aim of the authors.…”
Section: Eating (2015)mentioning
confidence: 98%
“…As can be observed, our system obtains the second best result, only from behind [5], that was the winner of the 2014 COMPARE challenge for CL level classification. [30] 63.10 Fusion of [2], MFCC+SDC supervectors and SCF supervectors + GMM-SVM [3] 63.70 Feature Selection + Speaker Clustering + SVM [31] 64.80 LSTM Unnormalized Saliency (this paper) 66.80 Fusion of 4 Speech Streams + i-Vectors + SVM [5] 68.90…”
Section: Resultsmentioning
confidence: 99%
“…However, for specific computational paralinguistic tasks it was shown (see e.g. [22,26,66,67]) that speaker-wise feature standardization might assist the subsequent classification steps. Therefore, in our preliminary tests we also experimented with this standardization approach for both the original and the PTFE feature sets.…”
Section: Feature Standardizationmentioning
confidence: 99%