2011
DOI: 10.1016/j.compeleceng.2010.08.001
|View full text |Cite
|
Sign up to set email alerts
|

Comparison of the impact of some Minkowski metrics on VQ/GMM based speaker recognition

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
7
0

Year Published

2011
2011
2020
2020

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 11 publications
(7 citation statements)
references
References 19 publications
0
7
0
Order By: Relevance
“…For the same reason, since GMM is based on a different objective function, the differences between GMM and VQ type of models tend to be generally larger. In two recent independent studies [29,6], differences between these two clustering models were reported for different distance functions [29] and in SVM back-end setting [6]. We conclude that training methodology and data selection for UBM [30] are worth readdressing.…”
Section: Discussionmentioning
confidence: 99%
See 3 more Smart Citations
“…For the same reason, since GMM is based on a different objective function, the differences between GMM and VQ type of models tend to be generally larger. In two recent independent studies [29,6], differences between these two clustering models were reported for different distance functions [29] and in SVM back-end setting [6]. We conclude that training methodology and data selection for UBM [30] are worth readdressing.…”
Section: Discussionmentioning
confidence: 99%
“…Two recent studies include more detailed comparisons of GMM and VQ [46,29]. In [46] the MAP trained VQ outperformed MAP-trained GMM for longer training data (2.5 minutes) but the situation was reversed for 10-second speech samples.…”
Section: Review Of Clustering Methods In Speaker Recognitionmentioning
confidence: 99%
See 2 more Smart Citations
“…MSE is a common metric in the literature to compute the match score between the training and testing samples [22]. A better metric for computing match score in speaker recognition systems is a topic of on-going research [23]. MSE is computed according to the following expression.…”
Section: Speaker Recognition Using Mfccmentioning
confidence: 99%