The Speaker and Language Recognition Workshop (Odyssey 2016) 2016
DOI: 10.21437/odyssey.2016-59
|View full text |Cite
|
Sign up to set email alerts
|

On the Use of PLDA i-vector Scoring for Clustering Short Segments

Abstract: This paper extends upon a previous work using Mean Shift algorithm to perform speaker clustering on i-vectors generated from short speech segments. In this paper we examine the effectiveness of probabilistic linear discriminant analysis (PLDA) scoring as the metric of the mean shift clustering algorithm in the presence of different number of speakers. Our proposed method, combined with k-nearest neighbors (kNN) for bandwidth estimation, yields better and more robust results in comparison to the cosine similari… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
16
0

Year Published

2016
2016
2023
2023

Publication Types

Select...
6
1

Relationship

2
5

Authors

Journals

citations
Cited by 16 publications
(17 citation statements)
references
References 15 publications
(22 reference statements)
0
16
0
Order By: Relevance
“…Mean-shift algorithm for clustering is well known [2,3], where the standard algorithm is based on Euclidean distance. As Euclidean distance is not fit to work well with i-vectors, it was first replaced by cosine distance [4,5] and later with PLDA score [6]. Another change to the standard algorithm is replacing the threshold h that determines the neighboring i-vectors, by k-nearest neighbors (kNN).…”
Section: Mean-shift Algorithmmentioning
confidence: 99%
See 3 more Smart Citations
“…Mean-shift algorithm for clustering is well known [2,3], where the standard algorithm is based on Euclidean distance. As Euclidean distance is not fit to work well with i-vectors, it was first replaced by cosine distance [4,5] and later with PLDA score [6]. Another change to the standard algorithm is replacing the threshold h that determines the neighboring i-vectors, by k-nearest neighbors (kNN).…”
Section: Mean-shift Algorithmmentioning
confidence: 99%
“…Another change to the standard algorithm is replacing the threshold h that determines the neighboring i-vectors, by k-nearest neighbors (kNN). It was found in that kNN is much less sensitive to the k value then the h parameter [6]. Let X = {xj} J j=1 be a set of i-vectors from several speakers, and let S h (x) be the set of the k nearest i-vectors, then the mean shift is given in eq.…”
Section: Mean-shift Algorithmmentioning
confidence: 99%
See 2 more Smart Citations
“…Speaker clustering is an unsupervised task of identifying which segments from a set of speech segments belong to the same speaker. It can be an inherent part in speaker diarization task [1], or it can be also a stand alone problem [2]. In this work the segments are well defined by a push to talk (PTT) button.…”
Section: Introductionmentioning
confidence: 99%