Interspeech 2006 2006
DOI: 10.21437/interspeech.2006-566
|View full text |Cite
|
Sign up to set email alerts
|

A spectral clustering approach to speaker diarization

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
1
0

Year Published

2010
2010
2024
2024

Publication Types

Select...
3
3
2

Relationship

0
8

Authors

Journals

citations
Cited by 35 publications
(7 citation statements)
references
References 2 publications
0
1
0
Order By: Relevance
“…The most popular method for text-dependent speaker verification [9,10,11] and text-independent speaker verification [7,8,5,6] is discriminant analysis (PLDA) [5,6]. For text-independent speaker detection, hybrid techniques with deep learning-based components have also shown promise [12,13,14].…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…The most popular method for text-dependent speaker verification [9,10,11] and text-independent speaker verification [7,8,5,6] is discriminant analysis (PLDA) [5,6]. For text-independent speaker detection, hybrid techniques with deep learning-based components have also shown promise [12,13,14].…”
Section: Related Workmentioning
confidence: 99%
“…Neural networks have been used in speaker diarization systems rapidly in recent technological inventories [7], [8], [9], and [10]. In most literature, speaker diarization systems use text-based speaker verification and detection to identify the same speaker.…”
Section: Introductionmentioning
confidence: 99%
“…After model training, a temporal continuity of similarity scores can be incorporated [15]. This is done by multiplying the similarity score s(i, j) with an exponential decay given by, s (i, j) = s(i, j)β min(n b ,|i−j|) (10) where, β is a positive decay factor < 1, |i−j| is the absolute segment index difference value of embeddings from the ith and jth segment, and n b is the maximum value of the decay constant β.…”
Section: Choice Of Hyper Parametersmentioning
confidence: 99%
“…The inputs to the clustering algorithms commonly employ pre-processing techniques on the embeddings like length normalization [8], principal component analysis (PCA) [9] and PLDA based affinity matrix computation [4]. Another common approach to clustering is the spectral clustering approach [10]. In most of these approaches, the affinity matrix computation and clustering are performed as two independent steps with different cost functions.…”
Section: Introductionmentioning
confidence: 99%
“…In contrast, spectral clustering (SC; Luxburg, 2007 ) does not require a statistical metric to determine whether two clusters should be merged. Previous researches have applied SC to infer speaker clusters and achieved good performance (Iso, 2010 ; Ning et al, 2010 ), especially in speaker diarization tasks (Ning et al, 2006 ).…”
Section: Introductionmentioning
confidence: 99%