2007 IEEE Workshop on Automatic Speech Recognition &Amp; Understanding (ASRU) 2007
DOI: 10.1109/asru.2007.4430198
|View full text |Cite
|
Sign up to set email alerts
|

Multiple feature combination to improve speaker diarization of telephone conversations

Abstract: We report results on speaker diarization of telephone conversations. This speaker diarization process is similar to the multistage segmentation and clustering system used in broadcast news. It consists of an initial acoustic change point detection algorithm, iterative Viterbi re-segmentation, gender labeling, agglomerative clustering using a Bayesian information criterion (BIC), followed by agglomerative clustering using stateof-the-art speaker identification methods (SID) and Viterbi resegmentation using Gaus… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2008
2008
2010
2010

Publication Types

Select...
4

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(2 citation statements)
references
References 8 publications
0
2
0
Order By: Relevance
“…For telephone conversations, we showed in [8] that Viterbi resegmentation following SID clustering reduces the DER by 10%. We experimented with a similar module for broadcast news diarization.…”
Section: Viterbi Re-segmentation Using Gmmsmentioning
confidence: 97%
“…For telephone conversations, we showed in [8] that Viterbi resegmentation following SID clustering reduces the DER by 10%. We experimented with a similar module for broadcast news diarization.…”
Section: Viterbi Re-segmentation Using Gmmsmentioning
confidence: 97%
“…For example, in [5] 12 dimensional Mel cepstral Frequency coefficient (MFCC) feature vectors are used and the value of the threshold λ is 12.0. In [12] 26 MFCCs (12 MFCCs + energy + their first derivatives) are used and λ was defined as 3.0. The computation of an adaptive threshold λ for unsupervised speaker segmentation using the BIC is presented in [13].…”
Section: Introductionmentioning
confidence: 99%