IberSPEECH 2018 2018
DOI: 10.21437/iberspeech.2018-44
|View full text |Cite
|
Sign up to set email alerts
|

EML Submission to Albayzin 2018 Speaker Diarization Challenge

Abstract: Speaker diarization, who is speaking when, is one of the most challenging tasks in speaker recognition, as usually no prior information is available about the identity and the number of the speakers in an audio recording. The task will be more challenging when there is some noise or music on the background and the speakers are changed more frequently. This usually happens in broadcast news conversations. In this paper, we use the EML speaker diarization system as a participation to the recent Albayzin Evaluati… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2019
2019
2019
2019

Publication Types

Select...
1
1

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 2 publications
0
2
0
Order By: Relevance
“…4 show the same experiments as in Table 2 on the evaluation set, yielding the same conclusions. We participated in the challenge with only LDA transformed supervectors and the same low cost online diarization algorithm [27] and obtained very competitive or even better results compared to the other more expensive offline diarization techniques.…”
Section: Resultsmentioning
confidence: 99%
“…4 show the same experiments as in Table 2 on the evaluation set, yielding the same conclusions. We participated in the challenge with only LDA transformed supervectors and the same low cost online diarization algorithm [27] and obtained very competitive or even better results compared to the other more expensive offline diarization techniques.…”
Section: Resultsmentioning
confidence: 99%
“…The first contrastive system used ICMCfeatures (infinite impulse response-constant Q, Mel-frequency cepstral coefficients), one second uniform segmentation, binary key (BK) representation, and AHC, while the second one used MFCC features, bidirectional LSTM based speaker change detection, triplet-loss neural embedding representation, and affinity propagation clustering. • G19-EML [35]. European Media Laboratory GmbH, Germany.…”
Section: Closed-set Condition Systemsmentioning
confidence: 99%