ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2020
DOI: 10.1109/icassp40776.2020.9054251
|View full text |Cite
|
Sign up to set email alerts
|

But System for the Second Dihard Speech Diarization Challenge

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
52
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
4
3
1

Relationship

1
7

Authors

Journals

citations
Cited by 46 publications
(53 citation statements)
references
References 11 publications
1
52
0
Order By: Relevance
“…For comparison, we also report the performance of several existing DIHARD challenge II submissions. The challenge top system by BUT achieves a DER value of 18.09% on the DIHARD II dev set [60]. However, it is mentioned in the paper that in their system, PLDA was adapted on the same development set.…”
Section: ) Analysis Of Experimental Resultsmentioning
confidence: 99%
“…For comparison, we also report the performance of several existing DIHARD challenge II submissions. The challenge top system by BUT achieves a DER value of 18.09% on the DIHARD II dev set [60]. However, it is mentioned in the paper that in their system, PLDA was adapted on the same development set.…”
Section: ) Analysis Of Experimental Resultsmentioning
confidence: 99%
“…We considered two methods for signal preprocessing: the speech enhancement method based on a long short-term memory (LSTM) network trained on simulated data [9] (also used in the baseline) and the weighted prediction error (WPE) [10,11] as it had proved to be useful in the Second DIHARD challenge [12]. In our experiments, we saw that using the LSTM-based speech enhancer was beneficial while the WPE method was actually harmful.…”
Section: Signal Pre-processingmentioning
confidence: 99%
“…• a deep neural network (DNN) based system with three feedforward layers receiving as input ±5 stacked frames and trained to output 10ms frame decisions (silence / speech) [12]. It was trained on part of the second DIHARD development set (the rest was used for validation while training), the train set of the "fullcorpus" partition of AMI 2 [13] (the test and development sets were used for validation while training), ICSI [14] and ISL [15] meetings.…”
Section: Voice Activity Detectionmentioning
confidence: 99%
See 1 more Smart Citation
“…These are then refined using a separate HMM. This first-pass AHC and second-pass HMM approach has proven to be effective on challenging diarisation tasks [14], and this is the approach that is adopted in this report.…”
Section: Introductionmentioning
confidence: 99%