Interspeech 2020 2020
DOI: 10.21437/interspeech.2020-3038
|View full text |Cite
|
Sign up to set email alerts
|

The INTERSPEECH 2020 Deep Noise Suppression Challenge: Datasets, Subjective Testing Framework, and Challenge Results

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
94
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 201 publications
(94 citation statements)
references
References 0 publications
0
94
0
Order By: Relevance
“…When training Tiny DCU-Net-16, we use the clean speech and noise speech dataset released from the INTERSPEECH 2020 DNS challenge [24]. The number of its trainable parameters is 108162 ≈ 108K.…”
Section: Methodsmentioning
confidence: 99%
“…When training Tiny DCU-Net-16, we use the clean speech and noise speech dataset released from the INTERSPEECH 2020 DNS challenge [24]. The number of its trainable parameters is 108162 ≈ 108K.…”
Section: Methodsmentioning
confidence: 99%
“…For example, the recordings from denoising algorithms may contain residuals of noise and reverberation, and the synthesized speech from vocoders may contain robotic sound. Therefore, to match the test-time conditions, we add 15-25dB noise randomly drawn from the DNS Challenge Dataset [36] to the input narrowband signal during training.…”
Section: Noise Augmentationmentioning
confidence: 99%
“…All considered algorithms were trained and evaluated on the DNS Challenge dataset [22]. In total, this dataset contains more than 500 h of speech from 2150 speakers and 180 h of noise from 150 different noise classes at a sampling frequency of 16 kHz.…”
Section: Datasetmentioning
confidence: 99%
“…More in particular, we propose to train temporal convolutional networks [11,20] to map the noisy speech STFT coefficients to the required quantities, i.e., the noise correlation matrix and the a-priori SNR, by minimizing the scale-invariant signal-to-distortion ratio loss function [21] at the MFMVDR filter output. Experimental results using the INTERSPEECH 2020 Deep Noise Suppression (DNS) Challenge dataset [22] show that the proposed deep MFMVDR filter outperforms complex-valued masking as well as directly estimating the multi-frame filter without exploiting the MFMVDR structure and Conv-TasNet [11].…”
Section: Introductionmentioning
confidence: 99%