ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2020
DOI: 10.1109/icassp40776.2020.9053188
|View full text |Cite
|
Sign up to set email alerts
|

Masking and Inpainting: A Two-Stage Speech Enhancement Approach for Low SNR and Non-Stationary Noise

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
11
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 35 publications
(15 citation statements)
references
References 10 publications
0
11
0
Order By: Relevance
“…The reason might be that the data augmentations and joint performing super-resolution can increase the generalization and inpainting ability of the model (Hao et al, 2020). The PESQ score of VF-UNet reaches 2.43, higher than SEGAN, WaveUNet, and the model trained with weakly labeled data in Kong et al (2021b).…”
Section: Super-resolutionmentioning
confidence: 98%
“…The reason might be that the data augmentations and joint performing super-resolution can increase the generalization and inpainting ability of the model (Hao et al, 2020). The PESQ score of VF-UNet reaches 2.43, higher than SEGAN, WaveUNet, and the model trained with weakly labeled data in Kong et al (2021b).…”
Section: Super-resolutionmentioning
confidence: 98%
“…They evaluated their systems on long gaps (about 500 ms), while in our work we aim at inpainting also extremely long segments (until 1600 ms), where additional information, like video, is essential to correctly restore speech signals. A very recent work proposed a two-stage enhancement network where binary masking of a noisy speech spectrogram was followed by inpainting of time-frequency bins affected by severe noise [16].…”
Section: Introductionmentioning
confidence: 99%
“…Recently, multi-stage learning has been successfully applied for a wide variety of tasks, including human pose estimation [28], action segmentation [29], speech enhancement [30]- [32] and speech separation [33]. A multi-stage architecture consists of stages that sequentially use the same model or a combination of different models, and each model operates directly on the output of the previous stage.…”
mentioning
confidence: 99%
“…Multi-stage learning systems where each stage performs a different task are considered in [30], [31], [33]. Here, each stage has a different task and a different target.…”
mentioning
confidence: 99%
See 1 more Smart Citation