2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) 2019
DOI: 10.1109/waspaa.2019.8937206
|View full text |Cite
|
Sign up to set email alerts
|

Generative Speech Enhancement Based on Cloned Networks

Abstract: We propose to implement speech enhancement by the regeneration of clean speech from a 'salient' representation extracted from the noisy signal. The network that extracts salient features is trained using a set of weight-sharing clones of the extractor network. The clones receive mel-frequency spectra of different noisy versions of the same speech signal as input. By encouraging the outputs of the clones to be similar for these different input signals, we train a feature extractor network that is robust to nois… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2020
2020
2020
2020

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(1 citation statement)
references
References 32 publications
0
1
0
Order By: Relevance
“…Then, given noisy speech x, this paradigm corresponds to finding z such that G θ (z) − x 2 2 is minimised. The motivation for this paradigm of approaches is that they lead to more natural-sounding enhanced speech [244]. The improved naturalness results from the model not compromising when there are equally likely speech signals.…”
Section: Generative Enhancement Paradigmmentioning
confidence: 99%
“…Then, given noisy speech x, this paradigm corresponds to finding z such that G θ (z) − x 2 2 is minimised. The motivation for this paradigm of approaches is that they lead to more natural-sounding enhanced speech [244]. The improved naturalness results from the model not compromising when there are equally likely speech signals.…”
Section: Generative Enhancement Paradigmmentioning
confidence: 99%