2022
DOI: 10.48550/arxiv.2212.11851
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(2 citation statements)
references
References 0 publications
0
2
0
Order By: Relevance
“…This research area can be categorized into two groups: conditional methods, which require specialized training for specific problems, and zero-shot methods, which leverage priors from unconditional diffusion models. Within the category of conditional models, several works target speech enhancement [33]- [36], image deblurring [8], and JPEG reconstruction [37], among others. It may be noted that these methods all require pairs of clean/degraded samples and a well-thought training data pipeline.…”
Section: B Diffusion Models For Blind Inverse Problemsmentioning
confidence: 99%
See 1 more Smart Citation
“…This research area can be categorized into two groups: conditional methods, which require specialized training for specific problems, and zero-shot methods, which leverage priors from unconditional diffusion models. Within the category of conditional models, several works target speech enhancement [33]- [36], image deblurring [8], and JPEG reconstruction [37], among others. It may be noted that these methods all require pairs of clean/degraded samples and a well-thought training data pipeline.…”
Section: B Diffusion Models For Blind Inverse Problemsmentioning
confidence: 99%
“…We then use the denoised recording as the observations that will be used for the warm initialization and for the guidance of the diffusion-based generation. A similar strategy was used for the purpose of speech enhancement in [36]. Fig.…”
Section: Application To Historical Recordingsmentioning
confidence: 99%