2015
DOI: 10.1109/taslp.2014.2364452
|View full text |Cite
|
Sign up to set email alerts
|

A Regression Approach to Speech Enhancement Based on Deep Neural Networks

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

3
542
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
8
2

Relationship

0
10

Authors

Journals

citations
Cited by 1,128 publications
(545 citation statements)
references
References 38 publications
3
542
0
Order By: Relevance
“…For music source separation, s k can be vocal or accompanies. In this work, we build f in the time-frequency (T-F) domain [6,11].…”
Section: Regression Based Source Separationmentioning
confidence: 99%
“…For music source separation, s k can be vocal or accompanies. In this work, we build f in the time-frequency (T-F) domain [6,11].…”
Section: Regression Based Source Separationmentioning
confidence: 99%
“…Xu et al [6] first proposed to use the mapping based approach to reduce the effect of background noise. Specifically, they trained a deep neural network (DNN) to learn a nonlinear mapping from the noisy speech spectral magnitude to that of the clean speech.…”
Section: Dnn Based Mappingmentioning
confidence: 99%
“…The performance of speech enhancement using T-F masking is affected by both T-F mask estimator and T-F transform. The recent advance of T-F mask estimator is brought by DNN-based T-F mask estimation methods [1][2][3][4][5][6][7][8][9][10][11]. While DNN-based T-F masking is ordinarily applied in short-time Fourier transform (STFT) domain, some methods designed a specific T-F transform for assisting T-F mask estimation and investigated optimal T-F domain for speech enhancement [12,13].…”
Section: Introductionmentioning
confidence: 99%