2016
DOI: 10.1109/taslp.2015.2501724
|View full text |Cite
|
Sign up to set email alerts
|

Speech Dereverberation Using Non-Negative Convolutive Transfer Function and Spectro-Temporal Modeling

Abstract: This paper presents two single-channel speech dereverberation methods to enhance the quality of speech signals that have been recorded in an enclosed space. For both methods, the room acoustics are modeled using a non-negative approximation of the convolutive transfer function (N-CTF), and to additionally exploit the spectral properties of the speech signal, such as the low-rank nature of the speech spectrogram, the speech spectrogram is modeled using non-negative matrix factorization (NMF). Two methods are de… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
32
0

Year Published

2017
2017
2019
2019

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 27 publications
(32 citation statements)
references
References 30 publications
0
32
0
Order By: Relevance
“…(Q+Õ−1) ≤ IÕ and thusÕ ≥Q −1 I−1 , (19) has an exact solution and (20) can reach zero. Otherwise, (19) is a least square problem, and only an approximate solution can be achieved.…”
Section: A Adaptive Mint In the Magnitude Domainmentioning
confidence: 99%
See 1 more Smart Citation
“…(Q+Õ−1) ≤ IÕ and thusÕ ≥Q −1 I−1 , (19) has an exact solution and (20) can reach zero. Otherwise, (19) is a least square problem, and only an approximate solution can be achieved.…”
Section: A Adaptive Mint In the Magnitude Domainmentioning
confidence: 99%
“…Note that with the superscript (p) removed, h and J denote the stationary filter and the (stationary) random variable for the squared error, respectively. At frame p, the instantaneous filtering process in (19) and the squared error (20) are a random instance of the stationary system. At frame p, the adaptive update uses the gradient of the instantaneous error J (p) at the previous estimation point h (p−1) , i.e.…”
Section: A Adaptive Mint In the Magnitude Domainmentioning
confidence: 99%
“…In [21] and [35], the T-F magnitude response of the RIR is estimated. Another approach uses the RIR magnitude response and nonnegative matrix factorization (NMF) to remove reverberation [27]. A two-stage algorithm for enhancing reverberant speech is described by Wu and Wang [43], where the first stage estimates an inverse filter and the second stage uses spectral subtraction to minimize long-term reverberation.…”
Section: Introductionmentioning
confidence: 99%
“…Most single-channel speech dereverberation techniques can be classified into inverse filtering [10], [11], nonlinear mapping [12], spectral enhancement [6], [13], [14] and probabilistic model-based methods [15]- [17]. Inverse filtering methods typically try to reconstruct the original signal by designing an inverse filter for the Room Impulse Response (RIR).…”
Section: Introductionmentioning
confidence: 99%