2019
DOI: 10.1109/taslp.2019.2919183
|View full text |Cite
|
Sign up to set email alerts
|

Multichannel Online Dereverberation Based on Spectral Magnitude Inverse Filtering

Abstract: This paper addresses the problem of multichannel online dereverberation. The proposed method is carried out in the short-time Fourier transform (STFT) domain, and for each frequency band independently. In the STFT domain, the timedomain room impulse response is approximately represented by the convolutive transfer function (CTF). The multichannel CTFs are adaptively identified based on the cross-relation method, and using the recursive least square criterion. Instead of the complexvalued CTF convolution model,… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
3

Relationship

2
6

Authors

Journals

citations
Cited by 12 publications
(3 citation statements)
references
References 49 publications
0
3
0
Order By: Relevance
“…Using this model, the narrowband inverse filtering techniques have been developed for speech dereverberation in [32] based on channel equalization, and in [8], [33] based on linear prediction.…”
Section: B Stft Domain Formulationmentioning
confidence: 99%
“…Using this model, the narrowband inverse filtering techniques have been developed for speech dereverberation in [32] based on channel equalization, and in [8], [33] based on linear prediction.…”
Section: B Stft Domain Formulationmentioning
confidence: 99%
“…In the case of speech, reverberations result in loss of intelligibility for people who are hard of hearing and non-native listeners in noiseless enclosures. In noisy enclosures, reverberations reduce the intelligibility even for people with normal listening [ 9 ]. Both noise and reverberation play havoc when the generated speech is already less intelligible (e.g.…”
Section: Introductionmentioning
confidence: 99%
“…Finally, the STFT coefficients of target speech (the direct-path speech) are estimated from h[f, t, :] by an output linear layer. The narrow-band block processes frequencies independently, and regular SA and time-convolutional feed forward network (T-ConvFFN) are used for modeling the rich temporal-spatial information in one frequency [6,[20][21][22][23][24] in an offline way. The cross-band block processes time frames independently, which will be kept unchanged and not described in this work, please refer to [13] for more details.…”
Section: Introductionmentioning
confidence: 99%