ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2022
DOI: 10.1109/icassp43922.2022.9746235
|View full text |Cite
|
Sign up to set email alerts
|

Customizable End-To-End Optimization Of Online Neural Network-Supported Dereverberation For Hearing Devices

Abstract: This work focuses on online dereverberation for hearing devices using the weighted prediction error (WPE) algorithm. WPE filtering requires an estimate of the target speech power spectral density (PSD). Recently deep neural networks (DNNs) have been used for this task. However, these approaches optimize the PSD estimate which only indirectly affects the WPE output, thus potentially resulting in limited dereverberation. In this paper, we propose an endto-end approach specialized for online processing, that dire… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
16
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
1

Relationship

2
4

Authors

Journals

citations
Cited by 7 publications
(16 citation statements)
references
References 28 publications
(54 reference statements)
0
16
0
Order By: Relevance
“…We noticed in our previous study [25] that although the energy residing in the moderate reverberation range corresponding to the filter length was particularly suppressed when training the approach in an end-to-end fashion, residual late reverberation could still be heard at the output. A further processing stage could be dedicated to removing this residual reverberation, as increasing the length of the linear filters results in rapidly increasing computational complexity and training difficulty.…”
Section: Introductionmentioning
confidence: 92%
See 4 more Smart Citations
“…We noticed in our previous study [25] that although the energy residing in the moderate reverberation range corresponding to the filter length was particularly suppressed when training the approach in an end-to-end fashion, residual late reverberation could still be heard at the output. A further processing stage could be dedicated to removing this residual reverberation, as increasing the length of the linear filters results in rapidly increasing computational complexity and training difficulty.…”
Section: Introductionmentioning
confidence: 92%
“…In [12], [20], the DNN is optimized with a mean-squared error (MSE) criterion on the masked output. In contrast, in our previous work [25] we proposed to use the Kullback-Leibler (KL) divergence [31]:…”
Section: Dnn-based Psd Estimationmentioning
confidence: 98%
See 3 more Smart Citations