Customizable End-To-End Optimization Of Online Neural Network-Supported Dereverberation For Hearing Devices

Jean-Marie, Lemercier,; Thiemann, Joachim; Koning, Raphael; Gerkmann, Timo

doi:10.1109/icassp43922.2022.9746235

Cited by 7 publications

(16 citation statements)

References 28 publications

(54 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…We noticed in our previous study [25] that although the energy residing in the moderate reverberation range corresponding to the filter length was particularly suppressed when training the approach in an end-to-end fashion, residual late reverberation could still be heard at the output. A further processing stage could be dedicated to removing this residual reverberation, as increasing the length of the linear filters results in rapidly increasing computational complexity and training difficulty.…”

Section: Introductionmentioning

confidence: 92%

“…In [12], [20], the DNN is optimized with a mean-squared error (MSE) criterion on the masked output. In contrast, in our previous work [25] we proposed to use the Kullback-Leibler (KL) divergence [31]:…”

Section: Dnn-based Psd Estimationmentioning

confidence: 98%

“…1) End-to-end criterion and objectives: In [25], we showed that the mismatch between the DNN-optimization criterion (7) and the dereverberation task limited the overall performance. On the other hand, we argued that using ASR as an end-toend training criterion, as is done in [24], is not necessarily the best choice in order to optimize a dereverberation algorithm for hearing-aid users.…”

Section: End-to-end Training Proceduresmentioning

confidence: 99%

“…An end-to-end procedure using ASR was also introduced in [24] to optimize a DNN used for online dereverberation. In contrast, we proposed to use a criterion directly on the output signal rather than using ASR in our previous work [25]. We argued that it was more likely to improve instrumentally predicted speech intelligibility and quality.…”

Section: Introductionmentioning

confidence: 99%

“…We show with the newly introduced metrics that this latter stage particularly benefits from strong dereverberation within the linear filter range obtained with the previous end-to-end WPE approach. Finally, we customize the presented two-stage algorithm to hearing listener classes by adapting the training target and algorithm parameters as in previous work [25].…”

Section: Introductionmentioning

confidence: 99%

See 4 more Smart Citations

End-To-End Optimization of Online Neural Network-supported Two-Stage Dereverberation for Hearing Devices

Jean-Marie¹,

Thiemann²,

Koning³

et al. 2022

Preprint

Self Cite

View full text Add to dashboard Cite

A two-stage online dereverberation algorithm for hearing devices is presented in this paper. The approach combines a multi-channel multi-frame linear filtering approach with a single-channel single-frame post-filter. Both components rely on power spectral density (PSD) estimates provided by deep neural networks (DNNs). This contribution extends our prior work, which shows that directly optimizing for a criterion at the output of the multi-channel linear filtering stage results in a more efficient dereverberation, as compared to placing the criterion at the output of the DNN to optimize the PSD estimation. In the present work, we show that the dereverberation performance of the proposed first stage particularly improves the early-to-mid reverberation ratio if trained end-to-end. We thus argue that it can be combined with a post-filtering stage which benefits from the early-to-mid ratio improvement and is consequently able to efficiently suppress the residual late reverberation. This proposed two stage procedure is shown to be both very effective in terms of dereverberation performance and computational demands. Furthermore, the proposed system can be adapted to the needs of different types of hearing-device users by controlling the amount of reduction of early reflections. The proposed system outperforms the previously proposed end-to-end DNNsupported linear filtering algorithm, as well as other traditional approaches, based on an evaluation using the noise-free version of the WHAMR! dataset.

show abstract

Section: Introductionmentioning

confidence: 92%