Non-intrusive deep learning-based computational speech metrics with high-accuracy across a wide range of acoustic scenes

Diehl, Paula; Thorbergsson, Leifur; Singer, Yosef; Skripniuk, Vladislav; Pudszuhn, Annett; Hofmann, Veit Maria; Sprengel, Elias; Meyer-Rachner, Paul

doi:10.1371/journal.pone.0278170

Cited by 2 publications

(1 citation statement)

References 30 publications

(33 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Our custom speech quality metric is generated by a multi-stage neural network, which was trained to predict human listeners' opinion scores from noisy speech files rated by human listeners on Amazon Mechanical Turk (MTurk). The methods used to create this metric are described in detail by Diehl et al 51 .…”

Section: Methodsmentioning

confidence: 99%

Restoring speech intelligibility for hearing aid users with deep learning

Diehl

Singer²,

Zilly³

et al. 2023

Sci Rep

View full text Add to dashboard Cite

Almost half a billion people world-wide suffer from disabling hearing loss. While hearing aids can partially compensate for this, a large proportion of users struggle to understand speech in situations with background noise. Here, we present a deep learning-based algorithm that selectively suppresses noise while maintaining speech signals. The algorithm restores speech intelligibility for hearing aid users to the level of control subjects with normal hearing. It consists of a deep network that is trained on a large custom database of noisy speech signals and is further optimized by a neural architecture search, using a novel deep learning-based metric for speech intelligibility. The network achieves state-of-the-art denoising on a range of human-graded assessments, generalizes across different noise categories and—in contrast to classic beamforming approaches—operates on a single microphone. The system runs in real time on a laptop, suggesting that large-scale deployment on hearing aid chips could be achieved within a few years. Deep learning-based denoising therefore holds the potential to improve the quality of life of millions of hearing impaired people soon.

show abstract

Section: Methodsmentioning

confidence: 99%

Restoring speech intelligibility for hearing aid users with deep learning

Diehl

Singer²,

Zilly³

et al. 2023

Sci Rep

View full text Add to dashboard Cite

show abstract

Deep learning-based denoising streamed from mobile phones improves speech-in-noise understanding for hearing aid users

Diehl,

Zilly,

Sattler

et al. 2023

Front. Med. Eng.

Self Cite

View full text Add to dashboard Cite

The hearing loss of almost half a billion people is commonly treated with hearing aids. However, current hearing aids often do not work well in real-world noisy environments. We present a deep learning based denoising system that runs in real time on iPhone 7 and Samsung Galaxy S10 (25 ms algorithmic latency). The denoised audio is streamed to the hearing aid, resulting in a total delay of around 65–75 ms, depending on the phone. In tests with hearing aid users having moderate to severe hearing loss, our denoising system improves audio across three tests: 1) listening for subjective audio ratings, 2) listening for objective speech intelligibility, and 3) live conversations in a noisy environment for subjective ratings. Subjective ratings increase by more than 40%, for both the listening test and the live conversation compared to a fitted hearing aid as a baseline. Speech reception thresholds, measuring speech understanding in noise, improve by 1.6 dB SRT. Ours is the first denoising system that is implemented on a mobile device, streamed directly to users’ hearing aids using only a single channel as audio input while improving user satisfaction on all tested aspects, including speech intelligibility. This includes overall preference of the denoised and streamed signal over the hearing aid, thereby accepting the higher latency for the significant improvement in speech understanding.

show abstract

Non-intrusive deep learning-based computational speech metrics with high-accuracy across a wide range of acoustic scenes

Cited by 2 publications

References 30 publications

Restoring speech intelligibility for hearing aid users with deep learning

Restoring speech intelligibility for hearing aid users with deep learning

Deep learning-based denoising streamed from mobile phones improves speech-in-noise understanding for hearing aid users

Contact Info

Product

Resources

About