Using recurrent neural networks to improve the perception of speech in non-stationary noise by people with cochlear implants

Goehring, Tobias; Keshavarzi, Mahmoud; Carlyon, Robert P.; Moore, Brian C. J.

doi:10.31234/osf.io/rukv3

2019

DOI: 10.31234/osf.io/rukv3

|View full text |Cite

Preprint

Using recurrent neural networks to improve the perception of speech in non-stationary noise by people with cochlear implants

Tobias Goehring¹,

Mahmoud Keshavarzi²,

Robert P. Carlyon³

et al.

Abstract: Speech-in-noise perception is a major problem for users of cochlear implants (CIs), especially with non-stationary background noise such as competing talkers or traffic. Algorithms that facilitate speech perception by attenuating background noise have produced benefits but relied on a priori information about the target speaker and/or background noise. We developed a recurrent neural network (RNN) algorithm for enhancing speech in non-stationary noise and evaluated its benefits for speech perception, using obj… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

Supporting

Mentioning

Contrasting

Year Published

2021

Publication Types

Select...

Article1

Relationship

Self Cite0

Independent1

Authors

Journals

Cited by 1 publication

(1 citation statement)

References 47 publications

(109 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The SE process consists of two parts: to enhance the intelligibility and quality of processed speech, and to reduce the noises in the background. Previous well-established algorithms have helped improve the SE in CI users [37], [38], [29], [39], [40], [41], [42], [43] but there are only few studies with a newly upgrading deep-learning-based algorithm. Traditional SE methods are based on identifying the difference between clean and noisy speech [44], [45], [46], [47], [48], [49].…”

mentioning

confidence: 99%

A Study of Joint Effect on Denoising Techniques and Visual Cues to Improve Speech Intelligibility in Cochlear Implant Simulation

Tseng

Wang

et al. 2021

IEEE Trans. Cogn. Dev. Syst.

View full text Add to dashboard Cite

Speech perception is key to verbal communication. For people with hearing loss, the capability to recognize speech is restricted, particularly in a noisy environment or the situations without visual cues, such as lip-reading unavailable via phone call. This study aimed to understand the improvement of vocoded speech intelligibility in cochlear implant (CI) simulation through two potential methods: Speech Enhancement (SE) and Audiovisual Integration. A fully convolutional neural network (FCN) using an intelligibility-oriented objective function was recently proposed and proven to effectively facilitate the speech intelligibility as an advanced denoising SE approach. Furthermore, audiovisual integration is reported to supply better speech comprehension compared to audio-only information. An experiment was designed to test speech intelligibility using tonevocoded speech in CI simulation with a group of normal-hearing listeners. Experimental results confirmed the effectiveness of the FCN-based denoising SE and audiovisual integration on vocoded speech. Also, it positively recommended that these two methods could become a blended feature in a CI processor to improve the speech intelligibility for CI users under noisy conditions.

show abstract

mentioning

confidence: 99%

A Study of Joint Effect on Denoising Techniques and Visual Cues to Improve Speech Intelligibility in Cochlear Implant Simulation

Tseng

Wang

et al. 2021

IEEE Trans. Cogn. Dev. Syst.

View full text Add to dashboard Cite

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Using recurrent neural networks to improve the perception of speech in non-stationary noise by people with cochlear implants

Cited by 1 publication

References 47 publications

A Study of Joint Effect on Denoising Techniques and Visual Cues to Improve Speech Intelligibility in Cochlear Implant Simulation

A Study of Joint Effect on Denoising Techniques and Visual Cues to Improve Speech Intelligibility in Cochlear Implant Simulation

Contact Info

Product

Resources

About