Effect of auditory efferent time-constant duration on speech recognition in noise

Yasin, Ifat; Liu, Fangqi; Drga, Vit; Demosthenous, Andreas; Meddis, Ray

doi:10.1121/1.5023502

Cited by 8 publications

(17 citation statements)

References 18 publications

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…The general trend for an increase in speech recognition scores in the absence of efferent activation with increasing SNR is similar to that reported by [16] and [20] for SNR values up to 20 dB. Overall, for SNRs of 0 to 20 dB, there was an improvement in speech recognition with efferent activation, compared to when there was no efferent activation within the model.…”

Section: A Comparing Effects Of Um and Am Noise On Speech Recognitionsupporting

confidence: 84%

“…In Experiment 2 of the current study, speech recognition in noise was also measured (using the same model settings) in AM pink noise using AM rates spread across 1-14 Hz (comparable to envelope fluctuation rates in speech). The results will allow for a direct comparison with Experiment 1 (which used UM pink noise), as well as comparison with earlier studies (measuring speech recognition in speech-shaped/babble noise) using auditory models with longer efferent time constants [14]- [16], [20].…”

Section: Introductionmentioning

confidence: 83%

“…In Experiment 1 of the current study, speech recognition in noise was measured in UM pink noise with time constants ranging from 50 to 2000 ms. The results will allow for a direct comparison with earlier studies measuring speech recognition in UM pink noise, using an auditory model with a single long efferent time constant of 2000 ms [14], [16] or time constants within a range of 118-2000 ms [20]. In Experiment 2 of the current study, speech recognition in noise was also measured (using the same model settings) in AM pink noise using AM rates spread across 1-14 Hz (comparable to envelope fluctuation rates in speech).…”

Section: Introductionmentioning

confidence: 89%

“…In the present study, the efferent decay time constants tested within the model were: 50, 70, 86, and 100 ms, in order to make a comparison with the shorter range of human auditory efferent time constants recorded using OAEs [18]; 118 ms, an efferent decay constant reported by [28] using human psychoacoustical measures; 200 ms, 450 ms, and 1000 ms which are within the range of slow and medium human efferent decay constants recorded using OAEs [18]; and 2000 ms, in order to make a direct comparison with two previous studies [14], [16]. UM pink noise was used in order to make a comparison with earlier findings [14], [16], [20], and AM pink noise was used in order to investigate the interaction of noise modulation with efferent time constants. UM pink noise or AM pink noise will hereafter be referred to as UM noise or AM noise.…”

Section: Introductionmentioning

confidence: 93%

“…Previous studies using an auditory model [13], have used only long efferent time constants (e.g., 2000 ms), in order to study speech recognition in pink noise [14], [16]. In order to investigate the role of auditory efferent activation across the range of time constants recorded in humans [18], a previous study by the authors [20] investigated speech recognition in unmodulated (UM) pink noise with an auditory model [13] using shorter time constants (118, 200, 450, 1000 ms), as well as the longer time constant of 2000 ms originally used in earlier studies. A range of SNRs was used in the study (−10, −5, 0, 5, 7, 10, 12, 15 and 20 dB).…”

Section: Introductionmentioning

confidence: 99%

See 4 more Smart Citations

Optimizing Speech Recognition Using a Computational Model of Human Hearing: Effect of Noise Type and Efferent Time Constants

et al. 2020

Self Cite

View full text Add to dashboard Cite

Physiological and psychophysical methods allow for an extended investigation of ascending (afferent) neural pathways from the ear to the brain in mammals, and their role in enhancing signals in noise. However, there is increased interest in descending (efferent) neural fibers in the mammalian auditory pathway. This efferent pathway operates via the olivocochlear system, modifying auditory processing by cochlear innervation and enhancing human ability to detect sounds in noisy backgrounds. Effective speech intelligibility may depend on a complex interaction between efferent time-constants and types of background noise. In this study, an auditory model with efferent-inspired processing provided the front-end to an automatic-speech-recognition system (ASR), used as a tool to evaluate speech recognition with changes in time-constants (50 to 2000 ms) and background noise type (unmodulated and modulated noise). With efferent activation, maximal speech recognition improvement (for both noise types) occurred for signal-to-noise ratios around 10 dB, characteristic of real-world speech-listening situations. Net speech improvement due to efferent activation (NSIEA) was smaller in modulated noise than in unmodulated noise. For unmodulated noise, NSIEA increased with increasing time-constant. For modulated noise, NSIEA increased for time-constants up to 200 ms but remained similar for longer time-constants, consistent with speech-envelope modulation times important to speech recognition in modulated noise. The model improves our understanding of the complex interactions involved in speech recognition in noise, and could be used to simulate the difficulties of speech perception in noise as a consequence of different types of hearing loss.

show abstract

Section: A Comparing Effects Of Um and Am Noise On Speech Recognitionsupporting

confidence: 84%

Section: Introductionmentioning

confidence: 83%

Section: Introductionmentioning

confidence: 89%

Section: Introductionmentioning

confidence: 93%

Section: Introductionmentioning

confidence: 99%

See 3 more Smart Citations

Optimizing Speech Recognition Using a Computational Model of Human Hearing: Effect of Noise Type and Efferent Time Constants

et al. 2020

Self Cite

View full text Add to dashboard Cite

show abstract

Subcortical Auditory Model including Efferent Dynamic Gain Control with Inputs from Cochlear Nucleus and Inferior Colliculus

Farhadi

Jennings

Strickland

et al. 2022

Preprint

View full text Add to dashboard Cite

The focus of most existing auditory models is on the afferent system. The auditory efferent system contains descending projections from several levels of the auditory pathway, from the auditory cortex to the brainstem, that control gain in the cochlea. We developed a model with a time-varying, gain-control signal from the efferent system that includes sub-cortical ascending and descending neural pathways. The medial olivocochlear (MOC) efferent stage of the model receives excitatory projections from both fluctuation-sensitive neurons in the inferior colliculus (IC) and wide-dynamic-range neurons in the cochlear nucleus (CN). The response of the model MOC stage controlled cochlear gain dynamically. Changes in the rates of IC neurons in awake rabbit to long-duration amplitude-modulated (AM) noise were employed to adjust the parameters of the proposed model. In response to AM stimuli, physiological response rates of most IC neurons with band-enhanced (BE) modulation transfer functions (MTFs) increased over a time course consistent with the dynamics of the MOC efferent feedback. The time constant of the MOC model that best matched the IC physiology was compared to available descriptions of the MOC. Responses of the proposed subcortical model to AM noise simulate the trend of increasing rate over time, while the model without the efferent system did not show this trend.

show abstract

Effect of Time Constant on Speech Enhancement in Hearing Aids Based on Auditory Neural Feedback

Liu

Demosthenous

2020

2020 IEEE International Symposium on Circuits and Systems (ISCAS)

View full text Add to dashboard Cite

Effect of auditory efferent time-constant duration on speech recognition in noise

Cited by 8 publications

References 18 publications

Optimizing Speech Recognition Using a Computational Model of Human Hearing: Effect of Noise Type and Efferent Time Constants

Optimizing Speech Recognition Using a Computational Model of Human Hearing: Effect of Noise Type and Efferent Time Constants

Subcortical Auditory Model including Efferent Dynamic Gain Control with Inputs from Cochlear Nucleus and Inferior Colliculus

Effect of Time Constant on Speech Enhancement in Hearing Aids Based on Auditory Neural Feedback

Contact Info

Product

Resources

About