Abstract:The human auditory efferent system may play a role in improving speech-in-noise recognition with an associated range of time constants. Computational auditory models with efferent-inspired feedback demonstrate improved speech-in-noise recognition with long efferent time constants (2000 ms). This study used a similar model plus an Automatic Speech Recognition (ASR) system to investigate the role of shorter time constants. ASR speech recognition in noise improved with efferent feedback (compared to no-efferent f… Show more
“…The general trend for an increase in speech recognition scores in the absence of efferent activation with increasing SNR is similar to that reported by [16] and [20] for SNR values up to 20 dB. Overall, for SNRs of 0 to 20 dB, there was an improvement in speech recognition with efferent activation, compared to when there was no efferent activation within the model.…”
Section: A Comparing Effects Of Um and Am Noise On Speech Recognitionsupporting
confidence: 84%
“…In Experiment 2 of the current study, speech recognition in noise was also measured (using the same model settings) in AM pink noise using AM rates spread across 1-14 Hz (comparable to envelope fluctuation rates in speech). The results will allow for a direct comparison with Experiment 1 (which used UM pink noise), as well as comparison with earlier studies (measuring speech recognition in speech-shaped/babble noise) using auditory models with longer efferent time constants [14]- [16], [20].…”
Section: Introductionmentioning
confidence: 83%
“…In Experiment 1 of the current study, speech recognition in noise was measured in UM pink noise with time constants ranging from 50 to 2000 ms. The results will allow for a direct comparison with earlier studies measuring speech recognition in UM pink noise, using an auditory model with a single long efferent time constant of 2000 ms [14], [16] or time constants within a range of 118-2000 ms [20]. In Experiment 2 of the current study, speech recognition in noise was also measured (using the same model settings) in AM pink noise using AM rates spread across 1-14 Hz (comparable to envelope fluctuation rates in speech).…”
Section: Introductionmentioning
confidence: 89%
“…In the present study, the efferent decay time constants tested within the model were: 50, 70, 86, and 100 ms, in order to make a comparison with the shorter range of human auditory efferent time constants recorded using OAEs [18]; 118 ms, an efferent decay constant reported by [28] using human psychoacoustical measures; 200 ms, 450 ms, and 1000 ms which are within the range of slow and medium human efferent decay constants recorded using OAEs [18]; and 2000 ms, in order to make a direct comparison with two previous studies [14], [16]. UM pink noise was used in order to make a comparison with earlier findings [14], [16], [20], and AM pink noise was used in order to investigate the interaction of noise modulation with efferent time constants. UM pink noise or AM pink noise will hereafter be referred to as UM noise or AM noise.…”
Section: Introductionmentioning
confidence: 93%
“…Previous studies using an auditory model [13], have used only long efferent time constants (e.g., 2000 ms), in order to study speech recognition in pink noise [14], [16]. In order to investigate the role of auditory efferent activation across the range of time constants recorded in humans [18], a previous study by the authors [20] investigated speech recognition in unmodulated (UM) pink noise with an auditory model [13] using shorter time constants (118, 200, 450, 1000 ms), as well as the longer time constant of 2000 ms originally used in earlier studies. A range of SNRs was used in the study (−10, −5, 0, 5, 7, 10, 12, 15 and 20 dB).…”
Physiological and psychophysical methods allow for an extended investigation of ascending (afferent) neural pathways from the ear to the brain in mammals, and their role in enhancing signals in noise. However, there is increased interest in descending (efferent) neural fibers in the mammalian auditory pathway. This efferent pathway operates via the olivocochlear system, modifying auditory processing by cochlear innervation and enhancing human ability to detect sounds in noisy backgrounds. Effective speech intelligibility may depend on a complex interaction between efferent time-constants and types of background noise. In this study, an auditory model with efferent-inspired processing provided the front-end to an automatic-speech-recognition system (ASR), used as a tool to evaluate speech recognition with changes in time-constants (50 to 2000 ms) and background noise type (unmodulated and modulated noise). With efferent activation, maximal speech recognition improvement (for both noise types) occurred for signal-to-noise ratios around 10 dB, characteristic of real-world speech-listening situations. Net speech improvement due to efferent activation (NSIEA) was smaller in modulated noise than in unmodulated noise. For unmodulated noise, NSIEA increased with increasing time-constant. For modulated noise, NSIEA increased for time-constants up to 200 ms but remained similar for longer time-constants, consistent with speech-envelope modulation times important to speech recognition in modulated noise. The model improves our understanding of the complex interactions involved in speech recognition in noise, and could be used to simulate the difficulties of speech perception in noise as a consequence of different types of hearing loss.
“…The general trend for an increase in speech recognition scores in the absence of efferent activation with increasing SNR is similar to that reported by [16] and [20] for SNR values up to 20 dB. Overall, for SNRs of 0 to 20 dB, there was an improvement in speech recognition with efferent activation, compared to when there was no efferent activation within the model.…”
Section: A Comparing Effects Of Um and Am Noise On Speech Recognitionsupporting
confidence: 84%
“…In Experiment 2 of the current study, speech recognition in noise was also measured (using the same model settings) in AM pink noise using AM rates spread across 1-14 Hz (comparable to envelope fluctuation rates in speech). The results will allow for a direct comparison with Experiment 1 (which used UM pink noise), as well as comparison with earlier studies (measuring speech recognition in speech-shaped/babble noise) using auditory models with longer efferent time constants [14]- [16], [20].…”
Section: Introductionmentioning
confidence: 83%
“…In Experiment 1 of the current study, speech recognition in noise was measured in UM pink noise with time constants ranging from 50 to 2000 ms. The results will allow for a direct comparison with earlier studies measuring speech recognition in UM pink noise, using an auditory model with a single long efferent time constant of 2000 ms [14], [16] or time constants within a range of 118-2000 ms [20]. In Experiment 2 of the current study, speech recognition in noise was also measured (using the same model settings) in AM pink noise using AM rates spread across 1-14 Hz (comparable to envelope fluctuation rates in speech).…”
Section: Introductionmentioning
confidence: 89%
“…In the present study, the efferent decay time constants tested within the model were: 50, 70, 86, and 100 ms, in order to make a comparison with the shorter range of human auditory efferent time constants recorded using OAEs [18]; 118 ms, an efferent decay constant reported by [28] using human psychoacoustical measures; 200 ms, 450 ms, and 1000 ms which are within the range of slow and medium human efferent decay constants recorded using OAEs [18]; and 2000 ms, in order to make a direct comparison with two previous studies [14], [16]. UM pink noise was used in order to make a comparison with earlier findings [14], [16], [20], and AM pink noise was used in order to investigate the interaction of noise modulation with efferent time constants. UM pink noise or AM pink noise will hereafter be referred to as UM noise or AM noise.…”
Section: Introductionmentioning
confidence: 93%
“…Previous studies using an auditory model [13], have used only long efferent time constants (e.g., 2000 ms), in order to study speech recognition in pink noise [14], [16]. In order to investigate the role of auditory efferent activation across the range of time constants recorded in humans [18], a previous study by the authors [20] investigated speech recognition in unmodulated (UM) pink noise with an auditory model [13] using shorter time constants (118, 200, 450, 1000 ms), as well as the longer time constant of 2000 ms originally used in earlier studies. A range of SNRs was used in the study (−10, −5, 0, 5, 7, 10, 12, 15 and 20 dB).…”
Physiological and psychophysical methods allow for an extended investigation of ascending (afferent) neural pathways from the ear to the brain in mammals, and their role in enhancing signals in noise. However, there is increased interest in descending (efferent) neural fibers in the mammalian auditory pathway. This efferent pathway operates via the olivocochlear system, modifying auditory processing by cochlear innervation and enhancing human ability to detect sounds in noisy backgrounds. Effective speech intelligibility may depend on a complex interaction between efferent time-constants and types of background noise. In this study, an auditory model with efferent-inspired processing provided the front-end to an automatic-speech-recognition system (ASR), used as a tool to evaluate speech recognition with changes in time-constants (50 to 2000 ms) and background noise type (unmodulated and modulated noise). With efferent activation, maximal speech recognition improvement (for both noise types) occurred for signal-to-noise ratios around 10 dB, characteristic of real-world speech-listening situations. Net speech improvement due to efferent activation (NSIEA) was smaller in modulated noise than in unmodulated noise. For unmodulated noise, NSIEA increased with increasing time-constant. For modulated noise, NSIEA increased for time-constants up to 200 ms but remained similar for longer time-constants, consistent with speech-envelope modulation times important to speech recognition in modulated noise. The model improves our understanding of the complex interactions involved in speech recognition in noise, and could be used to simulate the difficulties of speech perception in noise as a consequence of different types of hearing loss.
The focus of most existing auditory models is on the afferent system. The auditory efferent system contains descending projections from several levels of the auditory pathway, from the auditory cortex to the brainstem, that control gain in the cochlea. We developed a model with a time-varying, gain-control signal from the efferent system that includes sub-cortical ascending and descending neural pathways. The medial olivocochlear (MOC) efferent stage of the model receives excitatory projections from both fluctuation-sensitive neurons in the inferior colliculus (IC) and wide-dynamic-range neurons in the cochlear nucleus (CN). The response of the model MOC stage controlled cochlear gain dynamically. Changes in the rates of IC neurons in awake rabbit to long-duration amplitude-modulated (AM) noise were employed to adjust the parameters of the proposed model. In response to AM stimuli, physiological response rates of most IC neurons with band-enhanced (BE) modulation transfer functions (MTFs) increased over a time course consistent with the dynamics of the MOC efferent feedback. The time constant of the MOC model that best matched the IC physiology was compared to available descriptions of the MOC. Responses of the proposed subcortical model to AM noise simulate the trend of increasing rate over time, while the model without the efferent system did not show this trend.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.