Abstract:The amplitude modulations (AMs) in speech signals are useful cues for speech recognition. Several adaptation mechanisms may make the detection of AM in noisy backgrounds easier when the AM carrier is presented later rather than earlier in the noise. The aim of the present study was to characterize temporal adaptation to noise in AM detection. AM detection thresholds were measured for monaural (50 ms, 1.5 kHz) pure-tone carriers presented at the onset ('early' condition) and 300 ms after the onset ('late' condi… Show more
“… Smith et al (2000) demonstrated that, in macaques, the tone threshold increment with CAS remained to some extent when MOC efferents were sectioned. Marrufo-Pérez et al (2018) showed that detection thresholds for short (50 ms) tones increased more when the tone and CAS onset coincided (early condition) than when the tone onset was delayed 300 ms from the CAS onset (late condition). Because the time course of MOCR activation is around 300 ms ( Backus and Guinan, 2006 ), one would expect greater threshold increments in the ‘late’ than in the ‘early’ condition if the MOCR were the only responsible for the increments, but this was not the case.…”
The roles of the medial olivocochlear reflex (MOCR) in human hearing have been widely investigated but remain controversial. We reason that this may be because the effects of MOCR activation on cochlear mechanical responses can be assessed only indirectly in healthy humans, and the different methods used to assess those effects possibly yield different and/or unreliable estimates. One aim of this study was to investigate the correlation between three methods often employed to assess the strength of MOCR activation by contralateral acoustic stimulation (CAS). We measured tone detection thresholds (N = 28), click-evoked otoacoustic emission (CEOAE) input/output (I/O) curves (N = 18), and distortion-product otoacoustic emission (DPOAE) I/O curves (N = 18) for various test frequencies in the presence and the absence of CAS (broadband noise of 60 dB SPL). As expected, CAS worsened tone detection thresholds, suppressed CEOAEs and DPOAEs, and horizontally shifted CEOAE and DPOAE I/O curves to higher levels. However, the CAS effect on tone detection thresholds was not correlated with the horizontal shift of CEOAE or DPOAE I/O curves, and the CAS-induced CEOAE suppression was not correlated with DPOAE suppression. Only the horizontal shifts of CEOAE and DPOAE I/O functions were correlated with each other at 1.5, 2, and 3 kHz. A second aim was to investigate which of the methods is more reliable. The test–retest variability of the CAS effect was high overall but smallest for tone detection thresholds and CEOAEs, suggesting that their use should be prioritized over the use of DPOAEs. Many factors not related with the MOCR, including the limited parametric space studied, the low resolution of the I/O curves, and the reduced numbers of observations due to data exclusion likely contributed to the weak correlations and the large test–retest variability noted. These findings can help us understand the inconsistencies among past studies and improve our understanding of the functional significance of the MOCR.
“… Smith et al (2000) demonstrated that, in macaques, the tone threshold increment with CAS remained to some extent when MOC efferents were sectioned. Marrufo-Pérez et al (2018) showed that detection thresholds for short (50 ms) tones increased more when the tone and CAS onset coincided (early condition) than when the tone onset was delayed 300 ms from the CAS onset (late condition). Because the time course of MOCR activation is around 300 ms ( Backus and Guinan, 2006 ), one would expect greater threshold increments in the ‘late’ than in the ‘early’ condition if the MOCR were the only responsible for the increments, but this was not the case.…”
The roles of the medial olivocochlear reflex (MOCR) in human hearing have been widely investigated but remain controversial. We reason that this may be because the effects of MOCR activation on cochlear mechanical responses can be assessed only indirectly in healthy humans, and the different methods used to assess those effects possibly yield different and/or unreliable estimates. One aim of this study was to investigate the correlation between three methods often employed to assess the strength of MOCR activation by contralateral acoustic stimulation (CAS). We measured tone detection thresholds (N = 28), click-evoked otoacoustic emission (CEOAE) input/output (I/O) curves (N = 18), and distortion-product otoacoustic emission (DPOAE) I/O curves (N = 18) for various test frequencies in the presence and the absence of CAS (broadband noise of 60 dB SPL). As expected, CAS worsened tone detection thresholds, suppressed CEOAEs and DPOAEs, and horizontally shifted CEOAE and DPOAE I/O curves to higher levels. However, the CAS effect on tone detection thresholds was not correlated with the horizontal shift of CEOAE or DPOAE I/O curves, and the CAS-induced CEOAE suppression was not correlated with DPOAE suppression. Only the horizontal shifts of CEOAE and DPOAE I/O functions were correlated with each other at 1.5, 2, and 3 kHz. A second aim was to investigate which of the methods is more reliable. The test–retest variability of the CAS effect was high overall but smallest for tone detection thresholds and CEOAEs, suggesting that their use should be prioritized over the use of DPOAEs. Many factors not related with the MOCR, including the limited parametric space studied, the low resolution of the I/O curves, and the reduced numbers of observations due to data exclusion likely contributed to the weak correlations and the large test–retest variability noted. These findings can help us understand the inconsistencies among past studies and improve our understanding of the functional significance of the MOCR.
“…It has been suggested that different efferent time constants may be required in different listening situations [27]. A number of studies suggest that efferent activation may play a role in the perception of amplitude modulation in general [47] and in human speech perception [48], [49], although the effect of different background noise-type and time constants has not been explicitly assessed. Recent studies with a binaural cochlear implant sound coding strategy with efferent-inspired feedback also demonstrate improved speech intelligibility in noise with very short efferent time constants [50].…”
Physiological and psychophysical methods allow for an extended investigation of ascending (afferent) neural pathways from the ear to the brain in mammals, and their role in enhancing signals in noise. However, there is increased interest in descending (efferent) neural fibers in the mammalian auditory pathway. This efferent pathway operates via the olivocochlear system, modifying auditory processing by cochlear innervation and enhancing human ability to detect sounds in noisy backgrounds. Effective speech intelligibility may depend on a complex interaction between efferent time-constants and types of background noise. In this study, an auditory model with efferent-inspired processing provided the front-end to an automatic-speech-recognition system (ASR), used as a tool to evaluate speech recognition with changes in time-constants (50 to 2000 ms) and background noise type (unmodulated and modulated noise). With efferent activation, maximal speech recognition improvement (for both noise types) occurred for signal-to-noise ratios around 10 dB, characteristic of real-world speech-listening situations. Net speech improvement due to efferent activation (NSIEA) was smaller in modulated noise than in unmodulated noise. For unmodulated noise, NSIEA increased with increasing time-constant. For modulated noise, NSIEA increased for time-constants up to 200 ms but remained similar for longer time-constants, consistent with speech-envelope modulation times important to speech recognition in modulated noise. The model improves our understanding of the complex interactions involved in speech recognition in noise, and could be used to simulate the difficulties of speech perception in noise as a consequence of different types of hearing loss.
“…Marrufo-P erez et al (2018a) discussed that such an effect could allow the basilar membrane to follow the dips and peaks in amplitude-modulated waveforms and improve encoding of such signals (see Fig. 1A in Marrufo-P erez et al, 2018a). Modulations in the waveform of speech carry important information on place, manner, and voicing (Rosen, 1992).…”
Section: Discussionmentioning
confidence: 99%
“…The human auditory system contains a rich efferent network wherein the cortex modulates brainstem activity, and the brainstem influences cochlear activity. The auditory efferent system improves detection of sounds in background noise, reduces auditory damage due to high-intensity sound exposure, and may be involved in selective attention (for recent reviews, see Guinan, 2018;Lopez-Poveda, 2018). The medial olivocochlear (MOC) branch of the auditory efferent system innervates the outer hair cells (OHCs), which are responsible for cochlear amplification (for a review of cochlear amplification, see Ashmore, 2008).…”
mentioning
confidence: 99%
“…For example, intensity discrimination is involved in the perception of stop consonants and vowel formants (Sinnott et al, 1985). Additionally, the MOCR may aid in encoding amplitude modulation at the level of the basilar membrane (Marrufo-P erez et al, 2018a), which can benefit perception of the different temporal components of the speech waveform (Rosen, 1992).…”
The medial olivocochlear (MOC) efferent system modifies cochlear output to aid signal detection in noise, but the precise role of efferents in speech-in-noise understanding remains unclear. The current study examined the contribution of the MOC reflex for speech recognition in noise in 30 normal-hearing young adults (27 females, mean age = 22.7 yr). The MOC reflex was assessed using contralateral inhibition of transient-evoked otoacoustic emissions. Speech-in-noise perception was evaluated using the coordinate response measure presented in ipsilateral speech-shaped noise at signal-to-noise ratios (SNRs) ranging from −12 to 0 dB. Performance was assessed without and with the presence of contralateral noise to activate the MOC reflex. Performance was significantly better with contralateral noise only at the lowest SNR. There was a trend of better performance with increasing contralateral inhibition at the lowest SNR. Threshold of the psychometric function was significantly correlated with contralateral inhibition. Response time on the speech task was not significantly correlated with contralateral inhibition. Results suggest that the MOC reflex contributes to listening in low SNRs and the relationship between the MOC reflex and perception is highly dependent upon the task characteristics.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.