Bipolar stimulation has been thought to be more beneficial than monopolar stimulation for speech coding in cochlear implants, on the basis of its more restricted current flow. The present study examined whether bipolar stimulation would indeed lead to reduced channel interaction in a behavioral forward masking experiment tested in four Nucleus 24 users. The masker was fixed on one channel and three masker levels that were balanced for loudness between the configurations were chosen. As expected, masking was maximal when the masker and probe channels were spatially close and decreased as they were separated. However, overall masking patterns did not consistently demonstrate sharper tuning with bipolar stimulation than monopolar. This implies that the spatial extent of a bipolar current field is not consistently narrower than that of an equally loud monopolar stimulus; therefore, it should not be assumed that bipolar stimulation leads to reduced channel interaction. Notably, bipolar masking patterns appeared to display more variations across channels, possibly influenced more by anatomical and neural irregularities near electrode contacts than monopolar masking patterns. The present psychophysical results provide a theoretical basis regarding the widespread use (and success) of monopolar configurations by implant users.
The present study investigated the effect of envelope modulations in a background masker on consonant recognition by normal hearing listeners. It is well known that listeners understand speech better under a temporally modulated masker than under a steady masker at the same level, due to masking release. The possibility of an opposite phenomenon, modulation interference, whereby speech recognition could be degraded by a modulated masker due to interference with auditory processing of the speech envelope, was hypothesized and tested under various speech and masker conditions. It was of interest whether modulation interference for speech perception, if it were observed, could be predicted by modulation masking, as found in psychoacoustic studies using nonspeech stimuli. Results revealed that masking release measurably occurred under a variety of conditions, especially when the speech signal maintained a high degree of redundancy across several frequency bands. Modulation interference was also clearly observed under several circumstances when the speech signal did not contain a high redundancy. However, the effect of modulation interference did not follow the expected pattern from psychoacoustic modulation masking results. In conclusion, (1) both factors, modulation interference and masking release, should be accounted for whenever a background masker contains temporal fluctuations, and (2) caution needs to be taken when psychoacoustic theory on modulation masking is applied to speech recognition.
Cochlear implant users may perceive intermediate place-pitches between those elicited by the individual electrodes when two electrodes are stimulated simultaneously or sequentially. This study examined pitch discrimination between adjacent electrodes using sequential dual-electrode stimulation in terms of the sensitivity index, d′, which was obtained by adding d′s from intermediate dual-electrode stimuli. Loudness was balanced for each tested pair and the intensities were roved. Twelve ears with the Nucleus® 24 or Freedom implants demonstrated a wide range of d′, from 0.7 to 9.6. “Virtual channels” can be implemented through nonsimultaneous stimulation, with comparable pitch discrimination to that observed with simultaneous stimulation.
The relative contributions of various regions of the frequency spectrum to speech recognition were assessed with a correlational method [K. A. Doherty and C. W. Turner, J. Acoust. Soc. Am. 100, 3769-3773 (1996)]. The speech materials employed were the 258-item set of the Nonsense Syllable Test. The speech was filtered into four frequency bands and a random level of noise was added to each band on each trial. A point biserial correlation was computed between the signal-to-noise ratio in each band on the trials and the listener's responses, and these correlations were then taken as estimates of the relative weights for each frequency band. When the four bands were presented separately, the correlations for each band were approximately equal; however, when the four bands were presented in combination, the correlations were quite different from one another, implying that in the broadband case listeners relied much more on some bands than on others. It is hypothesized that these differences reflect the way in which listeners combine and attend to speech information across various frequency regions. The frequency-weighting functions as determined by this method were highly similar across all subjects, suggesting that normal-hearing listeners use similar frequency-weighting strategies in recognizing speech.
Modern cochlear implants utilize interleaved presentation of pulses on different electrodes to avoid physical interference among multiple current fields, yet neural interaction still exists. In the present study, masking was examined with four Nucleus24 users with the banded electrode array in an interleaved masking paradigm, where a probe stimulus was interleaved with a masker stimulus. Spatial and temporal aspects of masking were addressed by fixing the masker at the middle of the electrode array and changing the location of the probe and by testing various stimulation rates: 125, 500, 2,000, and 6,410 Hz. In addition, growth of masking (GOM) was assessed by changing the masker level in six steps. Results indicated that masking patterns were generally much wider, regardless of stimulation rate, than those in acoustic hearing. The amount of masking decreased from the peak at the rate of approximately 0.5 dB/mm even at the highest masker level. The pattern of GOM with the rates higher than 500 Hz was different from that observed in previous masking studies, characterized by markedly shallow growth at low masker levels or overall shallow growth. A facilitating effect of the masker (lowering the threshold) was suspected, except for the 125-Hz condition, due to the fibers that were subliminally excited, but not discharged, by the masker with local perturbations of membrane potentials, and were subsequently discharged easily by a lower level probe when the temporal gap between masker and probe was sufficiently short. These results suggest that both refractory characteristics of neurons and neural summation be considered in interleaved stimulation of pulses at high, but clinically relevant, stimulation rates. Overall, the present masking study might provide a basis for models in psychophysics and speech understanding in current cochlear implant systems utilizing high-rate interleaved stimulation.
Comodulation masking release (CMR) refers to an improvement in the detection threshold of a signal masked by noise with coherent amplitude fluctuation across frequency, as compared to noise without the envelope coherence. The present study tested whether such an advantage for signal detection would facilitate the identification of speech phonemes. Consonant identification of bandpass speech was measured under the following three masker conditions: (1) a single band of noise in the speech band ("on-frequency" masker); (2) two bands of noise, one in the on-frequency band and the other in the "flanking band," with coherence of temporal envelope fluctuation between the two bands (comodulation); and (3) two bands of noise (on-frequency band and flanking band), without the coherence of the envelopes (noncomodulation). A pilot experiment with a small number of consonant tokens was followed by the main experiment with 12 consonants and the following masking conditions: three frequency locations of the flanking band and two masker levels. Results showed that in all conditions, the comodulation condition provided higher identification scores than the noncomodulation condition, and the difference in score was 3.5% on average. No significant difference was observed between the on-frequency only condition and the comodulation condition, i.e., an "unmasking" effect by the addition of a comodulated flaking band was not observed. The positive effect of CMR on consonant recognition found in the present study endorses a "cued-listening" theory, rather than an envelope correlation theory, as a basis of CMR in a suprathreshold task.
The aim of the present study was to examine cochlear implant ͑CI͒ users' perceptual segregation of speech from background noise with differing degrees of electrode separation between speech and noise. Eleven users of the nucleus CI system were tested on consonant identification using an experimental processing scheme called "multi-stream processing" in which speech and noise stimuli were processed separately and interleaved. Speech was presented to either ten ͑every other electrode͒ or six electrodes ͑every fourth electrode͒. Noise was routed to either the same ͑the "overlapped" condition͒ or a different set of electrodes ͑the "interlaced" condition͒, where speech and noise electrodes were separated by one-and two-electrode spacings for ten-and six-electrode presentations, respectively. Results indicated a small but significant improvement in consonant recognition ͑5%-10%͒ in the interlaced condition with a two-electrode spacing ͑approximately 1.1 mm͒ in two subjects. It appears that the results were influenced by peripheral channel interactions, partially accounting for individual variability. Although the overall effect was small and observed from a small number of subjects, the present study demonstrated that CI users' performance on segregating the target from the background might be improved if these sounds were presented with sufficient peripheral separation.
When listening to speech in a background masker, normal-hearing listeners take advantage of envelope fluctuations or dips in the masker (or the moments that the SNR is instantaneously favorable). Cochlear implant listeners, however, do not exhibit such ability. In a previous study [Nelson et al., J. Acoust. Soc. Am. 113, 961–968 (2003)], speech recognition performance with modulated maskers was similar or slightly worse than with a steady noise. Clinical observation indicates that implant listeners usually have more difficulty understanding speech in modulated backgrounds. In the present study, the recognition of IEEE sentences by Nucleus recipients was measured in a variety of backgrounds. In tightly controlled conditions via direct stimulation, performance is often substantially poorer with modulated backgrounds (the differences in score are as large as 20–30%), strongly indicating that they are subject to modulation interference [B. J. Kwon and C. W. Turner, J. Acoust. Soc. Am. 110, 1130–1140 (2001)]. A mixture of speech and modulated backgrounds might be perceived as one distorted signal, rather than two signals, compromising the identity of the speech, as perceptual segregation of signals is very challenging. A future implant system should incorporate a strategy to provide multiple input sounds without sacrificing the sound identity.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.