Cochlear implants provide users with limited spectral and temporal information. In this study, the amount of spectral and temporal information was systematically varied through simulations of cochlear implant processors using a noise-excited vocoder. Spectral information was controlled by varying the number of channels between 1 and 16, and temporal information was controlled by varying the lowpass cutoff frequencies of the envelope extractors from 1 to 512 Hz. Consonants and vowels processed using those conditions were presented to seven normal-hearing native-Englishspeaking listeners for identification. The results demonstrated that both spectral and temporal cues were important for consonant and vowel recognition with the spectral cues having a greater effect than the temporal cues for the ranges of numbers of channels and lowpass cutoff frequencies tested. The lowpass cutoff for asymptotic performance in consonant and vowel recognition was 16 and 4 Hz, respectively. The number of channels at which performance plateaued for consonants and vowels was 8 and 12, respectively. Within the above-mentioned ranges of lowpass cutoff frequency and number of channels, the temporal and spectral cues showed a tradeoff for phoneme recognition. Information transfer analyses showed different relative contributions of spectral and temporal cues in the perception of various phonetic/acoustic features.
Tone languages differ from English in that the pitch pattern of a single-syllable word conveys lexical meaning. In the present study, dependence of tonal-speech perception on features of the stimulation was examined using an acoustic simulation of a CIS-type speech-processing strategy for cochlear prostheses. Contributions of spectral features of the speech signals were assessed by varying the number of filter bands, while contributions of temporal envelope features were assessed by varying the low-pass cutoff frequency used for extracting the amplitude envelopes. Ten normal-hearing native Mandarin Chinese speakers were tested. When the low-pass cutoff frequency was fixed at 512 Hz, consonant, vowel, and sentence recognition improved as a function of the number of channels and reached plateau at 4 to 6 channels. Subjective judgments of sound quality continued to improve as the number of channels increased to 12, the highest number tested. Tone recognition, i.e., recognition of the four Mandarin tone patterns, depended on both the number of channels and the low-pass cutoff frequency. The trade-off between the temporal and spectral cues for tone recognition indicates that temporal cues can compensate for diminished spectral cues for tone recognition and vice versa. An additional tone recognition experiment using syllables of equal duration showed a marked decrease in performance, indicating that duration cues contribute to tone recognition. A third experiment showed that recognition of processed FM patterns that mimic Mandarin tone patterns was poor when temporal envelope and duration cues were removed.
Most modern cochlear-implant speech processors convey speech-envelope information using amplitude-modulated pulse trains. The use of higher-rate carrier pulse trains allows more envelope detail in the signal. However, neural response properties could limit the efficacy of high-rate carriers. This study examined effects of carrier rate and stimulation site, on psychophysical modulation detection thresholds (MDTs). Both of these variables could affect the neural representation of the carrier and thus affect perception of the modulation. Twelve human subjects with cochlear implants were tested. Phase duration of symmetric biphasic pulses was modulated sinusoidally at 40 Hz. MDTs were determined for monopolar stimulation at two carrier rates [250 and 4000 pulses/s (pps)], three stimulation sites (basal, middle, and apical), and five stimulus levels (10%, 30%, 50%, 70%, and 90% of the dynamic range). MDTs were lower for 250 pps carriers than for 4000 pps carriers in 71% of the 180 cases studied. Effects of carrier rate were greatest at the apical stimulation site and effects of stimulation site on MDTs depended on carrier rate. The data suggest a distinct disadvantage to using carrier pulse rates as high as 4000 pps. Stimulation site should be considered in evaluating modulation detection ability.
Sensory organs typically use receptor cells and afferent neurons to transduce environmental signals and transmit them to the CNS. When sensory cells are lost, nerves often regress from the sensory area. Therapeutic and regenerative approaches would benefit from the presence of nerve fibers in the tissue. In the hearing system, retraction of afferent innervation may accompany the degeneration of auditory hair cells that is associated with permanent hearing loss. The only therapy currently available for cases with severe or complete loss of hair cells is the cochlear implant auditory prosthesis. To enhance the therapeutic benefits of a cochlear implant, it is necessary to attract nerve fibers back into the cochlear epithelium. Here we show that forced expression of the neurotrophin gene BDNF in epithelial or mesothelial cells that remain in the deaf ear, induces robust regrowth of nerve fibers towards the cells that secrete the neurotrophin, and results in re-innervation of the sensory area. The process of neurotrophin-induced neuronal regeneration is accompanied by significant preservation of the spiral ganglion cells. The ability to regrow nerve fibers into the basilar membrane area and protect the auditory nerve will enhance performance of cochlear implants and augment future cell replacement therapies such as stem cell implantation or induced transdifferentiation. This model also provides a general experimental stage for drawing nerve fibers into a tissue devoid of neurons, and studying the interaction between the nerve fibers and the tissue.
The relative importance of temporal envelope and fine structure in speech and music perception was investigated by Smith et al. [Nature (London) 416, 87-90 (2002)] using "auditory chimera" in which the envelope from one sound was paired with the fine structure of another. Smith et al. found that, when 4 to 16 frequency bands were used, recognition of English speech was dominated by the envelope, whereas recognition of melody was dominated by the fine structure. In the present study, Mandarin Chinese monosyllables were divided into 4, 8, or 16 frequency bands and the fine structure and envelope of one tone pattern were exchanged with those of another tone pattern of the same monosyllable. Five normal-hearing native Mandarin Chinese speakers completed a four-alternative forced-choice tone-identification task. In the vast majority of trials, subjects based their identification of the monosyllables on the fine structure rather than the envelope. Thus, the relative importance of envelope and fine structure for lexical-tone perception resembled that for melody recognition rather than that for English speech recognition. Delivering fine-structure information in cochlear implant stimulation could be particularly beneficial for lexical-tone perception.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.