2014
DOI: 10.1179/1754762814y.0000000086
|View full text |Cite
|
Sign up to set email alerts
|

The effect of spectral smearing on the identification of pureF0intonation contours in vocoder simulations of cochlear implants

Abstract: Objectives: Performance of cochlear implant (CI) users on linguistic intonation recognition is poorer than that of normal-hearing listeners, due to the limited spectral detail provided by the implant. A higher spectral resolution is provided by narrow rather than by broad filter slopes. The corresponding effect of the filter slope on the identification of linguistic intonation conveyed by pitch movements alone was tested using vocoder simulations. Methods: Re-synthesized intonation variants of naturally produc… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4

Citation Types

0
2
0

Year Published

2017
2017
2019
2019

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(4 citation statements)
references
References 35 publications
0
2
0
Order By: Relevance
“…The literature reviewed above suggests that, similar to segmental perception, prosodic pitch (i.e., intonation) perception benefits from better frequency selectivity in the form of steeper filter slopes. However, whereas for segmental identification scores reached asymptote at 40 dB/octave (Litvak et al, 2007), performance for intonation perception was still at chance for 40 dB/octave (van de Velde et al, 2015), despite using the same number of channels (though some other vocoding parameters differed between the studies). Given the results of those studies, we hypothesize that, given comparable tasks, intonation perception requires greater channel independence, perhaps as realized by means of electrode configuration or steeper filter slopes, than segmental perception, because intonation perception relies more heavily on spectral versus temporal information relative to segmental perception.…”
Section: Introductionmentioning
confidence: 89%
See 1 more Smart Citation
“…The literature reviewed above suggests that, similar to segmental perception, prosodic pitch (i.e., intonation) perception benefits from better frequency selectivity in the form of steeper filter slopes. However, whereas for segmental identification scores reached asymptote at 40 dB/octave (Litvak et al, 2007), performance for intonation perception was still at chance for 40 dB/octave (van de Velde et al, 2015), despite using the same number of channels (though some other vocoding parameters differed between the studies). Given the results of those studies, we hypothesize that, given comparable tasks, intonation perception requires greater channel independence, perhaps as realized by means of electrode configuration or steeper filter slopes, than segmental perception, because intonation perception relies more heavily on spectral versus temporal information relative to segmental perception.…”
Section: Introductionmentioning
confidence: 89%
“…Slopes with a shallow roll-off overlap each other more than those with a steep roll-off, resulting in more spectral smearing. Moreover, even with steep analysis filters, spectral smearing is also induced by overlapping neuron areas stimulated by adjacent electrodes (Tang et al, 2011), a factor represented by means of the synthesis filter in vocoder simulations. Using vocoder simulations of CIs, this study aims to find the theoretically optimal filter slope for the perception of a specific aspect of speech in which pitch plays a central role (i.e., prosody).…”
Section: Introductionmentioning
confidence: 99%
“…Just as accuracy scores vary between listeners, the ability to make use of cues conveyed by intact acoustic features to compensate for uninformative F0 cues may differ between listeners (e.g., listeners presented full-spectrum speech or CI-simulations and CI listeners presented full-spectrum speech recognise vocal emotions more accurately when overall intensity cues are intact, i.e., not normalised. Gilbers et al (2015) suggest that informative overall intensity cues do not improve recognition of vocal emotions perceived through a CI or make use of speech rate cues to recognise vocal emotions, the majority of studies suggest listeners cannot make use of speech-rate cues to identify emotions conveyed in speech with uninformative F0 cues (Gilbers et al, 2015;Luo, 2016;Van de Velde, 2017). Combined, these studies highlight the lack of consensus regarding listeners' 66 abilities to make use of intensity or speech-rate cues in the presence of uninformative F0 cues.…”
Section: Introductionmentioning
confidence: 99%
“…Evidence exists that increased reliance on intensity cues may support recognition of emotions in the absence of F0 cues to emotional prosody (Chapter 3;Luo et al, 2007), although intensity cues fe, 2017). Other studies suggest that listeners cannot use intensity cues to compensate for uninformative F0 cues when identifying vocal emotions (Gilbers et al, 2015;Luo, to make use of speech-rate cues to recognise emotions in speech in which F0 cues are uninformative, while other studies indicate that whether or not listeners increase their reliance on speech rate when parsing vocal emotions, they are unable to make use of speech-rate cues to recognise emotions in speech with uninformative F0 cues (Chapter 3; Gilbers et al, 2015;Luo, 2016;Van de Velde, 2017). Further, speech rate, of itself, 2017).…”
Section: Introductionmentioning
confidence: 99%