The effect of spectral smearing on the identification of pure<i>F</i><sub>0</sub>intonation contours in vocoder simulations of cochlear implants

Velde, Daan J. van de; Dritsakis, Giorgos; Frijns, Johan H. M.; Heuven, Vincent J. van; Schiller, Niels O.

doi:10.1179/1754762814y.0000000086

Cited by 3 publications

(4 citation statements)

References 35 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The literature reviewed above suggests that, similar to segmental perception, prosodic pitch (i.e., intonation) perception benefits from better frequency selectivity in the form of steeper filter slopes. However, whereas for segmental identification scores reached asymptote at 40 dB/octave (Litvak et al, 2007), performance for intonation perception was still at chance for 40 dB/octave (van de Velde et al, 2015), despite using the same number of channels (though some other vocoding parameters differed between the studies). Given the results of those studies, we hypothesize that, given comparable tasks, intonation perception requires greater channel independence, perhaps as realized by means of electrode configuration or steeper filter slopes, than segmental perception, because intonation perception relies more heavily on spectral versus temporal information relative to segmental perception.…”

Section: Introductionmentioning

confidence: 89%

“…Slopes with a shallow roll-off overlap each other more than those with a steep roll-off, resulting in more spectral smearing. Moreover, even with steep analysis filters, spectral smearing is also induced by overlapping neuron areas stimulated by adjacent electrodes (Tang et al, 2011), a factor represented by means of the synthesis filter in vocoder simulations. Using vocoder simulations of CIs, this study aims to find the theoretically optimal filter slope for the perception of a specific aspect of speech in which pitch plays a central role (i.e., prosody).…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

The perception of emotion and focus prosody with varying acoustic cues in cochlear implant simulations with varying filter slopes

Velde

Schiller

Heuven

et al. 2017

The Journal of the Acoustical Society of America

View full text Add to dashboard Cite

This study aimed to find the optimal filter slope for cochlear implant simulations (vocoding) by testing the effect of a wide range of slopes on the discrimination of emotional and linguistic (focus) prosody, with varying availability of F0 and duration cues. Forty normally hearing participants judged if (non-)vocoded sentences were pronounced with happy or sad emotion, or with adjectival or nominal focus. Sentences were recorded as natural stimuli and manipulated to contain only emotion- or focus-relevant segmental duration or F0 information or both, and then noise-vocoded with 5, 20, 80, 120, and 160 dB/octave filter slopes. Performance increased with steeper slopes, but only up to 120 dB/octave, with bigger effects for emotion than for focus perception. For emotion, results with both cues most closely resembled results with F0, while for focus results with both cues most closely resembled those with duration, showing emotion perception relies primarily on F0, and focus perception on duration. This suggests that filter slopes affect focus perception less than emotion perception because for emotion, F0 is both more informative and more affected. The performance increase until extreme filter slope values suggests that much performance improvement in prosody perception is still to be gained for CI users.

show abstract

Section: Introductionmentioning

confidence: 89%

Section: Introductionmentioning

confidence: 99%

The perception of emotion and focus prosody with varying acoustic cues in cochlear implant simulations with varying filter slopes

Velde

Schiller

Heuven

et al. 2017

The Journal of the Acoustical Society of America

View full text Add to dashboard Cite

show abstract

“…Just as accuracy scores vary between listeners, the ability to make use of cues conveyed by intact acoustic features to compensate for uninformative F0 cues may differ between listeners (e.g., listeners presented full-spectrum speech or CI-simulations and CI listeners presented full-spectrum speech recognise vocal emotions more accurately when overall intensity cues are intact, i.e., not normalised. Gilbers et al (2015) suggest that informative overall intensity cues do not improve recognition of vocal emotions perceived through a CI or make use of speech rate cues to recognise vocal emotions, the majority of studies suggest listeners cannot make use of speech-rate cues to identify emotions conveyed in speech with uninformative F0 cues (Gilbers et al, 2015;Luo, 2016;Van de Velde, 2017). Combined, these studies highlight the lack of consensus regarding listeners' 66 abilities to make use of intensity or speech-rate cues in the presence of uninformative F0 cues.…”

Section: Introductionmentioning

confidence: 99%

“…Evidence exists that increased reliance on intensity cues may support recognition of emotions in the absence of F0 cues to emotional prosody (Chapter 3;Luo et al, 2007), although intensity cues fe, 2017). Other studies suggest that listeners cannot use intensity cues to compensate for uninformative F0 cues when identifying vocal emotions (Gilbers et al, 2015;Luo, to make use of speech-rate cues to recognise emotions in speech in which F0 cues are uninformative, while other studies indicate that whether or not listeners increase their reliance on speech rate when parsing vocal emotions, they are unable to make use of speech-rate cues to recognise emotions in speech with uninformative F0 cues (Chapter 3; Gilbers et al, 2015;Luo, 2016;Van de Velde, 2017). Further, speech rate, of itself, 2017).…”

Section: Introductionmentioning

confidence: 99%

Recognition and cortical haemodynamics of vocal emotions-an fNIRS perspective

Moffat¹

View full text Add to dashboard Cite

show abstract

Talking in Time: The development of a self-administered conversation analysis based training programme for cochlear implant users

Wells

Beeston

Bradley

et al. 2019

Cochlear Implants International

View full text Add to dashboard Cite

The effect of spectral smearing on the identification of pureF₀intonation contours in vocoder simulations of cochlear implants

Cited by 3 publications

References 35 publications

The perception of emotion and focus prosody with varying acoustic cues in cochlear implant simulations with varying filter slopes

The perception of emotion and focus prosody with varying acoustic cues in cochlear implant simulations with varying filter slopes

Recognition and cortical haemodynamics of vocal emotions-an fNIRS perspective

Talking in Time: The development of a self-administered conversation analysis based training programme for cochlear implant users

Contact Info

Product

Resources

About

The effect of spectral smearing on the identification of pureF0intonation contours in vocoder simulations of cochlear implants

Cited by 3 publications

References 35 publications

The perception of emotion and focus prosody with varying acoustic cues in cochlear implant simulations with varying filter slopes

The perception of emotion and focus prosody with varying acoustic cues in cochlear implant simulations with varying filter slopes

Recognition and cortical haemodynamics of vocal emotions-an fNIRS perspective

Talking in Time: The development of a self-administered conversation analysis based training programme for cochlear implant users

Contact Info

Product

Resources

About

The effect of spectral smearing on the identification of pureF₀intonation contours in vocoder simulations of cochlear implants