Modulation sensitivity in the perceptual organization of speech

Remez, Robert E.; Thomas, Emily F.; Dubowski, Kathryn R.; Koinis, Stavroula M.; Porter, Natalie A. C.; Paddu, Nina U.; Москаленко, Марина; Grossman, Yael S.

doi:10.3758/s13414-013-0542-x

Cited by 11 publications

(22 citation statements)

References 32 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…One thing that should be noted here is that the above mentioned experiment measured the intelligibility of speech by asking for subjective judgments – i.e., people read the target speech before listening to it, and judged the intelligibility of locally time-reversed speech subjectively. The repetitive exposure to the target speech might have produced the relatively high intelligibility ratings for relatively long temporal reversal ( Stilp et al, 2010 ; Remez et al, 2013 ; Ueda et al, 2017 ), but this study showed, at least, that people might be tolerant to temporal distortion occurring at a large scale. Their findings raised the question of whether detailed analysis of the temporal fine structure of speech is required in speech perception ( Liberman et al, 1967 ; Steffen and Werani, 1994 ; Greenberg, 1999 ; Greenberg and Arai, 2001 ; Magrin-Chagnolleau et al, 2002 ; Remez et al, 2013 ; Ueda et al, 2017 ).…”

Section: Introductionmentioning

confidence: 56%

Perceptual Restoration of Temporally Distorted Speech in L1 vs. L2: Local Time Reversal and Modulation Filtering

2018

View full text Add to dashboard Cite

Speech is intelligible even when the temporal envelope of speech is distorted. The current study investigates how native and non-native speakers perceptually restore temporally distorted speech. Participants were native English speakers (NS), and native Japanese speakers who spoke English as a second language (NNS). In Experiment 1, participants listened to “locally time-reversed speech” where every x-ms of speech signal was reversed on the temporal axis. Here, the local time reversal shifted the constituents of the speech signal forward or backward from the original position, and the amplitude envelope of speech was altered as a function of reversed segment length. In Experiment 2, participants listened to “modulation-filtered speech” where the modulation frequency components of speech were low-pass filtered at a particular cut-off frequency. Here, the temporal envelope of speech was altered as a function of cut-off frequency. The results suggest that speech becomes gradually unintelligible as the length of reversed segments increases (Experiment 1), and as a lower cut-off frequency is imposed (Experiment 2). Both experiments exhibit the equivalent level of speech intelligibility across six levels of degradation for native and non-native speakers respectively, which poses a question whether the regular occurrence of local time reversal can be discussed in the modulation frequency domain, by simply converting the length of reversed segments (ms) into frequency (Hz).

show abstract

Section: Introductionmentioning

confidence: 56%

Perceptual Restoration of Temporally Distorted Speech in L1 vs. L2: Local Time Reversal and Modulation Filtering

2018

View full text Add to dashboard Cite

show abstract

“…For example, some authors presented a stimulus in a trial for a fixed number of times 13–15 , ranging from once to five times, whereas other researchers 10–12 let their participants listen ad libitum up to four times on each trial. Some experimenters presented their stimuli in random order 10–12, 14 , whereas others presented them in a systematic order 13 .…”

Section: Introductionmentioning

confidence: 99%

Intelligibility of locally time-reversed speech: A multilingual comparison

Ueda

Nakajima

Ellermeier

et al. 2017

Sci Rep

View full text Add to dashboard Cite

A set of experiments was performed to make a cross-language comparison of intelligibility of locally time-reversed speech, employing a total of 117 native listeners of English, German, Japanese, and Mandarin Chinese. The experiments enabled to examine whether the languages of three types of timing—stress-, syllable-, and mora-timed languages—exhibit different trends in intelligibility, depending on the duration of the segments that were temporally reversed. The results showed a strikingly similar trend across languages, especially when the time axis of segment duration was normalised with respect to the deviation of a talker’s speech rate from the average in each language. This similarity is somewhat surprising given the systematic differences in vocalic proportions characterising the languages studied which had been shown in previous research and were largely replicated with the present speech material. These findings suggest that a universal temporal window shorter than 20–40 ms plays a crucial role in perceiving locally time-reversed speech by working as a buffer in which temporal reorganisation can take place with regard to lexical and semantic processing.

show abstract

“…Nakajima et al, 2018;Ueda et al, 2017)]. The results of previous perceptual experiments (Greenberg and Arai, 2001;Ishida et al, 2018;Kiss et al, 2008;Meunier et al, 2002;Nakajima et al, 2018;Remez et al, 2013;Saberi and Perrott, 1999;Steffen and Werani, 1994;Stilp et al, 2010;Ueda et al, 2017) indicate that the auditory system is capable of overriding this kind of degradation and retrieving plausible solutions to some extent, unless the reversed segment duration becomes too long. The restoration process, however, should certainly impose an extra processing load on the auditory system compared to the processing of normal speech.…”

Section: Introductionmentioning

confidence: 92%

Irrelevant speech effects with locally time-reversed speech: Native vs non-native language

Ueda

Nakajima

Kattner

et al. 2019

The Journal of the Acoustical Society of America

View full text Add to dashboard Cite

Irrelevant speech is known to interfere with short-term memory of visually presented items. Here, this irrelevant speech effect was studied with a factorial combination of three variables: the participants' native language, the language the irrelevant speech was derived from, and the playback direction of the irrelevant speech. We used locally time-reversed speech as well to disentangle the contributions of local and global integrity. German and Japanese speech was presented to German (n ¼ 79) and Japanese (n ¼ 81) participants while participants were performing a serialrecall task. In both groups, any kind of irrelevant speech impaired recall accuracy as compared to a pink-noise control condition. When the participants' native language was presented, normal speech and locally time-reversed speech with short segment duration, preserving intelligibility, was the most disruptive. Locally time-reversed speech with longer segment durations and normal or locally time-reversed speech played entirely backward, both lacking intelligibility, was less disruptive. When the unfamiliar, incomprehensible signal was presented as irrelevant speech, no significant difference was found between locally time-reversed speech and its globally inverted version, suggesting that the effect of global inversion depends on the familiarity of the language.

show abstract

Modulation sensitivity in the perceptual organization of speech

Cited by 11 publications

References 32 publications

Perceptual Restoration of Temporally Distorted Speech in L1 vs. L2: Local Time Reversal and Modulation Filtering

Perceptual Restoration of Temporally Distorted Speech in L1 vs. L2: Local Time Reversal and Modulation Filtering

Intelligibility of locally time-reversed speech: A multilingual comparison

Irrelevant speech effects with locally time-reversed speech: Native vs non-native language

Contact Info

Product

Resources

About