Speech comprehension is severely compromised when several people talk at once, due to limited perceptual and cognitive resources. Under some circumstances, listeners can employ top-down attention to prioritize the processing of task-relevant speech. However, whether the system can effectively represent more than one speech input remains highly debated. Here we studied how task-relevance affects the neural representation of concurrent speakers under two extreme conditions: when only one speaker was task-relevant (Selective Attention) versus when both speakers were equally relevant (Distributed Attention). Neural activity was measured using magnetoencephalography (MEG), and we analysed the speech-tracking responses to both speakers. Crucially, we explored different hypotheses as to how the brain may have represented the two speech streams, without making a priori assumptions regarding participants' internal allocation of attention. Results indicate that neural tracking of the two concurrent speech streams did not fully mirror their instructed task-relevance. When Distributed Attention was required, we observed a tradeoff between the two speakers despite their equal task-relevance, akin to the top-down modulation observed during Selective Attention. This points to the system's inherent limitation in fully processing two concurrent speech streams and highlights the complex nature of attention, particularly for continuous speech.