2013
DOI: 10.1016/j.neuroimage.2013.05.002
Mean-based neural coding of voices

Abstract: The social significance of recognizing the person who talks to us is obvious, but the neural mechanisms that mediate talker identification are unclear. Regions along the bilateral superior temporal sulcus (STS) and the inferior frontal cortex (IFC) of the human brain are selective for voices, and they are sensitive to rapid voice changes. Although it has been proposed that voice recognition is supported by prototype-centered voice representations, the involvement of these category-selective cortical regions in…

Cited by 34 publications (34 citation statements) · References 54 publications (91 reference statements)
“…Recall that the typicality analysis collapsed both Long and Short VOT variants for both talkers (each of which might be perceived as ‘typical’ or ‘atypical’ of a given talker according to the counterbalanced group assignment), so differences in activation seen within this contrast cannot be attributed to surface-level properties of the stimuli. Notably, right posterior temporal and parietal regions have been linked in lesion studies (Van Lancker et al, 1988, 1989) and imaging studies (Andics et al, 2010, 2013) to access to talker identity. This region also abuts a slightly more ventral MTG area that was found to be responsive in a previous study to talker-specific phonetic variability when that variability takes the form of an ambiguous phoneme inserted in a biasing lexical context (Myers & Mesite, 2014).…”
Section: Discussion
confidence: 99%
“…Imaging studies have further corroborated the separation between regions that are sensitive to the acoustics of the voice—and thus could be used for discriminating between talkers—and those responsible for mapping voice acoustics to an individual identity which can be used for talker identification (von Kriegstein, et al, 2003). In particular, while voice acoustics may be processed in bilateral temporal regions (specifically the superior temporal sulcus or STS), imaging studies have sited vocal identity processing (or access to familiar voices) in the anterior right temporal lobe rather than the right posterior region implicated in lesion studies (Andics et al, 2010; Andics, McQueen, & Petersson, 2013; Belin & Zatorre, 2003; Campanella & Belin, 2007). …”
Section: Introduction
confidence: 99%
“…Thus normal processing of voices involves increasingly more abstract representations of speaker identity, independent of other (e.g., acoustic) features (Warren et al 2006). Current evidence also suggests that individual voices are coded relative to how different they are from a prototypical, or average, voice (i.e., a ‘prior’; Andics et al 2013; Andics et al 2010; Latinus et al 2013), which indicates that both prediction and prediction error signals are routinely implemented in the perception of speaker identity.…”
Section: An Auditory Processing Stream Framework
confidence: 99%
“…Based on the previous literature, we expected in the crossmodal condition a decrease in activation in same-speaker (person-congruent) compared with different-speaker (person-incongruent) trials in face-sensitive areas of the fusiform gyrus [Grill-Spector et al, 2004; Shah et al, 2001] and in voice-sensitive areas along the STS [Andics et al, 2013b; Joassin et al, 2011; Latinus et al, 2011] and the inferior frontal gyrus (IFG; [Andics et al, 2013a,b; Latinus et al, 2011]). The S2 stimulus (the target) was a human voice in all conditions.…”
Section: Introduction
confidence: 99%
“…We further manipulated the congruency between the prime and the target, that is, whether the prime and the target belonged to the same speaker (person-congruent) or to different speakers (person-incongruent). Based on the previous literature, we expected in the crossmodal condition a decrease in activation in same-speaker (person-congruent) compared with different-speaker (person-incongruent) trials in face-sensitive areas of the fusiform gyrus [Grill-Spector et al, 2004; Shah et al, 2001] and in voice-sensitive areas along the STS [Andics et al, 2013b; Joassin et al, 2011; Latinus et al, 2011] and the inferior frontal gyrus (IFG; [Andics et al, 2013a,b; Latinus et al, 2011]). We further predicted a similar decline of the BOLD signal in supramodal brain regions which have been previously reported to be activated during the integration of human faces and voices, specifically the pSTS [Blank et al, 2011; Joassin et al, 2011; Klasen et al, 2011; Watson et al, 2013, 2014a] and the angular gyrus [Joassin et al, 2011; Klasen et al, 2011; Müller et al, 2011].…”
Section: Introduction
confidence: 99%