Although previous work has shown that some speech sounds contain more speaker-dependent information than others, not much is known about the speaker information of the same segment in different linguistic contexts. The present study therefore investigated whether Dutch fricatives /s/ and /x/ from telephone dialogues contain differential speaker information as a function of syllabic position and labial co-articulation. These linguistic effects, established in earlier work on read broadband speech, were firstly investigated. Using a corpus of Dutch telephone speech, results showed that the telephone bandwidth captures the expected effects of perseverative and anticipatory labialization for back fricative /x/, for which spectral peaks fall within the telephone band, but not for front fricative /s/, for which the spectral peak falls outside the telephone band.Multinomial logistic regression shows that /s/ contains slightly more speaker information than /x/ in telephone speech and that speaker information is distributed across the speech signal in a systematic way; even though differences in classification accuracy were small, for both /s/ and /x/, codas and tokens with labial neighbours are more speaker-specific than onsets and tokens with non-labial neighbours. These findings indicate that speaker information contained by the same speech sound is not the same across contexts.
Previous studies have suggested that female voices may impede verbal processing. For example, words were remembered less well and lexical decision was slower when spoken by a female speaker. The current study tried to replicate this gender effect in an auditory semantic/associative priming task that excluded any effects of speaker variability and extended previous research by examining the role of two voice features important in perceived gender: pitch and formant frequencies. Additionally, listener gender was included in the experimental design. Results show that, contrary to previous findings, there is no evidence that a lexical decision of a target word is slower when spoken by a female speaker than by a male speaker for female and male listeners. Additionally, the semantic/associative priming effect was not affected by speaker gender, neither did female mean pitch or formants predict the semantic/associative priming effect. At the behavioural level, the current study found no evidence for a gender effect in a semantic/associative priming task.
In forensic speech science, nasals are often reported to be particularly useful in characterizing speakers because of their low within-speaker and high between-speaker variability. However, empirical acoustic data from nasal consonants indicate that there is a somewhat larger role for the oral cavity in nasal consonant acoustics than is generally predicted by acoustic models. For example, in read speech, nasal consonant acoustics show lingual coarticulation that differs by nasal consonant, and syllabic position also seems to affect realizations of nasal consonants within speakers. In the current exploratory study, the within-and between-speaker variation in the most frequent nasals in Standard Dutch, /n/ and /m/, was investigated. Using 3695 [n] and 3291 [m] tokens sampled from 54 speakers' spontaneous telephone utterances, linear mixed-effects modeling of acoustic-phonetic features showed effects of phonetic context that differed by nasal consonant and by syllabic position. A subsequent speaker-classification test using multinomial logistic regression on the acoustic-phonetic features seems to indicate that nasals displaying larger effects of phonetic context also perform slightly better in speaker classification, although differences were minor. This might be caused by between-speaker variation in the degree and timing of lingual coarticulatory gestures.
In Moroccan Dutch, /s/ has been claimed to be pronounced as retracted [s] (towards /ʃ/) in certain consonant clusters. Recently, retracted s-pronunciation has also been attested in endogenous Dutch. We tested empirically whether Moroccan Dutch [s] is indeed more retracted than endogenous Dutch [s] in relevant clusters. Additionally, we tested whether the inter-speaker variation of /s/ is smaller between Moroccan Dutch speakers than between endogenous Dutch speakers, as expected if retraction of /s/ would be used as identity marker in in-group conversations in Moroccan Dutch. The [s] realizations of 21 young, male Moroccan Dutch and 21 endogenous Dutch speakers were analyzed. Analyses of the spectral centre of gravity (CoG) show that both groups of speakers had more retracted pronunciations of [s] in typically retracting contexts than in typically non-retracting contexts. However, Moroccan Dutch speakers had higher CoG in both contexts than endogenous Dutch speakers, refuting the stronger retraction expected in Moroccan Dutch speakers. The inter-speaker variation was larger between Moroccan Dutch speakers than between endogenous-Dutch speakers, refuting the expected usage of /s/ retraction as a group identity marker.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.