Perceptual systems must make sense of a world that is not only noisy and ambiguous but that also varies from situation to situation. Human speech perception is a domain where this problem has long been acknowledged: individual talkers vary substantially in how they use acoustic cues to produce linguistic units. Yet how the speech perception system solves this problem of talker variability remains poorly understood. This thesis presents a computational framework, the ideal adapter, for understanding the problem and how the speech perception system solves it. The framework's basic insight is that variability in speech is not arbitrary but structured: individual talkers are reasonably consistent in how they produce cues, and talkers tend to cluster into groups by gender, regional background, and other factors. This structure means that listeners can use their previous experience with other talkers to guide perception of unfamiliar talkers, as well as of familiar talkers they encounter again. The ideal adapter unifies a large and messy literature on how listeners cope with talker variability, leads to quantitative models that fit human behavior well in a variety of situations, and makes specific, testable predictions that open new frontiers in understanding speech perception. The framework also applies to perception more generally, highlighting how speech perception can serve as a model organism for understanding how perceptual systems cope with a variable but structured world.
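To make the core computation concrete, the following is a schematic Bayesian sketch of the ideal adapter's categorization and adaptation steps; the notation ($x$, $c$, $\theta$, $\mathcal{X}$) is introduced here for illustration and is not drawn from the text above. On this reading, a listener categorizes an acoustic cue $x$ by marginalizing over uncertainty about the current talker's cue distributions, and updates beliefs about those distributions from accumulated experience $\mathcal{X}$ with this and similar talkers:
\[
p(c \mid x, \mathcal{X}) \;\propto\; p(c) \int p(x \mid c, \theta)\, p(\theta \mid \mathcal{X})\, d\theta,
\qquad
p(\theta \mid \mathcal{X}) \;\propto\; p(\mathcal{X} \mid \theta)\, p(\theta),
\]
where $c$ is a linguistic category and $\theta$ parameterizes a talker's mapping from categories to cues. The structure described above enters through the prior $p(\theta)$, which reflects experience with other talkers (e.g., their clustering by gender or regional background), while experience with the current talker sharpens the posterior $p(\theta \mid \mathcal{X})$.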