Learning Diphone‐Based Segmentation

Daland, Robert; Pierrehumbert, Janet B.

doi:10.1111/j.1551-6709.2010.01160.x

Cited by 62 publications

(80 citation statements)

References 87 publications

(166 reference statements)


“…Moreover, such "diphone-based segmentation" has proved valid and learnable in a computational Bayesian model recently developed by Daland and Pierrehumbert (2011). They assessed whether the model could recover word boundaries based on the identity of the surrounding diphone.…”

mentioning

confidence: 99%

Diphones-fr: A French database of diphone positional frequency

New, Spinelli

2012

Behav Res


The aim of this article is to describe a database of diphone positional frequencies in French. More specifically, we provide frequencies for word-initial, word-internal, and word-final diphones of all words extracted from a subtitle corpus of 50 million words that come from movie and TV series dialogue. We also provide intra- and intersyllable diphone frequencies, as well as interword diphone frequencies. To our knowledge, no other such tool is available to psycholinguists for the study of French sequential probabilities. This database and its new indicators should help researchers conducting new studies on speech segmentation.
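As a rough illustration of what positional diphone frequencies are, the sketch below counts word-initial, word-internal, word-final, and interword diphones over a toy input. It is not the procedure used to build the database: the function names, the representation of words as tuples of phoneme symbols, and the choice to count the single diphone of a two-phoneme word as both initial and final are all assumptions made for this example.

```python
from collections import Counter

def positional_diphone_counts(words):
    """Count diphones by their position inside each word token.

    `words` is an iterable of phoneme tuples, one tuple per word token
    (a hypothetical input format chosen for this sketch).
    """
    counts = {"initial": Counter(), "internal": Counter(), "final": Counter()}
    for phones in words:
        diphones = list(zip(phones, phones[1:]))
        last = len(diphones) - 1
        for i, diphone in enumerate(diphones):
            if i == 0:
                counts["initial"][diphone] += 1
            if i == last:
                # Assumption: a two-phoneme word's only diphone is
                # counted as both initial and final.
                counts["final"][diphone] += 1
            if 0 < i < last:
                counts["internal"][diphone] += 1
    return counts

def interword_diphone_counts(utterance):
    """Count diphones that straddle a word boundary within one utterance."""
    counts = Counter()
    for left, right in zip(utterance, utterance[1:]):
        if left and right:
            counts[(left[-1], right[0])] += 1
    return counts

# Toy data, invented for illustration only.
toy_words = [("s", "a", "l", "y"), ("l", "a"), ("s", "a", "k")]
toy_counts = positional_diphone_counts(toy_words)
toy_interword = interword_diphone_counts([("l", "a"), ("s", "a", "k")])
```

Dividing each count by the total for its position would turn these raw counts into relative frequencies.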



“…It complements that of recent work investigating the use of phoneme-level statistical regularities for segmentation (Adriaans & Kager, 2010; Daland & Pierrehumbert, 2011). Our work differs from these latter approaches, however, in comparing several phonotactic models, including ones relying on the syllable-based transitional probability statistics investigated in infant research.…”

Section: Introduction

mentioning

confidence: 45%

“…However, when the same cues are used in the context of a simple, generative probability model with improved unsupervised parameter estimation, the syllable-based models substantially outperform the phoneme-based models. Indeed, the syllable-based transitional probability phonotactic model achieves a word token segmentation f-score of nearly 80%, which is the highest reported performance among purely phonotactically-based segmentation models (Adriaans & Kager, 2010; Daland & Pierrehumbert, 2011). Indeed, this performance compares favorably with state-of-the-art segmentation models that involve learning of higher level regularities, such as the lexicon and collocations (Brent, 1999; Venkataraman, 2001; Johnson, 2008a; Goldwater et al, 2009), and demonstrates that good segmentation performance can be achieved by exploiting simple syllable-level phonotactic cues.…”

Section: Introduction

mentioning

confidence: 94%

“…A number of phonotactically-based segmentation models, focusing on the discovery of word boundaries based on phoneme-level statistics, have achieved more promising results (Adriaans & Kager, 2010; Daland & Pierrehumbert, 2011; see also Brent, 1999). For example, Brent (1999) showed that a local minimum strategy relying on phoneme bigrams correctly extracts about 50% of word tokens in English child-directed speech.…”

Section: Introduction

mentioning

confidence: 99%

“…This work has shown that infants utilize these low-level statistical regularities to segment speech during the second half of the first year of life before they have developed extensive vocabularies that could provide top-down lexical information to guide segmentation. Developmental research indicates that on average infants know fewer than 100 word types during this period (Dale & Fenson, 1996; Daland & Pierrehumbert, 2011).…”

Section: Introduction

mentioning

confidence: 99%


Comparing Models of Phonotactics for Word Segmentation

Schrimpf, Jarosz

2014

Proceedings of the 2014 Joint Meeting of SIGMORPHON and SIGFSM


Developmental research indicates that infants use low-level statistical regularities, or phonotactics, to segment words from continuous speech. In this paper, we present a segmentation framework that enables the direct comparison of different phonotactic models for segmentation. We compare a model using phoneme transitional probabilities, which have been widely used in computational models, to syllable-based bigram models, which have played a prominent role in the developmental literature. We also introduce a novel estimation method, and compare it to other strategies for estimating the parameters of the phonotactic models from unsegmented data. The results show that syllable-based models outperform the phoneme models, specifically in the context of improved unsupervised parameter estimation. The syllable-based transitional probability model achieves a word token f-score of nearly 80%, the highest reported performance for a phonotactic segmentation model with no lexicon.
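To make the two quantities in this abstract concrete, the sketch below estimates phoneme transitional probabilities from unsegmented input, places a boundary wherever the probability dips to a local minimum, and scores the result with a word token f-score. It is a simplified stand-in, not the generative model or the estimation method described in the paper: the local-minimum rule, the function names, and the tuple-of-phonemes input format are all assumptions made for illustration.

```python
from collections import Counter

def transitional_probs(utterances):
    """Estimate forward transitional probabilities P(b | a) from
    unsegmented phoneme sequences (hypothetical input format)."""
    pair_counts, first_counts = Counter(), Counter()
    for phones in utterances:
        for a, b in zip(phones, phones[1:]):
            pair_counts[(a, b)] += 1
            first_counts[a] += 1
    return {pair: n / first_counts[pair[0]] for pair, n in pair_counts.items()}

def segment_at_local_minima(phones, tp):
    """Split `phones` wherever the transitional probability is lower
    than at both neighbouring transitions (an assumed heuristic)."""
    probs = [tp.get(pair, 0.0) for pair in zip(phones, phones[1:])]
    words, start = [], 0
    for i in range(1, len(probs) - 1):
        if probs[i] < probs[i - 1] and probs[i] < probs[i + 1]:
            # probs[i] is the transition phones[i] -> phones[i + 1],
            # so the boundary falls after phones[i].
            words.append(tuple(phones[start:i + 1]))
            start = i + 1
    words.append(tuple(phones[start:]))
    return words

def token_spans(words):
    """Convert a word sequence into a set of (start, end) phoneme offsets."""
    spans, pos = set(), 0
    for word in words:
        spans.add((pos, pos + len(word)))
        pos += len(word)
    return spans

def token_f_score(predicted, gold):
    """Word token f-score: a token is correct only if both edges match."""
    p, g = token_spans(predicted), token_spans(gold)
    hits = len(p & g)
    if hits == 0:
        return 0.0
    precision, recall = hits / len(p), hits / len(g)
    return 2 * precision * recall / (precision + recall)
```

A token counts as correct only when both of its boundaries match the gold segmentation, which is why token f-score is a stricter measure than boundary accuracy.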
