2010
DOI: 10.1111/j.1551-6709.2010.01160.x
|View full text |Cite
|
Sign up to set email alerts
|

Learning Diphone‐Based Segmentation

Abstract: This paper reconsiders the diphone-based word segmentation model of Cairns, Shillcock, Chater, and Levy (1997) and Hockema (2006), previously thought to be unlearnable. A statistically principled learning model is developed using Bayes' theorem and reasonable assumptions about infants' implicit knowledge. The ability to recover phrase-medial word boundaries is tested using phonetic corpora derived from spontaneous interactions with children and adults. The (unsupervised and semisupervised) learning models are … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4

Citation Types

2
73
0

Year Published

2012
2012
2022
2022

Publication Types

Select...
3
2
2

Relationship

0
7

Authors

Journals

citations
Cited by 62 publications
(80 citation statements)
references
References 87 publications
(166 reference statements)
2
73
0
Order By: Relevance
“…Moreover, such "diphone-based segmentation" has proved valid and learnable in a computational Bayesian model recently developed by Daland and Pierrehumbert (2011). They assessed whether the model could recover word boundaries based on the identity of the surrounding diphone.…”
mentioning
confidence: 99%
“…Moreover, such "diphone-based segmentation" has proved valid and learnable in a computational Bayesian model recently developed by Daland and Pierrehumbert (2011). They assessed whether the model could recover word boundaries based on the identity of the surrounding diphone.…”
mentioning
confidence: 99%
“…It complements that of recent work investigating the use of phoneme-level statistical regularities for segmentation (Adriaans & Kager, 2010;Daland & Pierrehumbert, 2011). Our work differs from these latter approaches, however, in comparing several phonotactic models, including ones relying on the syllable-based transitional probability statistics investigated in infant research.…”
Section: Introductionmentioning
confidence: 45%
“…However, when the same cues are used in the context of a simple, generative probability model with improved unsupervised parameter estimation, the syllablebased models substantially outperform the phoneme-based models. Indeed, the syllable-based transitional probability phonotactic model achieves a word token segmentation f-score of nearly 80%, which is the highest reported performance among purely phonotactically-based segmentation models (Adriaans & Kager, 2010;Daland & Pierrehumbert, 2011). Indeed, this performance compares favorably with state-of-theart segmentation models that involve learning of higher level regularities, such as the lexicon and collocations (Brent, 1999;Venkataraman, 2001;Johnson, 2008a;Goldwater et al, 2009;, and demonstrates that good segmentation performance can be achieved by exploiting simple syllable-level phonotactic cues.…”
Section: Introductionmentioning
confidence: 94%
See 2 more Smart Citations