“…However, when the same cues are used in the context of a simple, generative probability model with improved unsupervised parameter estimation, the syllablebased models substantially outperform the phoneme-based models. Indeed, the syllable-based transitional probability phonotactic model achieves a word token segmentation f-score of nearly 80%, which is the highest reported performance among purely phonotactically-based segmentation models (Adriaans & Kager, 2010;Daland & Pierrehumbert, 2011). Indeed, this performance compares favorably with state-of-theart segmentation models that involve learning of higher level regularities, such as the lexicon and collocations (Brent, 1999;Venkataraman, 2001;Johnson, 2008a;Goldwater et al, 2009;, and demonstrates that good segmentation performance can be achieved by exploiting simple syllable-level phonotactic cues.…”