“…This is in contrast to many models of word segmentation that suggest that tracking of sequential probabilities arises from a completely different set of processes than learning of phonological regularities such as predominant lexical stress, phonology, or phonotactics. For example, in the StaGE model (Adriaans & Kager, 2010), learners identify words via calculation of transitional probabilities and then deduce phonological and phonotactic constraints via a hierarchical ranking of constraints similar to that of Optimality Theory (Tesar & Smolensky, 2000). Similarly, Mersad and Nazzi (2011) proposed that learners use a hierarchical ranking of phonological cues to parse the speech stream, relying on sequential statistics only in cases where these cues are uninformative.…”