Oliver Walter scite author profile

In this paper we present an algorithm for the unsupervised segmentation of a lattice produced by a phoneme recognizer into words. Using a lattice rather than a single phoneme string accounts for the uncertainty of the recognizer about the true label sequence. An example application is the discovery of lexical units from the output of an error-prone phoneme recognizer in a zero-resource setting, where neither the lexicon nor the language model (LM) is known. We propose a computationally efficient iterative approach that alternates between two steps. First, the most probable string is extracted from the lattice using a phoneme LM learned on the segmentation result of the previous iteration. Second, word segmentation is performed on the extracted string using a word and phoneme LM that is learned alongside the new segmentation. We present results on lattices produced by a phoneme recognizer on the WSJ-CAM0 dataset. We show that our approach delivers better segmentation performance than an earlier approach found in the literature, in particular for higher-order language models.
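The first of the two alternating steps, extracting the most probable string from the lattice under a phoneme LM, can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the lattice is simplified to a confusion network (a list of slots, each mapping candidate phonemes to acoustic log-scores), the phoneme LM is an add-one smoothed bigram rather than the higher-order models the abstract mentions, and the names `train_bigram_lm` and `best_path` are hypothetical. The word segmentation step is not shown.

```python
import math
from collections import defaultdict


def train_bigram_lm(sequences, vocab):
    """Estimate an add-one smoothed bigram phoneme LM (illustrative stand-in).

    `sequences` plays the role of the previous iteration's result: a list of
    phoneme lists. Returns a function logp(prev, cur); '<s>' marks the start.
    """
    counts = defaultdict(lambda: defaultdict(int))
    for seq in sequences:
        prev = "<s>"
        for ph in seq:
            counts[prev][ph] += 1
            prev = ph
    size = len(vocab)

    def logp(prev, cur):
        total = sum(counts[prev].values())
        return math.log((counts[prev][cur] + 1) / (total + size))

    return logp


def best_path(lattice, logp):
    """Viterbi search over a confusion-network-style lattice.

    `lattice` is a list of slots; each slot maps a phoneme to its acoustic
    log-score. The score of a path is the sum of acoustic and LM log-scores.
    """
    # best[ph] = (score, path) of the best hypothesis ending in phoneme ph
    best = {"<s>": (0.0, [])}
    for slot in lattice:
        new_best = {}
        for ph, acoustic in slot.items():
            score, path = max(
                ((s + logp(prev, ph) + acoustic, p) for prev, (s, p) in best.items()),
                key=lambda item: item[0],
            )
            new_best[ph] = (score, path + [ph])
        best = new_best
    return max(best.values(), key=lambda item: item[0])[1]


# Toy data (made up): phoneme strings from a hypothetical previous iteration.
vocab = {"a", "b", "k", "t"}
previous = [["k", "a", "t"], ["k", "a", "t"], ["b", "a", "t"]]
lm = train_bigram_lm(previous, vocab)

# The recognizer slightly prefers 'k' in the last slot; the LM prefers 't'.
lattice = [
    {"k": math.log(0.6), "t": math.log(0.4)},
    {"a": math.log(0.9), "b": math.log(0.1)},
    {"t": math.log(0.45), "k": math.log(0.55)},
]

with_lm = best_path(lattice, lm)
acoustic_only = best_path(lattice, lambda prev, cur: 0.0)
print(with_lm, acoustic_only)
```

In this toy case the acoustic scores alone favor a different final phoneme than the combined acoustic and LM scores, which is the kind of recognizer error a lattice leaves room to correct. In the iterative scheme described above, the string returned by such a search would then be passed to the word segmentation step, and the LM re-estimated from its result.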

Smartphone-based sensor fusion for improved vehicular navigation

Walter, Schmalenstroeer, Engler, et al. 2013

Barometric height estimation combined with map-matching in a loosely-coupled Kalman-filter

Bevermeier, Walter, Peschke, et al. 2010

Oliver Walter

A hierarchical system for word discovery exploiting DTW-based initialization

Unsupervised word segmentation from noisy input

Iterative Bayesian word segmentation for unsupervised vocabulary discovery from phoneme lattices

Smartphone-based sensor fusion for improved vehicular navigation

Barometric height estimation combined with map-matching in a loosely-coupled Kalman-filter
