Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - ACL '03 2003
DOI: 10.3115/1075096.1075102
|View full text |Cite
|
Sign up to set email alerts
|

Generalized algorithms for constructing statistical language models

Abstract: Recent text and speech processing applications such as speech mining raise new and more general problems related to the construction of language models. We present and describe in detail several new and efficient algorithms to address these more general problems and report experimental results demonstrating their usefulness. We give an algorithm for computing efficiently the expected counts of any sequence in a word lattice output by a speech recognizer or any arbitrary weighted automaton; describe a new techn… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
83
0

Year Published

2005
2005
2023
2023

Publication Types

Select...
6
3
1

Relationship

1
9

Authors

Journals

citations
Cited by 95 publications
(83 citation statements)
references
References 14 publications
0
83
0
Order By: Relevance
“…Continuing the above example, we can σ-compose the decoding graph H with a bigram language model (LM) L in its FST form [15] with a slight modification. We require the output labels of the LM FST to include the history labels alongside the current label.…”
Section: Fig 1 From Left To Rightmentioning
confidence: 99%
“…Continuing the above example, we can σ-compose the decoding graph H with a bigram language model (LM) L in its FST form [15] with a slight modification. We require the output labels of the LM FST to include the history labels alongside the current label.…”
Section: Fig 1 From Left To Rightmentioning
confidence: 99%
“…Our procedure for building A[W] is sketched in Algorithm 1, where TrieInsert inserts a string in a trie, Pref(W) computes the set of prefixes of the strings in W, 3 LgSuff(v, U) returns the longest suffix of v in U, and FailureTrans is a special ε-transition used only when no labelled transition exists (Allauzen et al, 2003). 4 Each state (or pattern prefix) v in A[W] is associated with a set of feature functions {f u,g , ∀u ∈ Suff(v), g}.…”
Section: Variable Order Crfs (Vocrfs)mentioning
confidence: 99%
“…These algorithms are useful in a variety of applications including statistical language modeling [7], parsing [45,44,8], phonological rule compilation [29,30,49], speech recognition [38,43,48], speech synthesis [56,6], image processing [2], bioinformatics [21,9], sequence modeling and prediction [18], optical character recognition [14], and more generally any problem related to sequences and probabilistic models defined over sequences [33,43]. An efficient implementation of these algorithms and several others, including an on-demand implementation when possible, is available from the FSM library (executables only) [47] and the OpenFst library (source and executables) [10].…”
Section: Resultsmentioning
confidence: 99%