Evaluating hierarchical hybrid statistical language models

Galescu, Lucian; Allen, James F.

doi:10.21437/icslp.2000-431

Cited by 1 publication

(2 citation statements)

References 14 publications


“…We compute the APP value on the full test set, and thus we can compare two models with different vocabularies. We again refer the reader to [5] for more details on our evaluation procedure.…”

Section: Results (mentioning)

confidence: 99%

“…Note that the probability depends only on the fact that the word seen in the test data is a number, and not on which number it is; in particular, numbers not encountered in the training data will receive the same probability as the ones encountered. For a comparison between this probability model and a more conventional one, see [5].…”

Section: The Models and The Adaptation Procedures (mentioning)

confidence: 99%


Hierarchical statistical language models: experiments on in-domain adaptation

Galescu, Allen

2000

6th International Conference on Spoken Language Processing (ICSLP 2000)

Self Cite


We introduce a hierarchical statistical language model, represented as a collection of local models plus a general sentence model. We provide an example that mixes a trigram general model and a PFSA local model for the class of decimal numbers, described in terms of sub-word units (graphemes). This model practically extends the vocabulary of the overall model to an infinite size, but still has better performance compared to a word-based model. Using in-domain language model adaptation experiments, we show that local models can encode enough linguistic information, if well trained, that they may be ported to new language models without re-estimation.
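The combination described in the abstract can be illustrated with a minimal sketch: a general sentence model scores a class token, and a local PFSA over graphemes scores the actual digit string. Everything concrete here is an assumption made for illustration and is not taken from the paper: the state names, the arc probabilities, the `<NUMBER>` token, and the `general_log_prob` callable that stands in for the trigram model.

```python
import math

DIGITS = "0123456789"
END = "</n>"  # hypothetical end-of-number symbol


def uniform_digits(mass, next_state):
    """Spread `mass` evenly over the ten digit arcs leading to `next_state`."""
    return {d: (mass / 10, next_state) for d in DIGITS}


# Hypothetical PFSA for decimal numbers, spelled out grapheme by grapheme.
# Each state maps a symbol to (probability, next state); the outgoing
# probabilities of every state sum to 1. The values are invented.
PFSA = {
    "start": uniform_digits(1.0, "int"),
    "int": {**uniform_digits(0.5, "int"), ".": (0.2, "point"), END: (0.3, None)},
    "point": uniform_digits(1.0, "frac"),
    "frac": {**uniform_digits(0.4, "frac"), END: (0.6, None)},
}


def number_log_prob(graphemes):
    """Log probability of a grapheme string under the local PFSA."""
    state, logp = "start", 0.0
    for sym in list(graphemes) + [END]:
        arcs = PFSA[state]
        if sym not in arcs:
            return float("-inf")  # string is not accepted as a decimal number
        prob, state = arcs[sym]
        logp += math.log(prob)
    return logp


def sentence_log_prob(words, general_log_prob):
    """Score a sentence: general model over words and class tokens,
    plus the local model for each word the PFSA accepts as a number."""
    history = ["<s>", "<s>"]
    total = 0.0
    for word in words:
        local = number_log_prob(word)
        token = "<NUMBER>" if local > float("-inf") else word
        # Trigram-style context: the two preceding tokens.
        total += general_log_prob(token, tuple(history[-2:]))
        if token == "<NUMBER>":
            total += local
        history.append(token)
    return total
```

Because the local model assigns nonzero probability to any well-formed digit string, a number never seen in training still gets a score, which is the sense in which the vocabulary becomes effectively unbounded. A call might look like `sentence_log_prob(["pay", "3.14"], my_trigram)`, where `my_trigram` is any function taking a token and a two-token history and returning a log probability.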
