2006
DOI: 10.1007/11872436_9
Stochastic Analysis of Lexical and Semantic Enhanced Structural Language Model

Abstract: In this paper, we present a directed Markov random field model that integrates trigram models, structural language models (SLM), and probabilistic latent semantic analysis (PLSA) for the purpose of statistical language modeling. The SLM is essentially a generalization of shift-reduce probabilistic push-down automata, thus more complex and powerful than probabilistic context-free grammars (PCFGs). The added context-sensitiveness due to trigrams and PLSAs and violation of tree structure in the topology o…

Cited by 2 publications (12 citation statements); references 17 publications (24 reference statements).
“…When combining n-gram, m-SLM, and PLSA together to build a composite generative language model under the directed MRF paradigm (Wang et al. 2005b, 2006), the composite language model is simply a complicated generative model that has four operators: WORD-PREDICTOR, TAGGER, CONSTRUCTOR, and SEMANTIZER. The TAGGER and CONSTRUCTOR in the SLM and the SEMANTIZER in PLSA remain unchanged; the WORD-PREDICTORs in the n-gram, m-SLM, and PLSA, however, are combined to form a stronger WORD-PREDICTOR that generates the next word, w_{k+1}, depending not only on the m most recently exposed headwords h_{-m}^{-1} in the word-parse k-prefix but also on its n-gram history w_{k-n+2}^{k} and its semantic content g_{k+1}.…”
Section: The Composite n-gram/SLM/PLSA Language Model
confidence: 99%
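The combined WORD-PREDICTOR described in this citation statement can be illustrated with a toy sketch. This is not the paper's directed-MRF composition or its estimated parameters: it stands in a simple linear interpolation of three hypothetical component predictors (n-gram, headword-conditioned, and topic-conditioned), and all vocabularies, weights, and component distributions below are invented for illustration.

```python
# Hypothetical sketch of a composite WORD-PREDICTOR: the next-word
# distribution conditions jointly on the n-gram history, the exposed
# headwords, and a PLSA-style topic mixture. Linear interpolation is a
# stand-in for the paper's directed-MRF combination; components and
# weights are illustrative assumptions, not the paper's model.
from collections import defaultdict

VOCAB = ["the", "cat", "sat", "mat", "dog"]

def ngram_pred(history):
    # Toy n-gram component: uniform here purely for illustration.
    return {w: 1.0 / len(VOCAB) for w in VOCAB}

def slm_pred(headwords):
    # Toy syntactic component conditioned on exposed headwords.
    probs = {w: 1.0 for w in VOCAB}
    if "cat" in headwords:          # a headword nudges a verb-like word
        probs["sat"] += 1.0
    z = sum(probs.values())
    return {w: p / z for w, p in probs.items()}

def plsa_pred(topic_mixture):
    # Toy semantic component: mixture of topic-specific unigram models.
    topic_unigrams = {
        "pets":  {"cat": 0.4, "dog": 0.4, "the": 0.1, "sat": 0.05, "mat": 0.05},
        "other": {w: 0.2 for w in VOCAB},
    }
    probs = defaultdict(float)
    for topic, weight in topic_mixture.items():
        for w, p in topic_unigrams[topic].items():
            probs[w] += weight * p
    return dict(probs)

def composite_word_predictor(history, headwords, topic_mixture,
                             lambdas=(0.4, 0.3, 0.3)):
    """Interpolate the three component predictors and renormalize."""
    comps = [ngram_pred(history), slm_pred(headwords), plsa_pred(topic_mixture)]
    probs = {w: sum(l * c.get(w, 0.0) for l, c in zip(lambdas, comps))
             for w in VOCAB}
    z = sum(probs.values())
    return {w: p / z for w, p in probs.items()}

dist = composite_word_predictor(["the", "cat"], ["cat"], {"pets": 0.7, "other": 0.3})
print(max(dist, key=dist.get))
```

The design point the statement makes is that the three component WORD-PREDICTORs merge into a single conditional distribution over the next word, while the TAGGER, CONSTRUCTOR, and SEMANTIZER operators are reused unchanged.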
“…The composite n-gram/m-SLM/PLSA language model can be formulated as a rather complex chain-tree-table directed MRF model (Wang et al. 2006), where the hidden information is the parse tree T and the semantic content g. The n-gram encodes local word interactions, the m-SLM models the sentence's syntactic structure, and the PLSA captures the document's semantic content; all interact together to constrain the generation of natural language. The WORD-PREDICTOR generates the next word w_{k+1} with probability p…”
Section: The Composite n-gram/SLM/PLSA Language Model
confidence: 99%