Paraphrastic neural network language models

Liu, Xunying; Gales, Mark J. F.; Woodland, Philip C.

doi:10.1109/icassp.2014.6854534

Cited by 1 publication

(3 citation statements)

References 21 publications

(37 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Experimental results suggest the proposed method is also effective in improving RNNLM performance, consistent with the improvements reported in the earlier research on back-off n-gram LMs [14] and feedforward NNLMs [15]. Significant error rate reductions of 1.94% absolute (12% relative) were obtained on a state-of-the-art large vocabulary speech recognition task.…”

Section: Conclusion and Relation To Prior Worksupporting

confidence: 85%

“…This form of intuitive and interpretable counts smoothing automatically re-distributes statistics to alternative expressions of the same observed word sequence. It was previously exploited to improve the context coverage and generalization for several forms of LMs that do not explicitly capture the expressive richness related variability in natural languages, including back-off n-gram LMs [14], and feedforward NNLMs [15].…”

Section: Generation Of Paraphrase Variantsmentioning

confidence: 99%

“…This intuitively and interpretable discounting method can be exploited by many different forms of LMs that do not explicitly model the expressive richness of natural languages. In previous research, this technique were used to improve the performance of back-off n-gram LMs [14] and feedforward NNLMs [15].…”

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

Paraphrastic recurrent neural network language models

Liu

Chen

Gales

et al. 2015

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Self Cite

View full text Add to dashboard Cite

Recurrent neural network language models (RNNLM) have become an increasingly popular choice for state-of-the-art speech recognition systems. Linguistic factors influencing the realization of surface word sequences, for example, expressive richness, are only implicitly learned by RNNLMs. Observed sentences and their associated alternative paraphrases representing the same meaning are not explicitly related during training. In order to improve context coverage and generalization, paraphrastic RNNLMs are investigated in this paper. Multiple paraphrase variants were automatically generated and used in paraphrastic RNNLM training. Using a paraphrastic multi-level RNNLM modelling both word and phrase sequences, significant error rate reductions of 0.6% absolute and perplexity reduction of 10% relative were obtained over the baseline RNNLM on a large vocabulary conversational telephone speech recognition system trained on 2000 hours of audio and 545 million words of texts. The overall improvement over the baseline n-gram LM was increased from 8.4% to 11.6% relative.

show abstract

Section: Conclusion and Relation To Prior Worksupporting

confidence: 85%

Section: Generation Of Paraphrase Variantsmentioning

confidence: 99%