Rami Botros scite author profile

In this work, we uncover a theoretical connection between two language model interpolation techniques, count merging and Bayesian interpolation. We compare these techniques as well as linear interpolation in three scenarios with abundant training data per component model. Consistent with prior work, we show that both count merging and Bayesian interpolation outperform linear interpolation. We include the first (to our knowledge) published comparison of count merging and Bayesian interpolation, showing that the two techniques perform similarly. Finally, we argue that other considerations will make Bayesian interpolation the preferred approach in most circumstances.


A Unified Cascaded Encoder ASR Model for Dynamic Model Sizes

Ding, Wang, Zhao et al. 2022


Connecting and Comparing Language Model Interpolation Techniques

Pusateri, Gysel, Botros et al. 2019

Preprint



Rami Botros

A deep learning approach to traffic lights: Detection, tracking, and classification

An Efficient Streaming Non-Recurrent On-Device End-to-End Model with Improvements to Rare-Word Modeling

Tied & Reduced RNN-T Decoder

Improving The Latency And Quality Of Cascaded Encoders

On efficient training of word classes and their application to recurrent neural network language models

Connecting and Comparing Language Model Interpolation Techniques

A Unified Cascaded Encoder ASR Model for Dynamic Model Sizes
