In recent years, many alternative models have been proposed to address some of the shortcomings of the hidden Markov model, currently the most popular approach to speech recognition. In particular, a variety of models that could be broadly classified as segment models have been described for representing a variable-length sequence of observation vectors in speech recognition applications. Since these approaches have many aspects in common, including the general recognition and training problems, it is useful to consider them in a unified framework. Thus, the goal of this paper is to describe a general stochastic model that encompasses most of the models proposed in the literature, pointing out similarities among the models in terms of correlation and parameter-tying assumptions, and drawing analogies between segment models and hidden Markov models. In addition, we summarize experimental results assessing different modeling assumptions, and point out remaining open questions.
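The structural contrast the abstract draws can be sketched in a few lines. A minimal illustration, under assumed stand-in scoring functions (`frame_logprob`, `joint_logprob`, `duration_logprob` are hypothetical placeholders, not part of any paper's actual model): an HMM state scores a variable-length segment frame by frame with an implicit geometric duration from self-loops, whereas a segment model scores the whole observation sequence jointly with an explicit duration distribution.

```python
def hmm_segment_score(frames, frame_logprob, self_loop_logprob):
    """HMM-style score: independent per-frame output log-probabilities,
    with duration modeled implicitly by repeated self-loop transitions."""
    output = sum(frame_logprob(f) for f in frames)
    duration = self_loop_logprob * (len(frames) - 1)  # geometric duration
    return output + duration

def segment_model_score(frames, joint_logprob, duration_logprob):
    """Segment-model-style score: an explicit duration distribution plus a
    joint log-probability over the entire variable-length segment, which
    can capture correlation across frames that the HMM assumes away."""
    return duration_logprob(len(frames)) + joint_logprob(frames)
```

The key difference is that `joint_logprob` sees all frames of the segment at once, so intra-segment correlation assumptions live there rather than being forced into frame independence.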
This paper presents our recent effort to improve our Arabic Broadcast News (BN) recognition system by using thousands of hours of untranscribed Arabic audio for unsupervised training. Unsupervised training is first carried out on the 1,900-hour English Topic Detection and Tracking (TDT) corpus and is compared with the lightly supervised training method that we used for the DARPA EARS evaluations. The comparison shows that unsupervised training produces a 21.7% relative reduction in word error rate (WER), comparable to the gain obtained with light supervision. The same unsupervised training strategy applied to a similar amount of Arabic BN data produces an 11.6% relative gain. The gain, though considerable, is substantially smaller than that observed on the English data. Our initial work towards understanding the reasons for this difference is also described.
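One iteration of unsupervised training of the kind described above can be sketched as follows. This is a minimal sketch, not the paper's implementation: `decode` and `train` are hypothetical stand-ins for a full ASR toolkit, and the confidence-filtering threshold is an assumed illustration.

```python
def unsupervised_iteration(seed_model, untranscribed_audio,
                           decode, train, min_confidence=0.9):
    """Decode untranscribed audio with a seed model, keep only the
    high-confidence automatic transcripts, and retrain on them.

    decode(model, utterance) -> (hypothesis_text, confidence) [assumed]
    train(pairs)             -> new acoustic model            [assumed]
    """
    selected = []
    for utterance in untranscribed_audio:
        hypothesis, confidence = decode(seed_model, utterance)
        if confidence >= min_confidence:  # filter likely recognition errors
            selected.append((utterance, hypothesis))
    return train(selected)
```

In practice the loop is repeated: the retrained model re-decodes the pool, usually recovering more usable transcripts on each pass.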
This paper describes a general formalism for integrating two or more speech recognition technologies, which could be developed at different research sites using different recognition strategies. In this formalism, one system uses the N-best search strategy to generate a list of candidate sentences; the list is rescored by other systems; and the different scores are combined to optimize performance. Specifically, we report on combining the BU system based on stochastic segment models and the BBN system based on hidden Markov models. In addition to facilitating integration of different systems, the N-best approach results in a large reduction in computation for word recognition using the stochastic segment model.
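The rescoring-and-combination step of this formalism can be sketched generically. A minimal sketch, assuming log-domain scores and externally tuned combination weights (the `rescorers` functions and `weights` are hypothetical placeholders for the participating systems; the paper's actual weight optimization is not shown):

```python
def combine_nbest(nbest, rescorers, weights):
    """Pick the best hypothesis from an N-best list after rescoring.

    nbest:     list of (sentence, first_pass_score) from the first system
    rescorers: functions mapping a sentence to a log-domain score,
               one per additional system
    weights:   one weight per score (first-pass plus each rescorer)
    """
    best_sentence, best_score = None, float("-inf")
    for sentence, first_pass in nbest:
        scores = [first_pass] + [r(sentence) for r in rescorers]
        total = sum(w * s for w, s in zip(weights, scores))  # weighted sum
        if total > best_score:
            best_sentence, best_score = sentence, total
    return best_sentence
```

Because each rescorer only evaluates the N candidate sentences rather than searching the full word lattice, an expensive model such as the stochastic segment model does far less work than in first-pass decoding.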