Proceedings of the Fourth International Conference on Spoken Language Processing (ICSLP '96)
DOI: 10.1109/icslp.1996.607854

Smoothed local adaptation of connectionist systems

Abstract: ABBOT is the hybrid connectionist-hidden Markov model (HMM) large vocabulary continuous speech recognition system developed at Cambridge University Engineering Department. ABBOT makes effective use of the linear input network (LIN) adaptation technique to achieve speaker and channel adaptation. Although the LIN is effective at adapting to new speakers or a new environment (e.g. a different microphone), the transform is global over the input space. In this paper we describe a technique by which the transform may be m…
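
As background to the abstract: the LIN technique prepends a trainable linear transform to the input of a frozen, speaker-independent network and estimates only that transform from a small amount of adaptation data; because a single transform is applied to every frame, it is global over the input space, which is the limitation this paper addresses. The following is a minimal, hypothetical sketch of LIN-style adaptation, not the authors' implementation: base_model (assumed to map feature frames to phone-class logits), adapt_loader, and the feature dimensionality are illustrative assumptions.

import torch
import torch.nn as nn

FEAT_DIM = 39  # assumed feature dimensionality (e.g. cepstra plus derivatives)

class LINAdapted(nn.Module):
    # Prepend a speaker/channel-specific linear transform to a frozen network.
    def __init__(self, base_model: nn.Module, feat_dim: int):
        super().__init__()
        self.lin = nn.Linear(feat_dim, feat_dim)
        nn.init.eye_(self.lin.weight)   # start from the identity transform,
        nn.init.zeros_(self.lin.bias)   # i.e. the speaker-independent system
        self.base_model = base_model
        for p in self.base_model.parameters():
            p.requires_grad = False     # only the LIN parameters are adapted

    def forward(self, x):
        return self.base_model(self.lin(x))

def adapt(model: LINAdapted, adapt_loader, epochs: int = 5, lr: float = 1e-3):
    # Train only the linear input network on a small amount of adaptation data
    # (frame-level features paired with phone targets).
    opt = torch.optim.SGD(model.lin.parameters(), lr=lr)
    loss_fn = nn.CrossEntropyLoss()
    for _ in range(epochs):
        for feats, targets in adapt_loader:
            opt.zero_grad()
            loss_fn(model(feats), targets).backward()
            opt.step()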

Cited by 3 publications (3 citation statements); references 5 publications.

Citation statements:

“…Although we cannot compare the framewise phoneme error of BLSTM directly with the phoneme error of the HMM, we expect that a BLSTM-HMM hybrid (under construction) will outperform both plain BLSTM at the frame level and plain HMMs at the phoneme level, inheriting the best of both worlds, namely reduction of training material and training time (BLSTM), as well as more built-in structural bias (HMMs). This expectation is encouraged by experiments on read speech by Chen and Jamieson [3], Shire [14], Waterhouse, Kershaw and Robinson [16], and Elenius and Blomberg [4]. They all achieved better results on the phoneme level using an ANN-HMM hybrid approach, as shown in […]. As can be seen from table 4, the framewise errors are quite high for noisy input sequences (several microphones or enriched with background noise) as opposed to clean speech.…”
Section: Results (mentioning)
Confidence: 95%
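
The "ANN-HMM hybrid approach" referred to in the statement above combines a network's frame-level phone posteriors with HMM decoding, conventionally by dividing the posteriors by the phone priors to obtain scaled likelihoods used as emission scores. A minimal sketch of that conversion follows; the array inputs are assumed for illustration and are not code from the cited papers.

import numpy as np

def scaled_log_likelihoods(posteriors, priors, floor=1e-8):
    # posteriors: (num_frames, num_phones) network outputs P(phone | frame)
    # priors:     (num_phones,) relative phone frequencies from the training data
    # Returns log[P(phone | frame) / P(phone)], a per-frame emission score
    # proportional (up to the frame probability) to the log-likelihood used in
    # Viterbi decoding.
    posteriors = np.maximum(posteriors, floor)
    priors = np.maximum(priors, floor)
    return np.log(posteriors) - np.log(priors)
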
“…Retraining both significantly reduced both time costs and training set size and improved recognition results. An extrapolation based on previous work on read speech [16,3,14,4] promises significant additional improvements on the phoneme level through a BLSTM-HMM hybrid, which we are currently implementing.…”
Section: Discussion (mentioning)
Confidence: 99%
“…Recent improvements to the ABBOT system include training of the recurrent networks for effective use of the SI284 training corpus [2], and local speaker-adaptation approaches [12], while application of state-based context-dependent phone modelling is planned for the near future.…”
Section: Discussion (mentioning)
Confidence: 99%