2002
DOI: 10.1016/s0167-6393(01)00058-9
|View full text |Cite
|
Sign up to set email alerts
|

Connectionist speech recognition of Broadcast News

Abstract: This paper describes connectionist techniques for recognition of Broadcast News. The fundamental difference between connectionist systems and more conventional mixture-of-Gaussian systems is that connectionist models directly estimate posterior probabilities as opposed to likelihoods. Access to posterior probabilities has enabled us to develop a number of novel approaches to confidence estimation, pronunciation modelling and search. In addition we have investigated a new feature extraction technique based on t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

2
33
0

Year Published

2005
2005
2020
2020

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 41 publications
(37 citation statements)
references
References 28 publications
2
33
0
Order By: Relevance
“…A variety of different architectures and training algorithms have been proposed in the literature (see the comprehensive survey in [37]). Among these techniques, the ones most relevant to this work are those that use the ANNs to estimate the HMM stateposterior probabilities [38]- [45], which have been referred to as ANN-HMM hybrid models in the literature. In these ANN-HMM hybrid architectures, each output unit of the ANN is trained to estimate the posterior probability of a continuous density HMMs' state given the acoustic observations.…”
Section: A Previous Work Using Neural Network Acoustic Modelsmentioning
confidence: 99%
See 2 more Smart Citations
“…A variety of different architectures and training algorithms have been proposed in the literature (see the comprehensive survey in [37]). Among these techniques, the ones most relevant to this work are those that use the ANNs to estimate the HMM stateposterior probabilities [38]- [45], which have been referred to as ANN-HMM hybrid models in the literature. In these ANN-HMM hybrid architectures, each output unit of the ANN is trained to estimate the posterior probability of a continuous density HMMs' state given the acoustic observations.…”
Section: A Previous Work Using Neural Network Acoustic Modelsmentioning
confidence: 99%
“…ANN-HMMs were later extended to model context-dependent phones and were applied to mid-vocabulary and some large vocabulary ASR tasks (e.g. in [45], which also employed recurrent neural architectures). However, in earlier work on context dependent ANN-HMM hybrid architectures [46], the posterior probability of the context-dependent phone was modeled as either…”
Section: A Previous Work Using Neural Network Acoustic Modelsmentioning
confidence: 99%
See 1 more Smart Citation
“…This training set was augmented with those sentences from the Hub-4 Broadcast News and Switchboard language model training corpora which had a low perplexity with respect to the initial language model and the language model reestimated. We used a pronunciation dictionary containing around 10,000 words derived from the training data with pronunciations obtained from the SPRACH broadcast news system [Robinson et al 2002], plus 1,000 new words with pronunciations mainly constructed following the rules used to construct the broadcast news dictionary. The OOV rates were 1.6% on test42 and 2.0% on test50.…”
Section: The Voicemail Corpusmentioning
confidence: 99%
“…During the 1990s hybrid systems achieved good experimental results on some large vocabulary tasks [6,7]. However, HMM-GMM systems became increasingly more accurate owing to the use various techniques such as contextdependent phone modelling [8] speaker adaptation using MLLR [9], sequence-level discriminative training techniques such as MMI and MPE [10], and high-dimension feature space transforms such as fMPE [11], which either took advantage of the GMM structure or were much more computationally feasible for GMM-based systems compared with hybrid systems.…”
Section: Introductionmentioning
confidence: 99%