2007 IEEE Workshop on Automatic Speech Recognition &Amp; Understanding (ASRU) 2007
DOI: 10.1109/asru.2007.4430129
|View full text |Cite
|
Sign up to set email alerts
|

Broad phonetic class recognition in a Hidden Markov model framework using extended Baum-Welch transformations

Abstract: In many pattern recognition tasks, given some input data and a model, a probabilistic likelihood score is often computed to measure how well the model describes the data. Extended Baum-Welch (EBW) transformations are most commonly used as a discriminative technique for estimating parameters of Gaussian mixtures, though recently they have been used to derive a gradient steepness measurement to evaluate the quality of the model to match the distribution of the data. In this paper, we explore applying the EBW gra… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
7
0

Year Published

2008
2008
2016
2016

Publication Types

Select...
5

Relationship

3
2

Authors

Journals

citations
Cited by 5 publications
(7 citation statements)
references
References 14 publications
0
7
0
Order By: Relevance
“…The larger the value in T indicates that the gradient to adapt the initial model to the data is steeper and the data is much better explained by the updated modelλ j (C). These gradient steepness metrics were succesfully used in various speech recognition classification, segmentation and decoding tasks (see [7], [8]…”
Section: Linearization Of Ebw Gradient Steepnessmentioning
confidence: 99%
“…The larger the value in T indicates that the gradient to adapt the initial model to the data is steeper and the data is much better explained by the updated modelλ j (C). These gradient steepness metrics were succesfully used in various speech recognition classification, segmentation and decoding tasks (see [7], [8]…”
Section: Linearization Of Ebw Gradient Steepnessmentioning
confidence: 99%
“…In [5], we compared scoring b k (ot) using likelihood and the gradient metric given in Equation 6 for the recognition of Broad Phonetic Classes (BPC). Below, we discuss novel learning rate and model update methods in the EBW framework.…”
Section: Ebw Gradient Metricmentioning
confidence: 99%
“…In a phoneme recognition study [12], it was found that almost 80% of misclassified phonemes were within the same broad phonetic class: vowels/semi-vowels, nasals/flaps, stops, weak fricatives, strong fricatives, and closures/silence. Broad phonetic classes have been applied for improved phone recognition [13,14], and have been shown to be more robust in noise [14,15]. Broad phonetic classes have also been used in large vocabulary ASR to overcome the issue of data sparsity and robustness.…”
Section: Introductionmentioning
confidence: 99%