2014 4th Joint Workshop on Hands-Free Speech Communication and Microphone Arrays (HSCMA) 2014
DOI: 10.1109/hscma.2014.6843278
|View full text |Cite
|
Sign up to set email alerts
|

Investigating stranded GMM for improving automatic speech recognition

Abstract: This paper investigates recently proposed Stranded Gaussian Mixture acoustic Model (SGMM) for Automatic Speech Recognition (ASR). This model extends conventional hidden Markov model (HMM-GMM) by explicitly introducing dependencies between components of the observation Gaussian mixture densities. The main objective of the paper is to experimentally study, how useful SGMM can be for dealing with data, which contains different sources of acoustic variability. First studied sources of variability are age and gende… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2014
2014
2017
2017

Publication Types

Select...
2
1

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 17 publications
0
2
0
Order By: Relevance
“…Each state density is modeled by 32 Gaussian components. The front-end is the same in all experiments described in the paper, and it consists of 13 Training on adult data provides the best results for adult speakers, but shows a weak performance on child speech. When child data are included in the training set, the performance improves on child, but degrades on adult subset.…”
Section: Baseline Asr System For Tidigits Taskmentioning
confidence: 99%
See 1 more Smart Citation
“…Each state density is modeled by 32 Gaussian components. The front-end is the same in all experiments described in the paper, and it consists of 13 Training on adult data provides the best results for adult speakers, but shows a weak performance on child speech. When child data are included in the training set, the performance improves on child, but degrades on adult subset.…”
Section: Baseline Asr System For Tidigits Taskmentioning
confidence: 99%
“…Another way of using classstructured GMM is to replace class-dependent mixture weights by Mixture Transition Matrices (MTMs) of the Stranded Gaussian Mixture Model (SGMM). SGMM is similar to conditional Gaussian model [11], which was recently extended, reformulated and investigated for robust ASR [12] and investigated for child data and non-stationary noise conditions [13]. MTMs explicitly define the dependencies between the components of the adjacent Gaussian mixture observation densities.…”
Section: Introductionmentioning
confidence: 99%