1998
DOI: 10.1016/s0167-6393(98)00025-9
|View full text |Cite
|
Sign up to set email alerts
|

Data-driven environmental compensation for speech recognition: A unified approach

Abstract: Environmental robustness for automatic speech recognition systems based on parameter modi®cation can be accomplished in two complementary ways. One approach is to modify the incoming features of environmentally-degraded speech to more closely resemble the features of the (normally undegraded) speech used to train the classi®er. The other approach is to modifying the internal statistical representations of speech features used by the classi®er to more closely resemble the features representing degraded speech i… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
18
0
3

Year Published

2001
2001
2017
2017

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 53 publications
(21 citation statements)
references
References 30 publications
0
18
0
3
Order By: Relevance
“…This represents one of the most commonly used techniques for additive noise suppression and removal of channel distortion respectively. We also evaluated a feature compensation method, Vector Taylor Series (VTS) [5] for performance comparison where the noisy speech GMM is adaptively estimated using the Expectation-Maximization (EM) algorithm over each test utterance [5]. The Advanced Front-End (AFE) algorithm developed by ETSI was also evaluated as one state-of-the-art method, which contains an iterative Wiener filter and blind equalization [12].…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…This represents one of the most commonly used techniques for additive noise suppression and removal of channel distortion respectively. We also evaluated a feature compensation method, Vector Taylor Series (VTS) [5] for performance comparison where the noisy speech GMM is adaptively estimated using the Expectation-Maximization (EM) algorithm over each test utterance [5]. The Advanced Front-End (AFE) algorithm developed by ETSI was also evaluated as one state-of-the-art method, which contains an iterative Wiener filter and blind equalization [12].…”
Section: Methodsmentioning
confidence: 99%
“…In addition, the acoustic model employed by the feature reconstruction methods also should match the acoustic model (i.e., Hidden Markov Model) of the speech recognizer in terms of the training database and the recording condition to provide the best speech recognition performance. Many feature reconstruction methods employ acoustic model such as Gaussian Mixture Model (GMM) [5] [6].…”
Section: Introductionmentioning
confidence: 99%
“…The use of a single vector r can only compensate for convolutional noise in the feature domain. In [50], a method called multivaRiate gAussian-based cepsTral normaliZation (RATZ) is proposed to use multiple correction vectors. In RATZ, the clean feature space is modeled by a GMM.…”
Section: Data-driven Feature Compensationmentioning
confidence: 99%
“…The STAR algorithm of Moreno [50] is closely related to the RATZ feature compensation algorithm described in section 3.2. Feature compensation methods usually have a model adaptation counterpart.…”
Section: Starmentioning
confidence: 99%
See 1 more Smart Citation