Data-driven environmental compensation for speech recognition: A unified approach

Moreno, Pedro J.; Raj, Bhiksha; Stern, Richard M.

doi:10.1016/s0167-6393(98)00025-9

Cited by 53 publications

(21 citation statements)

References 30 publications

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…This represents one of the most commonly used techniques for additive noise suppression and removal of channel distortion respectively. We also evaluated a feature compensation method, Vector Taylor Series (VTS) [5] for performance comparison where the noisy speech GMM is adaptively estimated using the Expectation-Maximization (EM) algorithm over each test utterance [5]. The Advanced Front-End (AFE) algorithm developed by ETSI was also evaluated as one state-of-the-art method, which contains an iterative Wiener filter and blind equalization [12].…”

Section: Methodsmentioning

confidence: 99%

“…In addition, the acoustic model employed by the feature reconstruction methods also should match the acoustic model (i.e., Hidden Markov Model) of the speech recognizer in terms of the training database and the recording condition to provide the best speech recognition performance. Many feature reconstruction methods employ acoustic model such as Gaussian Mixture Model (GMM) [5] [6].…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Speech Enhancement Based on Feature Reconstruction for Automatic Speech Recognition System with Unknown Structure

Kim¹

2017

IJISSE

View full text Add to dashboard Cite

show abstract

Section: Methodsmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

Speech Enhancement Based on Feature Reconstruction for Automatic Speech Recognition System with Unknown Structure

Kim¹

2017

IJISSE

View full text Add to dashboard Cite

show abstract

“…The use of a single vector r can only compensate for convolutional noise in the feature domain. In [50], a method called multivaRiate gAussian-based cepsTral normaliZation (RATZ) is proposed to use multiple correction vectors. In RATZ, the clean feature space is modeled by a GMM.…”

Section: Data-driven Feature Compensationmentioning

confidence: 99%

“…The STAR algorithm of Moreno [50] is closely related to the RATZ feature compensation algorithm described in section 3.2. Feature compensation methods usually have a model adaptation counterpart.…”

Section: Starmentioning

confidence: 99%

“…However, unlike RATZ which uses a separate GMM for the prior distribution of clean speech, STAR utilizes the HMM. STAR estimates the correcting terms, μ k and ∑ k , for the 256 Gaussians using the same way as RATZ, and then compensates the clean mean and variance vectors to approximate the noisy speech distribution [50]. As these Gaussians are shared by all HMM models, once they are compensated, all the HMM states are adapted.…”

Section: Starmentioning

confidence: 99%

See 1 more Smart Citation

Features and Model Adaptation Techniques for Robust Speech Recognition: A Review

Legoh¹,

Bhattacharjee²,

Tuithung³

2015

CAE

View full text Add to dashboard Cite

In this paper, major speech features used in state-of-the-art technology in speech recognition research are reviewed. Also a brief review of major technological advancements during last few decades and a trend towards development of robust speech recognition system in terms of feature and model adaptation techniques is given. It has been the dream of researchers to develop a machine that recognizes speech and understands natural language like human but the reality is that the performance of the speech recognition system drastically degrades due to various adverse conditions like noise, variability in speaker, channel, device and mismatches in training and testing. This paper may be useful as a tutorial and review on state-of-the-art techniques for feature selection, feature normalization and model adaptation techniques for development of robust speech recognition system.

show abstract

Bayesian Noise Compensation of Time Trajectories of Spectral Coefficients for Robust Speech Recognition

Potamitis

Fakotakis

Kokkinakis

2001

Text, Speech and Dialogue

View full text Add to dashboard Cite

Data-driven environmental compensation for speech recognition: A unified approach

Cited by 53 publications

References 30 publications

Speech Enhancement Based on Feature Reconstruction for Automatic Speech Recognition System with Unknown Structure

Speech Enhancement Based on Feature Reconstruction for Automatic Speech Recognition System with Unknown Structure

Features and Model Adaptation Techniques for Robust Speech Recognition: A Review

Bayesian Noise Compensation of Time Trajectories of Spectral Coefficients for Robust Speech Recognition

Contact Info

Product

Resources

About