2013 IEEE International Conference on Acoustics, Speech and Signal Processing 2013
DOI: 10.1109/icassp.2013.6639204
|View full text |Cite
|
Sign up to set email alerts
|

Temporal filter design by minimum KL divergence criterion for robust speech recognition

Abstract: In this paper, we propose a new temporal filter design method based on minimum KL divergence criterion for robust recognition of noisy and reverberant speech. The main idea is to optimize the filter parameters by minimizing the KL divergence of two distributions, of which one is the feature distribution in the test environment, and another is the feature distribution represented by the acoustic model. The minimization of the KL divergence reduces the mismatch between the acoustic model and the test data. Exper… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
14
0

Year Published

2014
2014
2016
2016

Publication Types

Select...
2
2

Relationship

3
1

Authors

Journals

citations
Cited by 4 publications
(14 citation statements)
references
References 18 publications
0
14
0
Order By: Relevance
“…In temporal filtering, if we are using a finite impulse response (FIR) filter such as in TSN [22] and MNLLF [26], we have In linear transformation such as fMLLR [10,14], we have…”
Section: Generalized Transformmentioning
confidence: 99%
See 2 more Smart Citations
“…In temporal filtering, if we are using a finite impulse response (FIR) filter such as in TSN [22] and MNLLF [26], we have In linear transformation such as fMLLR [10,14], we have…”
Section: Generalized Transformmentioning
confidence: 99%
“…Hence, linear transformation uses inter-dimensional correlation information, while temporal filter uses inter-frame correlation information. In our previous study, we have shown that linear transformation and temporal filter are complementary and applying fMLLR after MNLLF [26] produces better results than the two techniques alone.…”
Section: Introductionmentioning
confidence: 97%
See 1 more Smart Citation
“…Linear transformation uses all dimensions of the current frame to predict new features that fit the acoustic model under the maximum-likelihood (ML) criterion [30,41]. On the other hand, temporal filtering uses the context information in neighboring frames to estimate features that fit the acoustic model [42][43][44][45]. While linear transformation uses inter-dimensional correlation information (or spectral information) to process features, temporal filtering uses inter-frame correlation information (or temporal information).…”
Section: Cross Transform Feature Adaptationmentioning
confidence: 99%
“…Similar to maximum normalized likelihood linear filtering in [45], the parameters of the cross transform can be estimated by minimizing an approximated KL divergence between the distribution of the processed features, p y , and the distribution of clean training features, p . In this work, p y is modelled by a single Gaussian and p by a GMM with parameter set = {c j , μ j , j |j = 1, .…”
Section: Cross Transform Feature Adaptationmentioning
confidence: 99%