2004
DOI: 10.1155/s1110865704308048
|View full text |Cite
|
Sign up to set email alerts
|

Stochastic Feature Transformation with Divergence-Based Out-of-Handset Rejection for Robust Speaker Verification

Abstract: The performance of telephone-based speaker verification systems can be severely degraded by linear and nonlinear acoustic distortion caused by telephone handsets. This paper proposes to combine a handset selector with stochastic feature transformation to reduce the distortion. Specifically, a Gaussian mixture model (GMM)-based handset selector is trained to identify the most likely handset used by the claimants, and then handset-specific stochastic feature transformations are applied to the distorted feature v… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2005
2005
2010
2010

Publication Types

Select...
6

Relationship

2
4

Authors

Journals

citations
Cited by 16 publications
(8 citation statements)
references
References 27 publications
(47 reference statements)
0
8
0
Order By: Relevance
“…The graph structure was motivated by invariance against the affine feature distortion model for cepstral features (e.g. [151,155]). The method requires further development to validate the assumptions of the feature distortion model and to improve computational efficiency.…”
Section: Feature Normalizationmentioning
confidence: 99%
“…The graph structure was motivated by invariance against the affine feature distortion model for cepstral features (e.g. [151,155]). The method requires further development to validate the assumptions of the feature distortion model and to improve computational efficiency.…”
Section: Feature Normalizationmentioning
confidence: 99%
“…In this work, the feature transformation was combined with a handset selector (Tsang et al, 2002;Mak et al, 2004) for robust speaker verification. Specifically, before verification takes place, we compute one set of transformation parameters for each type of handsets that claimants are likely to use.…”
Section: Stochastic Feature Transformation and Handset Identificationmentioning
confidence: 99%
“…Similar to our previous work Mak et al, 2004;Yiu et al, 2003;Tsang et al, 2002), we trained a personalized 32-center GMM to model the characteristics of each client speaker in the system. 1 The feature vectors derived from the SA and SX sentence sets of the corresponding speaker were used for training, i.e., 7 sentences per GMM.…”
Section: Enrollment Proceduresmentioning
confidence: 99%
See 1 more Smart Citation
“…However, this approach may not be practical because users may use a new handset, which is not well represented in the training set, during verification. While this problem can be partially resolved by using a handset classifier with out-of-handset rejection capability [8,9], it is difficult to find a threshold for detecting unseen handsets. On the other hand, unsupervised (blind) compensation does not assume any knowledge of the channel characteristics.…”
Section: Introductionmentioning
confidence: 99%