2007
DOI: 10.1109/maes.2007.327534
|View full text |Cite
|
Sign up to set email alerts
|

Speaker Recognition The A TVS-UAM System at NIST SRE 05

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2007
2007
2018
2018

Publication Types

Select...
3
2
1

Relationship

2
4

Authors

Journals

citations
Cited by 6 publications
(4 citation statements)
references
References 13 publications
0
4
0
Order By: Relevance
“…Actually the system is the sum fusion of two SVM systems, one using 19 MFCC coefficients plus deltas and the other using SDC-MFCCs (7-2-3-7) [14]. In order to avoid channel mismatch effects, Cepstral Mean Normalization is applied, followed by RASTA filtering and feature mapping [15]. Both systems use a kernel expansion on the whole observation sequence, and a separating hyperplane is computed between the target language features and the background model.…”
Section: Svm Sytems With Mfcc and Sdc-mfcc Featuresmentioning
confidence: 99%
“…Actually the system is the sum fusion of two SVM systems, one using 19 MFCC coefficients plus deltas and the other using SDC-MFCCs (7-2-3-7) [14]. In order to avoid channel mismatch effects, Cepstral Mean Normalization is applied, followed by RASTA filtering and feature mapping [15]. Both systems use a kernel expansion on the whole observation sequence, and a separating hyperplane is computed between the target language features and the background model.…”
Section: Svm Sytems With Mfcc and Sdc-mfcc Featuresmentioning
confidence: 99%
“…Feature extraction obtains 19 MFCC coefficients plus deltas. In order to avoid channel mismatch effects, cepstral mean normalization is applied, followed by RASTA filtering and feature mapping (see [4] for details). The similarity computation is based on SVC [3].…”
Section: Baseline Systemmentioning
confidence: 99%
“…Speaker verification has been dominated in the last decade by systems working at the spectral level of the speaker identity [1]. Techniques like Gaussian Mixture Models (GMM) [2] or Support Vector Machines (SVM) using Generalized Linear Discriminant Sequence (GLDS) kernels [3] have demonstrated its superiority to higher level approaches [1,4]. In recent years, hybrid approaches such as GMM-SVM systems [5] and channel compensation techniques like factor analysis [6] or nuisance attribute projection [7] have led to a significant improvement of the state-of-the-art performance.…”
Section: Introductionmentioning
confidence: 99%
“…Some spectral characteristics in the speech signal such as formant distribution and variation are related to speaker-dependent characteristics. Systems exploiting acoustic information are based on the short-term spectral identity information and this spectral information may be analyzed in order to recognize the speaker identity [1]. In present rapid development and during putting automated systems into existence they may be applied also in the devices for speaker recognition, for instance authorised check in, banking via telephone, and in the telephone credit cards [2].The goal of speaker recognition is to automatically extract information from speech signal.…”
Section: Introductionmentioning
confidence: 99%