2016
DOI: 10.1007/s00521-016-2470-x
|View full text |Cite
|
Sign up to set email alerts
|

Enhancement of a text-independent speaker verification system by using feature combination and parallel structure classifiers

Abstract: Speaker Verification (SV) systems involve mainly two individual stages: feature extraction and classification. In this paper, we explore these two modules with the aim of improving the performance of a speaker verification system under noisy conditions. On the one hand, the choice of the most appropriate acoustic features is a crucial factor for performing robust speaker verification. The acoustic parameters used in the proposed system are: Mel Frequency Cepstral Coefficients (MFCC), their first and second der… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
5
0

Year Published

2019
2019
2025
2025

Publication Types

Select...
9
1

Relationship

2
8

Authors

Journals

citations
Cited by 12 publications
(5 citation statements)
references
References 51 publications
0
5
0
Order By: Relevance
“…Pitch frequency is a perceptual characteristic of the speech signal with physical properties denoted by F0 and is used to improve the performance of speaker identification [23,24]. Many studies use these features for speaker identification not only separately but also in combination [25][26][27][28]. In this paper, six different feature extraction approaches, namely Mel Frequency Cepstral Coefficients (MFCC)+Pitch, Gammatone Cepstral Coefficients (GTCC)+Pitch, MFCC+GTCC+Pitch+eight spectral features, spectrograms, i-vectors, and Alexnet feature vectors were used.…”
Section: Related Workmentioning
confidence: 99%
“…Pitch frequency is a perceptual characteristic of the speech signal with physical properties denoted by F0 and is used to improve the performance of speaker identification [23,24]. Many studies use these features for speaker identification not only separately but also in combination [25][26][27][28]. In this paper, six different feature extraction approaches, namely Mel Frequency Cepstral Coefficients (MFCC)+Pitch, Gammatone Cepstral Coefficients (GTCC)+Pitch, MFCC+GTCC+Pitch+eight spectral features, spectrograms, i-vectors, and Alexnet feature vectors were used.…”
Section: Related Workmentioning
confidence: 99%
“…It enables us to use many datasets that are present in the database during the training phase. There are various kinds of databases available in this field, and a list of the databases that are used in speaker recognition specifically for speaker identification and speaker verification domain [114,[168][169][170] are given in Table 11 [58,62,89,90,103,.…”
Section: Databases Used In Si and Svmentioning
confidence: 99%
“…SVM is a technique according to the theory of statistical learning applied to determine the decisive boundary via separating different classes and increasing the margin [28][29][30]. SVM is fit for non-linear data set problems and less number of training data but with huge number of input.…”
Section: Support Vector Machine (Svm)mentioning
confidence: 99%