1996
DOI: 10.1109/79.536825
|View full text |Cite
|
Sign up to set email alerts
|

Robust speaker recognition: a feature-based approach

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
89
0
6

Year Published

1996
1996
2016
2016

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 263 publications
(96 citation statements)
references
References 28 publications
1
89
0
6
Order By: Relevance
“…Next, the amplitude values are converted to mel-filter bank outputs, and the output from each filter is log-compressed and transformed via the discrete cosine transform to cepstral coefficients. The details of these feature extraction techniques may be found in [23][24][25].…”
Section: Feature Extractionmentioning
confidence: 99%
“…Next, the amplitude values are converted to mel-filter bank outputs, and the output from each filter is log-compressed and transformed via the discrete cosine transform to cepstral coefficients. The details of these feature extraction techniques may be found in [23][24][25].…”
Section: Feature Extractionmentioning
confidence: 99%
“…Only the envelope of the spectrum is of interest, hence to get a smoothing spectrum the LPC method is employed.A detail description of calculation of the method is given in our earlier work. [21] Here the LPC is used to estimate the main parameters of the signal. According to [22] the speech production model can be often called as linear production model or autoregressive model.…”
Section: Figure 1 Calculation Of Lpc-mfccsmentioning
confidence: 99%
“…To obtain the vectors of CMSCs the utterances were first converted to sequences of vectors of 12 LPC coefficients. These vectors were then converted to vectors of 12 LPC cepstral coefficients (LPCCs) and finally to the vectors of 12 CMSCs according to the formula [2] …”
Section: Speech Database and Feature Analysismentioning
confidence: 99%
“…The output energy of the filters were then transformed into vectors of 16 MFCCs using the discrete cosine transform [2].…”
Section: Speech Database and Feature Analysismentioning
confidence: 99%