1999
DOI: 10.1109/89.759036
|View full text |Cite
|
Sign up to set email alerts
|

A comparison of speaker identification results using features based on cepstrum and Fourier-Bessel expansion

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
29
0

Year Published

2005
2005
2015
2015

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 45 publications
(29 citation statements)
references
References 4 publications
0
29
0
Order By: Relevance
“…However, unlike the sinusoidal basis functions in the Fourier series, the Bessel functions decay over time. This feature of the Bessel functions make the FB series expansion suitable for nonstationary signals [15,[17][18][19][20][21][22][23].…”
Section: Fourier-bessel Series Expansionmentioning
confidence: 99%
“…However, unlike the sinusoidal basis functions in the Fourier series, the Bessel functions decay over time. This feature of the Bessel functions make the FB series expansion suitable for nonstationary signals [15,[17][18][19][20][21][22][23].…”
Section: Fourier-bessel Series Expansionmentioning
confidence: 99%
“…In the literature, some models have been proposed for representing speech features [1,[3][4][5][6][17][18][19]. In this paper, four well-known models, Linear Predict Coding Cepstrum (LPCC) [1], Fourier Transform Cepstral Coefficients Test Utterance (Sec)…”
Section: Comparison With Existing Methodsmentioning
confidence: 99%
“…In addition to the above three most commonly used feature extractions, there are other special speech features extracted often for speaker identification. Gopalan et al [5] propose a compact representation for speech using Bessel functions because of the similarity between voiced speech and Bessel functions. It has been shown that the features obtained from the Fourier-Bessel expansion of speech are comparable to the cepstral features in representing the spectral energy.…”
Section: Introductionmentioning
confidence: 99%
“…Three speech analysis models that are based on short-term spectrum of speech and use concepts of the psychophysics of hearing, such as the critical-band spectral resolution, the equal-loudness curve and the intensity-loudness power law to derive an estimate of the auditory spectrum [19], have been selected to produce three versions of the proposed speech quality measure (See Section 3.1 for details). The first version of the measure (Version I) utilises a 5 th order Perceptual Linear Prediction (PLP) model [24], the second version (Version II) utilises a 17 th order Bark Spectrum (BS) analysis model [7], and the third version (Version III) utilises a 13 th order Mel-Frequency Cepstrum Coefficients (MFCC) [25]. This selection was also based on the abilities of these speech analysis models in suppressing speaker-dependent information, as investigated in Section 3.2.…”
Section: The Proposed Output-based Speech Quality Measurementioning
confidence: 99%
“…The mel-frequency cepstrum analysis model [25] is a perceptual-based speech analysis motivated by the observation that the human auditory system perceives information based on the energy in a band of frequencies rather than that at a single frequency, and by the fact that most signals can be described in terms of source-filter model. The model is widely accepted as a standard in the speech technology field for a number of challenging tasks, including speech recognition and speaker identification.…”
Section: Mel-frequency Cepstrum Analysismentioning
confidence: 99%