18th International Conference on Pattern Recognition (ICPR'06) 2006
DOI: 10.1109/icpr.2006.1071
|View full text |Cite
|
Sign up to set email alerts
|

Speaker Verification Using A Novel Set of Dynamic Features

Abstract: Dynamic cepstral features such as delta and deltadelta cepstra have been shown to play an essential role in capturing the transitional characteristics of the speech signal. In this paper, a set of new dynamic features for speaker verification system are introduced. These new features, known as Delta Cepstral Energy (DCE) and Delta-Delta Cepstral Energy (DDCE), can compactly represent the information in the delta and delta-delta cepstra. Further, it is shown theoretically that DCE carries the same information a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
3
0

Year Published

2008
2008
2014
2014

Publication Types

Select...
4
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(4 citation statements)
references
References 5 publications
0
3
0
Order By: Relevance
“…Given that the speech signal has been sliced into M frames, and each frame contains N samples, we can conduct a frame based on noisy speech model from (8) Where y(m,n) denotes the noisy speech signal, x(m,n) denotes the clean speech signal, h(m,n) represents the convolutional noise, and w(m,n) is the additive noise. Hereon, we not only assume that the additive noise is stationary and is uncorrelated with the speech, but also asswne the power spectrum of the convolutional noise is stationary or changes considerably slow.…”
Section: IImentioning
confidence: 99%
See 1 more Smart Citation
“…Given that the speech signal has been sliced into M frames, and each frame contains N samples, we can conduct a frame based on noisy speech model from (8) Where y(m,n) denotes the noisy speech signal, x(m,n) denotes the clean speech signal, h(m,n) represents the convolutional noise, and w(m,n) is the additive noise. Hereon, we not only assume that the additive noise is stationary and is uncorrelated with the speech, but also asswne the power spectrum of the convolutional noise is stationary or changes considerably slow.…”
Section: IImentioning
confidence: 99%
“…As we all know, Mel-scale frequency cepstrum coefficients (MFCCs) curr ently are commonly used features in speaker recognition systems, Furthermore, dynamic cepstral features such as delta and delta-delta cepstra have been shown to play an essential role in capturing the transitional characteristics of the speech signal. So, LlMFCC, Ll 2 MFCC [7], and other related features such as delta cepstral energy (DCE) [8] and delta-delta cepstral energy (DDCE) have also been introduced into the speaker recognition systems. Since MFCC values are not very robust in the presence of noise, researchers propose various modifications to the basic MFCC to improve robustness.…”
Section: Introducrionmentioning
confidence: 99%
“…coefficients [2,9], octave-based spectral contrast (OSC) [3,4], etc. Generally, the transitional information between short-term features of two neighboring frames, including delta and delta-delta values, have been proven useful in speech recognition system [10,11].…”
Section: Introductionmentioning
confidence: 99%
“…In [Kumar et al, 2011], it is shown that the addition of delta-cepstral features to the static 13-dimensional MFCC features improves speech recognition accuracy, and a further (smaller) improvement is provided by the addition of double-delta cepstral. It is also reported in [Nosratighods et al, 2006] that the short-term dynamic features such as delta and delta-delta coefficients can be used to improve speech and speaker verification system by modelling the short-term transitional information in the speech.…”
Section: Dynamic Featuresmentioning
confidence: 99%