2013 International Conference on Pattern Recognition, Informatics and Mobile Engineering 2013
DOI: 10.1109/icprime.2013.6496469
|View full text |Cite
|
Sign up to set email alerts
|

A novel approach for speech feature extraction by Cubic-Log compression in MFCC

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
5
0

Year Published

2014
2014
2022
2022

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 13 publications
(5 citation statements)
references
References 11 publications
0
5
0
Order By: Relevance
“…The average classification accuracy (%) for various feature size (M) are listed in Table 3, where M = 16 gives the best result by the proposed scheme. The comparison results with MFCC (mel frequency ceptral coefficients) [13] features using the same training setup are presented in Table 4. The parameters for the MFCC are set as follows: MFCC window length = 20 ms (320 samples), number of MFCC features = 12, MFCC window overlapping = 50%.…”
Section: Food Anticipation 100mentioning
confidence: 99%
“…The average classification accuracy (%) for various feature size (M) are listed in Table 3, where M = 16 gives the best result by the proposed scheme. The comparison results with MFCC (mel frequency ceptral coefficients) [13] features using the same training setup are presented in Table 4. The parameters for the MFCC are set as follows: MFCC window length = 20 ms (320 samples), number of MFCC features = 12, MFCC window overlapping = 50%.…”
Section: Food Anticipation 100mentioning
confidence: 99%
“…Typical values for N and M are N ¼ 256. Depending upon sampling frequency rate have to construct the frame (Devi & Ravichandran, 2013;Martin & Juliet, 2010)…”
mentioning
confidence: 99%
“…MFCC coefficients are a set of DCT decorrelated parameters, which are computed through a transformation of the logarithmically compressed filter-output energies, derived through a perceptually spaced triangular filter bank that processes the Discrete Fourier Transformed (DFT) speech signal. In cubic-log compression MFCC process instead of logtithmic energy in standard MFCC process, Cubic-log energy at each of the mel frequencies is taken [18].…”
Section: Fig 1: Speech Recognition System Modelmentioning
confidence: 99%
“…There are several methods has been proposed for feature extraction from speech signals like Perceptually Based Linear Predictive Analysis (PLP) [7], Linear Discriminant Analysis (LDA) [8], Linear Predictive Coding (LPC) Analysis [9] and Mel-Frequency Cepstrum Coefficients (MFCC) [10]- [18].…”
Section: Introductionmentioning
confidence: 99%