2006 International Conference on Computing &Amp; Informatics 2006
DOI: 10.1109/icoci.2006.5276486
|View full text |Cite
|
Sign up to set email alerts
|

Mel-frequency cepstral coefficient analysis in speech recognition

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
20
0
1

Year Published

2010
2010
2023
2023

Publication Types

Select...
4
2
2

Relationship

0
8

Authors

Journals

citations
Cited by 41 publications
(21 citation statements)
references
References 1 publication
0
20
0
1
Order By: Relevance
“…In this paper, we use ResNet [26] as our CNN model, the input of the CNN model is mel-frequency cepstral coefficient spectrum (MFCC) [19] of songs, including 500 frames in the time dimension and 12 frequency-bins in the frequency dimension. The output vectors are the 20-dimensional predicted latent feature vector of songs.…”
Section: Music Feature Extractionmentioning
confidence: 99%
“…In this paper, we use ResNet [26] as our CNN model, the input of the CNN model is mel-frequency cepstral coefficient spectrum (MFCC) [19] of songs, including 500 frames in the time dimension and 12 frequency-bins in the frequency dimension. The output vectors are the 20-dimensional predicted latent feature vector of songs.…”
Section: Music Feature Extractionmentioning
confidence: 99%
“…Human ear has been proven to resolve frequencies non-linearly across the audio spectrum, thus filter bank analysis is more desirable because it is spectrally based method [8]. Since MFCC fully simulate the human auditory characteristics without any assumptions, it has been widely used in the field of speech recognition.…”
Section: A Selection Of Speech Featuresmentioning
confidence: 99%
“…Backpropagation neural networks have broad application in classification, approximation, prediction, and control aspects. Based on biological analogy, neural networks try to emulate the human brain's ability to learn from examples or incomplete data and especially to generalize concepts [8]. For the excellent ability to classification of BP NN, it is considered as the classifier in this paper.…”
Section: Endpoint Detection Algorithm Based On Bp Nnmentioning
confidence: 99%
“…Mel-Frequency Cepstral coefficients: MFC analysis has been a popular signal representation method used in many audio classification tasks, especially in speech recognition systems [18]. The basis for the mel-frequency scale is derived from the human perceptual system.…”
Section: Feature Extractionmentioning
confidence: 99%