2015
DOI: 10.1007/978-3-319-13728-5_31
|View full text |Cite
|
Sign up to set email alerts
|

Singer Identification Using MFCC and LPC Coefficients from Indian Video Songs

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

1
2
0

Year Published

2015
2015
2020
2020

Publication Types

Select...
2
2
2

Relationship

0
6

Authors

Journals

citations
Cited by 8 publications
(3 citation statements)
references
References 9 publications
1
2
0
Order By: Relevance
“…After experimenting with different combinations of these features, we have come to the conclusion that the combination of MFCC and LPC works best for the task of language identification. This claim is also supported in [32][33] where this combination has proved to be very effective in improving the performance of the model. Moreover, we have found from our experiments that this combination outperforms some new feature extraction techniques like i-vector, x-vector, fusion of DWT and MFCC feature warping and combination of MFCC with GFCC.…”
Section: B Feature Extractionsupporting
confidence: 55%
“…After experimenting with different combinations of these features, we have come to the conclusion that the combination of MFCC and LPC works best for the task of language identification. This claim is also supported in [32][33] where this combination has proved to be very effective in improving the performance of the model. Moreover, we have found from our experiments that this combination outperforms some new feature extraction techniques like i-vector, x-vector, fusion of DWT and MFCC feature warping and combination of MFCC with GFCC.…”
Section: B Feature Extractionsupporting
confidence: 55%
“…It can be seen from the picture that each person's voice is very different in the feature expression. The MFCC is widely used as a token of speech information because it conforms to the auditory awareness of the human ear [2]. …”
Section: Mfcc Feature Extractionmentioning
confidence: 99%
“…The pre‐processing should be able to reveal the key‐features of phonemes, in order to exploit the capabilities of the classification phase [1]. The most widely used features for speech recognition, and also applied for different tasks involving speech and music signals, are the mel‐frequency cepstral coefficients (MFCCs) [2]. The MFCC are based on the linear model of voice production and a psycho‐acoustic frequency mapping according to the mel scale [1].…”
Section: Introductionmentioning
confidence: 99%