Proceedings of the First ACM Workshop on Information Hiding and Multimedia Security 2013
DOI: 10.1145/2482513.2482520
|View full text |Cite
|
Sign up to set email alerts
|

Optimizing acoustic features for source cell-phone recognition using speech signals

Abstract: This paper presents comparison and optimization of acoustic features for source cell-phone recognition using recorded speech signals. Different acoustic feature extraction methods such as Mel-frequency, linear frequency and Bark frequency cepstral coefficients (MFCC, LFCC and BFCC) and linear prediction cepstral coefficients (LPCC) are considered. In addition to different feature sets, the effect of dynamic features, delta and double-delta coefficients (∆ and ∆ 2 ), and feature normalizations, cepstral mean no… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
9
0

Year Published

2014
2014
2021
2021

Publication Types

Select...
5
1
1

Relationship

1
6

Authors

Journals

citations
Cited by 17 publications
(9 citation statements)
references
References 31 publications
0
9
0
Order By: Relevance
“…We compare the source identification performances of these two different 100 acoustic feature sets. Detailed comparison of different feature extraction and normalization methods on source cell-phone identification can be found in [18].…”
mentioning
confidence: 99%
“…We compare the source identification performances of these two different 100 acoustic feature sets. Detailed comparison of different feature extraction and normalization methods on source cell-phone identification can be found in [18].…”
mentioning
confidence: 99%
“…It is observed from the detailed review of the existing Cepstral coefficients that the two parameters such as the conversion of frequency from linear scale to perceptual scale and the selection of cut-off frequencies are the determining factor for the performance of the cepstral analysis. This issue is addressed in the literature [13] , [14] , [15] , [16] , [17] , [18] , [19] . This problem has been further investigated in Section 5.2 .…”
Section: Related Workmentioning
confidence: 99%
“…The use of these features achieves faster convergence of the ANN model and improvement in recognition accuracy at different SNR levels. The source recognition of Cell-Phone is carried out using the optimization of different cepstral coefficients such as Mel, linear, and Bark frequency [13] . The use of minimum and maximum frequencies of MFCC and cepstral variance normalization has enhanced the identification rate to 96.85%.…”
Section: Introductionmentioning
confidence: 99%
“…A best accuracy of 96.42% was obtained on their close set. In [3], an SVM classifier was used to compare the performance of various acoustic features, including Mel-frequency, linear frequency, bark frequency cepstral coefficients (MFCCs, LFCCs and BFCCs, respectively) as well as the linear prediction cepstral coefficients (LPCCs). Their study showed that MFCCs achieved the best performance for source identification.…”
Section: Introductionmentioning
confidence: 99%