2021
DOI: 10.3390/info12070263
|View full text |Cite
|
Sign up to set email alerts
|

Identification of Fake Stereo Audio Using SVM and CNN

Abstract: The number of channels is one of the important criteria in regard to digital audio quality. Generally, stereo audio with two channels can provide better perceptual quality than mono audio. To seek illegal commercial benefit, one might convert a mono audio system to stereo with fake quality. Identifying stereo-faking audio is a lesser-investigated audio forensic issue. In this paper, a stereo faking corpus is first presented, which is created using the Haas effect technique. Two identification algorithms for fa… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
12
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
8
2

Relationship

0
10

Authors

Journals

citations
Cited by 25 publications
(19 citation statements)
references
References 29 publications
0
12
0
Order By: Relevance
“…Sanchez et al in [99] proposed a model based on the statistical classifier for synthetic speech and the MFCC was used as an authorized baseline. Lui et al in [100] developed a model for fake Stereo Audio detection where the classifier was the SVM, and the feature was MFCC. Based on their result, the MFCC can detect stereo-faking audio very effectively.…”
Section: Fake Speech Detectionmentioning
confidence: 99%
“…Sanchez et al in [99] proposed a model based on the statistical classifier for synthetic speech and the MFCC was used as an authorized baseline. Lui et al in [100] developed a model for fake Stereo Audio detection where the classifier was the SVM, and the feature was MFCC. Based on their result, the MFCC can detect stereo-faking audio very effectively.…”
Section: Fake Speech Detectionmentioning
confidence: 99%
“…The results show that RF performs best compared to SVM with a 71% accuracy result. In a similar way, [110] also used the H-Voice dataset and compared the effectiveness of SVM with the DL technique CNN to distinguish fake audio from actual stereo audio. The study discovered that the CNN is more resilient than the SVM, even though both obtained a high classification accuracy of 99%.…”
Section: Deepfake Audio Detection Techniquesmentioning
confidence: 99%
“…It is one of the most sophisticated technologies and is based on the fact that the crucial bandwidths of the human ear vary in frequency. The Mel-frequency scale, which is a linear frequency space below 1000 Hz and a logarithmic space above 1000 Hz, is used to show this information [24] Spectral Centroid (SC) SC (also known as brightness) represents the focal point in the spectral power distribution of a signal in a sample frame [25].…”
Section: Mel-frequency Cepstral Coefficients (Mfcc)mentioning
confidence: 99%