2016
DOI: 10.1109/tmm.2016.2571999
|View full text |Cite
|
Sign up to set email alerts
|

Audio Recapture Detection With Convolutional Neural Networks

Abstract: In this work, we investigate how features can be effectively learned by deep neural networks for audio forensic problems. By providing a preliminary feature preprocessing based on Electric Network Frequency (ENF) analysis, we propose a convolutional neural network (CNN) for training and classification of genuine and recaptured audio recordings. Hierarchical representations which contain levels of details of the ENF components are learned from the deep neural networks and can be used for further classification.… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
13
0

Year Published

2016
2016
2022
2022

Publication Types

Select...
5
3
1
1

Relationship

0
10

Authors

Journals

citations
Cited by 44 publications
(15 citation statements)
references
References 22 publications
0
13
0
Order By: Relevance
“…ENF is appied for audio recapture detection. Lin et al [26] takes ENF spectrogram as the convolutional neural network input for audio recapture detection. 3.…”
Section: Detection Methods Based On Deep Featuresmentioning
confidence: 99%
“…ENF is appied for audio recapture detection. Lin et al [26] takes ENF spectrogram as the convolutional neural network input for audio recapture detection. 3.…”
Section: Detection Methods Based On Deep Featuresmentioning
confidence: 99%
“…Since many decades, machine learning and neural network methods have been successfully employed in a wide range of speech and audio processing applications, such as automatic speech recognition (ASR) [38]- [41], audio forensic [42], music information retrieval [43], [44], sound classification [45]. However, their use for the improvement or the new design of multichannel processing localization schemes has been explored only recently [25], [26], [46], [47].…”
Section: B Machine Learning Methods For Multichannel Processingmentioning
confidence: 99%
“…CNNs can be exploited in order to learn features emerging from ENF audio recordings. A CNN-based system using spectrograms for audio recapture detection was proposed in [45].…”
Section: Related Workmentioning
confidence: 99%