2021
DOI: 10.48550/arxiv.2108.05080
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

FakeAVCeleb: A Novel Audio-Video Multimodal Deepfake Dataset

Abstract: While the significant advancements have made in the generation of deepfakes using deep learning technologies, its misuse is a well-known issue now. Deepfakes can cause severe security and privacy issues as they can be used to impersonate a person's identity in a video by replacing his/her face with another person's face. Recently, a new problem of generating synthesized human voice of a person is emerging, where AI-based deep learning models can synthesize any person's voice requiring just a few seconds of aud… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
28
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
3

Relationship

1
6

Authors

Journals

citations
Cited by 13 publications
(28 citation statements)
references
References 38 publications
(109 reference statements)
0
28
0
Order By: Relevance
“…The number of faces in each frame is more than one. Recently, FakeAVCeleb [31] was released focusing on both face-swap and face-reenactment methods with manipulated audio and video. ForgeryNet [23] is the latest contribution to the growing list of deepfake detection datasets.…”
Section: Related Workmentioning
confidence: 99%
“…The number of faces in each frame is more than one. Recently, FakeAVCeleb [31] was released focusing on both face-swap and face-reenactment methods with manipulated audio and video. ForgeryNet [23] is the latest contribution to the growing list of deepfake detection datasets.…”
Section: Related Workmentioning
confidence: 99%
“…Inspired by the emergence of DeepFakes algorithm to the public, various methods, i.e., FaceSwap [26], NeuralTextures [7], Face2Face [24], and FSGAN [13], have been proposed to synthesize hyper-realistic deepfake images that are unrecognizable to human eyes. These methods allowed to generate numerous deepfake datasets [8,12,16] for public usage in the research community. Furthermore, Wav2Lips [15] has shown a lip-synchronization network, generating lip-syncing arbitrary talking face videos with arbitrary speech.…”
Section: Related Workmentioning
confidence: 99%
“…In this work, we used FaceForensics++ [16] C40, a compressed version of original FaceForensics++, and FakeAVCeleb [8] to train each model and assess the models on each dataset. The number of fake/real images used in each dataset is provided in Table 2.…”
Section: Datasetsmentioning
confidence: 99%
See 1 more Smart Citation
“…Overdub, iSpeech, and VoiceApp are instances of voice cloning open-access platforms that can generate synthesized deepfake sounds that nearly resemble the target human's speech [3]. The work of [4] is an example of these manipulation methods, which involves the creation of highly realistic deepfake videos with a precise lip-sync using a group of AI technologies; FaceSwap, FaceSwap GAN, DeepFaceLab, SV2TTS [5], and Wav2Lip [6].…”
Section: Introductionmentioning
confidence: 99%