2020
DOI: 10.1109/access.2020.3016097
|View full text |Cite
|
Sign up to set email alerts
|

Validating Seed Data Samples for Synthetic Identities – Methodology and Uniqueness Metrics

Abstract: This work explores the identity attribute of synthetic face samples derived from Generative Adversarial Networks. The goal is to determine if individual samples are unique in terms of identity, firstly with respect to the seed dataset that trains the GAN model and secondly with respect to other synthetic face samples. Two approaches are introduced to enable the comparative analysis of large sets of synthetic face samples. The first of these uses ROC curves to determine identity uniqueness using a number of lar… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

1
8
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
3
3
1

Relationship

3
4

Authors

Journals

citations
Cited by 26 publications
(10 citation statements)
references
References 25 publications
(42 reference statements)
1
8
0
Order By: Relevance
“…This indicates there is a higher density of lookalikes between Sy and Se than in a real population, which is here modeled by IJB-C. This is similar to the observation made in [32] using StyleGAN. In the article, they notice that this is caused by the presence of children in FFHQ, and thus also in StyleGAN's output distribution, while SOTA face recognition networks are typically not trained on faces of children and thus perform poorly on this population.…”
Section: Are Stylegan2 Identities New ?supporting
confidence: 82%
See 1 more Smart Citation
“…This indicates there is a higher density of lookalikes between Sy and Se than in a real population, which is here modeled by IJB-C. This is similar to the observation made in [32] using StyleGAN. In the article, they notice that this is caused by the presence of children in FFHQ, and thus also in StyleGAN's output distribution, while SOTA face recognition networks are typically not trained on faces of children and thus perform poorly on this population.…”
Section: Are Stylegan2 Identities New ?supporting
confidence: 82%
“…To assess the requirement of privacy, we need to verify that generated identities do not simply reproduce existing identities from the FFHQ dataset. We evaluate this by reproducing on our synthetic dataset an experiment originally proposed in [32] on the first version of StyleGAN. It consists in comparing the identity similarity between a synthetic dataset (Sy) and a seed dataset (Se), which is the dataset used to train the face generator.…”
Section: Are Stylegan2 Identities New ?mentioning
confidence: 99%
“…The closer an ROC curve is to unity, the better the performance of the FR model on the selected samples. More information regarding the ROC and its interpretation and use can be found in [40].…”
Section: Using Roc Curves As a Metricmentioning
confidence: 99%
“…The effect of these augmentations on the performance of a SoA FR method is quantified using Receiver Operating Characteristic curve (ROC) techniques. A similar approach was used recently to validate synthetic facial identities [40]. Note that a re-lighting augmentation approach was adopted as existing public datasets do not provide sufficient lighting variability.…”
Section: Introductionmentioning
confidence: 99%
“…The generator learns to distribute data from a real training dataset and the discriminator is trained to judge whether a sample is a real or a generated sample. The ultimate goal is to train the generator to produce high-quality synthetic images that can fool the discriminator [13,14,19]. At present, the StyleGAN is a representative of the most advanced GAN techniques and can produce sufficiently high-quality and photo-realistic samples with relatively low computation cost [13,20,21].…”
Section: Introductionmentioning
confidence: 99%