Proceedings of the 11th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics 2020
DOI: 10.1145/3388440.3412475
|View full text |Cite
|
Sign up to set email alerts
|

Population-scale Genomic Data Augmentation Based on Conditional Generative Adversarial Networks

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 8 publications
(5 citation statements)
references
References 29 publications
(22 reference statements)
0
5
0
Order By: Relevance
“…Chen et al. [47] developed a Population‐scale Genomic Data Augmentation based on conditional GAN (PG‐cGAN). The PG‐cGAN's task was to enhance the amount and diversity of genomic data by transforming current data instead of collecting new samples.…”
Section: Methodsmentioning
confidence: 99%
“…Chen et al. [47] developed a Population‐scale Genomic Data Augmentation based on conditional GAN (PG‐cGAN). The PG‐cGAN's task was to enhance the amount and diversity of genomic data by transforming current data instead of collecting new samples.…”
Section: Methodsmentioning
confidence: 99%
“…The generation of novel genomic data with the same statistical properties as the real databases could increase data accessibility immensely and accelerate research without breaching the privacy of biobank donors. In this context, GANs, VAEs, and their derivatives have recently been suggested for generating realistic human genome segments (26,(41)(42)(43)(44)(45). These models have learned not only the global population stratification in real datasets but also complex underlying structures, such as LD patterns along the genome, haplotype-based selection signals, and genomic local ancestry proportions; this indicates that they might be used as reliable second-best alternatives for real genomes in the future (26,41).…”
Section: Applications In Evolutionary Biology and Population Geneticsmentioning
confidence: 99%
“…These models have learned not only the global population stratification in real datasets but also complex underlying structures, such as LD patterns along the genome, haplotype-based selection signals, and genomic local ancestry proportions; this indicates that they might be used as reliable second-best alternatives for real genomes in the future (26,41). Furthermore, they can be conditioned on extra variables, such as population labels, to generate targeted genomes depending on the task (41,42). Finally, it was shown that the generated genomes could be good at preventing privacy leakage from genome donors in the training datasets, yet extensive research in this regard is still needed for further confirmation and improvements before these models can be applied in practical cases (26).…”
Section: Applications In Evolutionary Biology and Population Geneticsmentioning
confidence: 99%
“…However, the model has a large number of layers, and some features may be lost during forward propagation, which results in the features being acquired incompletely. Ren et al [39][40][41][42] added a skip-layer connection among the network layers, which can promote function re-use between layers and preserve useful information. Even if some of them are lost in training, the key features can be well retained.…”
Section: Related Workmentioning
confidence: 99%