2022
DOI: 10.1101/2022.07.06.499052
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Timesweeper: Accurately Identifying Selective Sweeps Using Population Genomic Time Series

Abstract: Despite decades of research, identifying selective sweeps, the genomic footprints of positive selection, remains a core problem in population genetics. Of the myriad methods that have been made towards this goal, few are designed to leverage the potential of genomic time-series data. This is because in most population genetic studies only a single, brief period of time can be sampled for a study. Recent advancements in sequencing technology, including improvements in extracting and sequencing ancient DNA, have… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
17
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
2

Relationship

2
3

Authors

Journals

citations
Cited by 6 publications
(17 citation statements)
references
References 96 publications
0
17
0
Order By: Relevance
“…One key aspect to make deep learning a popular framework in population genetics, is to ensure reproducible analyses and avoid repeating training of highly parameterized networks from scratch. In this context, recent efforts to provide users with documented workflows ( Whitehouse and Schrider 2022 ) and pre-trained networks ( Hamid et al 2022 ) will both reduce carbon footprint ( Grealey et al 2022 ) and facilitate the application of deep learning to a wider range of data sets, allowing users to modify the network’s parameters according to the specific requirements of the biological system under examination.…”
Section: Discussionmentioning
confidence: 99%
See 2 more Smart Citations
“…One key aspect to make deep learning a popular framework in population genetics, is to ensure reproducible analyses and avoid repeating training of highly parameterized networks from scratch. In this context, recent efforts to provide users with documented workflows ( Whitehouse and Schrider 2022 ) and pre-trained networks ( Hamid et al 2022 ) will both reduce carbon footprint ( Grealey et al 2022 ) and facilitate the application of deep learning to a wider range of data sets, allowing users to modify the network’s parameters according to the specific requirements of the biological system under examination.…”
Section: Discussionmentioning
confidence: 99%
“…In recent years, the generation of sequencing data from ancient or historical samples, as well as from capture-recapture and evolve-and-resequence experiments, has allowed for a direct observation of how genetic diversity and allele frequencies change under natural or controlled conditions over time. To detect positive selection with time-series data, Whitehouse and Schrider (2022) proposed to stack either allele frequency or haplotype data over sampling times to be fed as input to one-dimensional CNNs. Their method was implemented in the software , and evaluated under various sampling conditions.…”
Section: Deep Learning Algorithmsmentioning
confidence: 99%
See 1 more Smart Citation
“…unknown true demographic history, low data quality, etc). Thus, this approach presents the possibility of combining the strengths of reductionist generative models with those of GANs, and could potentially be used to produce more accurate training data for machine learning-based methods for detecting selective sweeps [17,19,20,69]. Alternatively, one could take alignments from empirical data and transfer various demographic hypotheses to those alignments to help generate the reference table for performing approximate Bayesian computation.…”
Section: Discussionmentioning
confidence: 99%
“…14]. In population genetics, deep-learning methods have been used for a number of purposes, such as detecting introgression [15]-including adaptive introgression [16]; distinguishing among various modes of natural selection as well as neutral evolution [17][18][19][20]; estimating parameters such as selection coefficients and recombination rates [15,21], dispersal distances [22], and population sizes [23]; and visualizing population structure [24].…”
Section: Introductionmentioning
confidence: 99%