2013
DOI: 10.1002/ece3.722
|View full text |Cite
|
Sign up to set email alerts
|

Why to account for finite sites in population genetic studies and how to do this with Jaatha 2.0

Abstract: With the advent of next-generation sequencing technologies, large data sets of several thousand loci from multiple conspecific individuals are available. Such data sets should make it possible to obtain accurate estimates of population genetic parameters, even for complex models of population history. In the analyses of large data sets, it is difficult to consider finite-sites mutation models (FSMs). Here, we use extensive simulations to demonstrate that the inclusion of FSMs is necessary to avoid severe biase… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
15
0

Year Published

2014
2014
2023
2023

Publication Types

Select...
7

Relationship

2
5

Authors

Journals

citations
Cited by 17 publications
(17 citation statements)
references
References 79 publications
1
15
0
Order By: Relevance
“…In particular, two robust predictions concerning the demography emerge from our results. First, we estimate a surprisingly ancient split time between northern and southern Sweden, about 153 kya (124–182 kya), which is older than a previous estimate of approximately 14 kya close to the beginning of our current warm period ( François et al 2008 ), but more similar to estimates for the split time among Spanish and Italian A. thaliana of 83 kya ( Mathew et al 2013 ). This old split time does not depend on data preprocessing (such as subsampling) and it is not specific to our best-fitting model, but robust across various model topologies.…”
Section: Discussionsupporting
confidence: 46%
See 1 more Smart Citation
“…In particular, two robust predictions concerning the demography emerge from our results. First, we estimate a surprisingly ancient split time between northern and southern Sweden, about 153 kya (124–182 kya), which is older than a previous estimate of approximately 14 kya close to the beginning of our current warm period ( François et al 2008 ), but more similar to estimates for the split time among Spanish and Italian A. thaliana of 83 kya ( Mathew et al 2013 ). This old split time does not depend on data preprocessing (such as subsampling) and it is not specific to our best-fitting model, but robust across various model topologies.…”
Section: Discussionsupporting
confidence: 46%
“…Furthermore, the branch to A. lyrata is long enough, so that recurrent mutations are nonnegligible. This in turn might lead to biases when estimating population genetic parameters ( Mathew et al 2013 ). To avoid this, we based our inference on the folded jSFS.…”
Section: Methodsmentioning
confidence: 99%
“…If it is common, neglecting recurrent mutation can bias inferences of mutation rate, population size, and selection (Desai and Plotkin 2008;Mathew et al 2013). Applying our present theory thus requires that the mutation rate be high enough to create a substantial number of triallelic sites for inference, but not so high that a large fraction of biallelic or triallelic sites are affected by recurrent mutation.…”
Section: Correlated Fitness At Triallelic Sites 515mentioning
confidence: 91%
“…In our simulation study we consider the JSFS J of two populations P 1 and P 2 (e.g., J [x, y] = z means that there are z positions in our aligned data that are found in x samples of in P 1 and in y samples in P 2 ). As already extensively tested in Tellier et al (2011) , we further coarsen the JSFS into Jaatha’s default set of summary statistics (SS) based upon frequency classes (for a description see Naduvilezhath et al, 2011 ; Tellier et al, 2011 ; Mathew et al, 2013 ; and an example in Supplementary Figure S1 ), which has been shown to perform well when estimating neutral demographic models. These SS divide the high and low frequency variants into single frequency classes and the middle frequency variants into fewer classes, resulting in 23 frequency classes in total.…”
Section: Introductionmentioning
confidence: 99%
“…Although it has been well-described that selection and demography may result in similar patterns of diversity, the ability to jointly estimate these two processes has remained elusive. Here, we use simulation to explore the utility of the joint site frequency spectrum to estimate selection and demography simultaneously, including developing an extension of the previously proposed Jaatha program ( Mathew et al, 2013 ). We evaluate both complete and incomplete selective sweeps under an isolation-with-migration model with and without population size change (both population growth and bottlenecks).…”
mentioning
confidence: 99%