We identify a new natural coalescent structure, which we call the seed-bank coalescent, that describes the gene genealogy of populations under the influence of a strong seed-bank effect, where "dormant forms" of individuals (such as seeds or spores) may jump a significant number of generations before joining the "active" population. Mathematically, our seed-bank coalescent appears as scaling limit in a Wright-Fisher model with geometric seed-bank age structure if the average time of seed dormancy scales with the order of the total population size N . This extends earlier results of Kaj, Krone and Lascoux [J. Appl. Probab. 38 (2011) 285-300] who show that the genealogy of a Wright-Fisher model in the presence of a "weak" seed-bank effect is given by a suitably time-changed Kingman coalescent. The qualitatively new feature of the seed-bank coalescent is that ancestral lineages are independently blocked at a certain rate from taking part in coalescence events, thus strongly altering the predictions of classical coalescent models. In particular, the seed-bank coalescent "does not come down from infinity," and the time to the most recent common ancestor of a sample of size n grows like log log n. This is in line with the empirical observation that seed-banks drastically increase genetic variability in a population and indicates how they may serve as a buffer against other evolutionary forces such as genetic drift and selection.
We analyze patterns of genetic variability of populations in the presence of a large seedbank with the help of a new coalescent structure called the seedbank coalescent. This ancestral process appears naturally as a scaling limit of the genealogy of large populations that sustain seedbanks, if the seedbank size and individual dormancy times are of the same order as those of the active population. Mutations appear as Poisson processes on the active lineages and potentially at reduced rate also on the dormant lineages. The presence of "dormant" lineages leads to qualitatively altered times to the most recent common ancestor and nonclassical patterns of genetic diversity. To illustrate this we provide a Wright-Fisher model with a seedbank component and mutation, motivated from recent models of microbial dormancy, whose genealogy can be described by the seedbank coalescent. Based on our coalescent model, we derive recursions for the expectation and variance of the time to most recent common ancestor, number of segregating sites, pairwise differences, and singletons. Estimates (obtained by simulations) of the distributions of commonly employed distance statistics, in the presence and absence of a seedbank, are compared. The effect of a seedbank on the expected site-frequency spectrum is also investigated using simulations. Our results indicate that the presence of a large seedbank considerably alters the distribution of some distance statistics, as well as the site-frequency spectrum. Thus, one should be able to detect from genetic data the presence of a large seedbank in natural populations.KEYWORDS Wright-Fisher model; seedbank coalescent; dormancy; site-frequency spectrum; distance statistics M ANY microorganisms can enter reversible dormant states of low [respectively (resp.) zero] metabolic activity, for example when faced with unfavorable environmental conditions; see, e.g., Lennon and Jones (2011) for a recent overview of this phenomenon. Such dormant forms may stay inactive for extended periods of time and thus create a seedbank that should significantly affect the interplay of evolutionary forces driving the genetic variability of the microbial population. In fact, in many ecosystems, the percentage of dormant cells compared to the total population size is substantial and sometimes even dominant (for example, $20% in human gut, 40% in marine water, and 80% in soil; cf. Lennon and Jones 2011, box 1, table a). This abundance of dormant forms, which can be short-lived as well as stay inactive for significant periods of time (decades-or century-old spores are not uncommon), thus creates a seedbank that buffers against environmental change, but potentially also against classical evolutionary forces such as genetic drift, mutation, and selection.In this article, we investigate the effect of large seedbanks (that is, comparable to the size of the active population) on the patterns of genetic variability in populations over macroscopic timescales. In particular, we extend a recently introduced mathematical ...
We present a new model for seed banks, where direct ancestors of individuals may have lived in the near as well as the very far past. The classical Wright-Fisher model, as well as a seed bank model with bounded age distribution considered in Kaj, Krone and Lascoux (2001) are special cases of our model. We discern three parameter regimes of the seed bank age distribution, which lead to substantially different behaviour in terms of genetic variability, in particular with respect to fixation of types and time to the most recent common ancestor. We prove that, for age distributions with finite mean, the ancestral process converges to a time-changed Kingman coalescent, while in the case of infinite mean, ancestral lineages might not merge at all with positive probability. Furthermore, we present a construction of the forward-in-time process in equilibrium. The mathematical methods are based on renewal theory, the urn process introduced in Kaj, Krone and Lascoux (2001) as well as on a paper by Hammond and Sheffield (2013).
We introduce a Cannings model with directional selection via a paintbox construction and establish a strong duality with the line counting process of a new Cannings ancestral selection graph in discrete time. This duality also yields a formula for the fixation probability of the beneficial type. Haldane's formula states that for a single selectively advantageous individual in a population of haploid individuals of size N the probability of fixation is asymptotically (as N → ∞) equal to the selective advantage of haploids sN divided by half of the offspring variance. For a class of offspring distributions within Kingman attraction we prove this asymptotics for sequences sN obeying N −1 sN N −1/2 , which is a regime of "moderately weak selection". It turns out that for sN N −2/3 the Cannings ancestral selection graph is so close to the ancestral selection graph of a Moran model that a suitable coupling argument allows to play the problem back asymptotically to the fixation probability in the Moran model, which can be computed explicitly.
A two-types, discrete-time population model with finite, constant size is constructed, allowing for a general form of frequency-dependent selection and skewed offspring distribution. Selection is defined based on the idea that individuals first choose a (random) number of potential parents from the previous generation and then, from the selected pool, they inherit the type of the fittest parent. The probability distribution function of the number of potential parents per individual thus parametrises entirely the selection mechanism. Using duality, weak convergence is then proved both for the allele frequency process of the selectively weak type and for the population's ancestral process. The scaling limits are, respectively, a two-types Ξ-Fleming-Viot jump-diffusion process with frequency-dependent selection, and a branching-coalescing process with general branching and simultaneous multiple collisions. Duality also leads to a characterisation of the probability of extinction of the selectively weak allele, in terms of the ancestral process' ergodic properties.
We revisit the model by Wiser, Ribeck, and Lenski (Science 342 (2013), 1364-1367), which describes how the mean fitness increases over time due to beneficial mutations in Lenski's long-term evolution experiment. We develop the model further both conceptually and mathematically. Conceptually, we describe the experiment with the help of a Cannings model with mutation and selection, where the latter includes diminishing returns epistasis. The analysis sheds light on the growth dynamics within every single day and reveals a runtime effect, that is, the shortening of the daily growth period with increasing fitness; and it allows to clarify the contribution of epistasis to the mean fitness curve. Mathematically, we explain rigorous results in terms of a law of large numbers (in the limit of infinite population size and for a certain asymptotic parameter regime), and present approximations based on heuristics and supported by simulations for finite populations.ii
The Lenski experiment investigates the long-term evolution of bacterial populations. In this paper we present an individual-based probabilistic model that captures essential features of the experimental design, and whose mechanism does not include epistasis in the continuous-time (intraday) part of the model, but leads to an epistatic effect in the discrete-time (interday) part. We prove that under some assumptions excluding clonal interference, the rescaled relative fitness process converges in the large population limit to a power law function, similar to the one obtained by Wiser et al. (2013), there attributed to effects of clonal interference and epistasis.
We investigate the behaviour of the genealogy of a Wright-Fisher population model under the influence of a strong seed-bank effect. More precisely, we consider a simple seed-bank age distribution with two atoms, leading to either classical or long genealogical jumps (the latter modeling the effect of seed-dormancy). We assume that the length of these long jumps scales like a power N β of the original population size N , thus giving rise to a 'strong' seed-bank effect. For a certain range of β, we prove that the ancestral process of a sample of n individuals converges under a non-classical time-scaling to Kingman's n−coalescent. Further, for a wider range of parameters, we analyze the time to the most recent common ancestor of two individuals analytically and by simulation. MSC 2010 classification: 60K35, 92D15
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
334 Leonard St
Brooklyn, NY 11211
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.