Abstract:Understanding the impacts of current human activities on within-species genetic variation requires a thorough description of the historical factors that have shaped the genomic and geographical distribution of nucleotide diversity. Past and current conditions influencing effective population size have important evolutionary implications for the efficacy of selection, increased accumulation of deleterious mutations, and loss of adaptive potential under the nearly neutral theory. Here, we gather extensive genome… Show more
“…To integrate more complex demographic scenarios, several recent studies considered interesting approaches, by assuming demographic models including two classes of N e along the genome, one for neutral loci and one for loci under linked selection. The proportion of the two classes and the ratio of N e between them were estimated together with other parameters of the demographic model, using either ABC Bernatchez, 2018, Roux et al, 2016) or a modification (Rougemont et al, 2020, Rougeux et al, 2017 of the diffusion approach implemented in the software ∂a∂i (Gutenkunst et al, 2009). The models described in the present study propose another direction for the development of demographic inference methods by accounting for linked selection through variable classes of N e along the genome, and using the IICR as summary statistic.…”
The relative contribution of selection and neutrality in shaping species genetic diversity is one of the most central and controversial questions in evolutionary theory. Genomic data provide growing evidence that linked selection, i.e. the modification of genetic diversity at neutral sites through linkage with selected sites, might be pervasive over the genome. Several studies proposed that linked selection could be modelled as first approximation by a local reduction (e.g. purifying selection, selective sweeps) or increase (e.g. balancing selection) of effective population size (Ne). At the genome-wide scale, this leads to a large variance of Ne from one region to another, reflecting the heterogeneity of selective constraints and recombination rates between regions. We investigate here the consequences of this variation of Ne on the genome-wide distribution of coalescence times. The underlying motivation concerns the impact of linked selection on demographic inference, because the distribution of coalescence times is at the heart of several important demographic inference approaches. Using the concept of Inverse Instantaneous Coalescence Rate, we demonstrate that in a panmictic population, linked selection always results in a spurious apparent decrease of Ne along time. Balancing selection has a particularly large effect, even when it concerns a very small part of the genome. We quantify the expected magnitude of the spurious decrease of Ne in humans and Drosophila melanogaster, based on Ne distributions inferred from real data in these species. We also find that the effect of linked selection can be significantly reduced by that of population structure.
“…To integrate more complex demographic scenarios, several recent studies considered interesting approaches, by assuming demographic models including two classes of N e along the genome, one for neutral loci and one for loci under linked selection. The proportion of the two classes and the ratio of N e between them were estimated together with other parameters of the demographic model, using either ABC Bernatchez, 2018, Roux et al, 2016) or a modification (Rougemont et al, 2020, Rougeux et al, 2017 of the diffusion approach implemented in the software ∂a∂i (Gutenkunst et al, 2009). The models described in the present study propose another direction for the development of demographic inference methods by accounting for linked selection through variable classes of N e along the genome, and using the IICR as summary statistic.…”
The relative contribution of selection and neutrality in shaping species genetic diversity is one of the most central and controversial questions in evolutionary theory. Genomic data provide growing evidence that linked selection, i.e. the modification of genetic diversity at neutral sites through linkage with selected sites, might be pervasive over the genome. Several studies proposed that linked selection could be modelled as first approximation by a local reduction (e.g. purifying selection, selective sweeps) or increase (e.g. balancing selection) of effective population size (Ne). At the genome-wide scale, this leads to a large variance of Ne from one region to another, reflecting the heterogeneity of selective constraints and recombination rates between regions. We investigate here the consequences of this variation of Ne on the genome-wide distribution of coalescence times. The underlying motivation concerns the impact of linked selection on demographic inference, because the distribution of coalescence times is at the heart of several important demographic inference approaches. Using the concept of Inverse Instantaneous Coalescence Rate, we demonstrate that in a panmictic population, linked selection always results in a spurious apparent decrease of Ne along time. Balancing selection has a particularly large effect, even when it concerns a very small part of the genome. We quantify the expected magnitude of the spurious decrease of Ne in humans and Drosophila melanogaster, based on Ne distributions inferred from real data in these species. We also find that the effect of linked selection can be significantly reduced by that of population structure.
“…2019) and hence enable their persistence and rapid redeployment in heterogeneous environments despite high gene flow (Lowry & Willis 2010; Sinclair-Waters et al 2017; Todesco et al 2019). In addition, scans for adaptive variants need to be interpreted in the light of variation of recombination along the genome (Stapley et al 2017) since it shapes the local extent of diversity and of differentiation (Cutter & Payseur 2013; Tine et al 2014; Burri et al 2015; Rougemont et al 2019). Lastly, genome scans results need to be interpreted in the light of the demographic history of populations.…”
Understanding the genomic processes underlying local adaptation is a central aim of modern evolutionary biology. This task requires identifying footprints of local selection but also estimating spatio-temporal variation of populations’ demography and variation in recombination rate and diversity along the genome. Here, we investigated these parameters in blue tit populations inhabiting neighbouring deciduous and evergreen forests and populations in an insular versus a continental context. Close populations from deciduous and evergreen habitats were weakly genetically differentiated (FST = 0.004 on average), nevertheless with a significant effect of habitat type on the overall genetic structure. This low differentiation was consistent with the large effective population sizes (from 43,000 to 463,000) and the strong and long-lasting gene flow inferred by demographic modeling. In turn, insular and continental populations were moderately differentiated (FST = 0.08 on average), which was consistent with the inference of moderate ancestral migrations followed by isolation since the end of the last glaciation. Weak and non-parallel footprints of divergent selection among deciduous and evergreen populations were consistent with their demography and the probable polygenic nature of local adaptation in these habitats. This contrasted with stronger outlier regions, more often in regions of low recombination, found between insular and continental populations. Lastly, we identified a genomic inversion on the continent, spanning 2.8Mb. These results provide insights into the demographic history and genetic architecture of local adaptation in blue tit populations at multiple geographic scales.
“…Additionally, FST values between pairs of population samples (pairwise FST, Reynolds et al 1983) are routinely used to estimate population structure in human genetics (Pérez-Lezaun et al 1997;Ramachandran et al, 2005), conservation biology (Palsbøll et al 2007;Schwartz et al 2007), and evolutionary biology and ecology (e.g., Hemmer-Hansen et al 2013a;Therkildsen et al 2013a;Geraldes et al 2014;McKown et al 2014a;Jorde et al 2015;Rougemont et al 2020). Divergent selection in an environmental gradient can influence population structure (Nosil et al 2009;Orsini et al 2013), and researchers have examined geographic distance and habitat differences between populations as explanatory variables that impact population structure estimated based on genome-wide (average over loci) pairwise FST values (e.g., Rousset 1997;Bradbury and Bentzen 2007;Petrou et al 2014;Jorde et al 2015;Kitada et al 2017).…”
mentioning
confidence: 99%
“…However, genome-wide population-specific FST is new among biologists. Despite the expected usefulness of genome-wide population-specific FST in evolutionary biology (Weir and Goudet 2017), applications have been sparse to date (e.g., Nicholson et al 2002;Weir et al 2005;Foll and Gaggiotti 2006;Buckleton et al 2016;Rougemont et al 2020).…”
Appropriate estimates of population structure are the basis of population genetics, with applications varying from evolutionary and conservation biology to association mapping and forensic identification. The common procedure is to first compute Wright’s FST over all samples (global FST) and then routinely estimate between-population FST values (pairwise FST). An alternative approach for estimating population differentiation is the use of population-specific FST measures. Here, we characterize population-specific FST and pairwise FST estimators by analyzing publicly available human, Atlantic cod and wild poplar data sets. The bias-corrected moment estimator of population-specific FST identified the source population and traced the migration and evolutionary history of its derived populations by way of genetic diversity, whereas the bias-corrected moment estimator of pairwise FST was found to represent current population structure. Generally, the first axis of multi-dimensional scaling for the pairwise FST distance matrix reflected population history, while subsequent axes indicated migration events, languages and the effect of environment. The relative contributions of these factors were dependent on the ecological characters of the species. Given shrinkage towards mean allele frequencies, maximum likelihood and Bayesian estimators of locus-specific global FST improved the power to detect genes under environmental selection. In contrast, bias-corrected moment estimators of global FST measured species divergence and enabled reliable interpretation of population structure. The genomic data highlight the usefulness of the bias-corrected moment estimators of FST. The R package FinePop2_ver.0.2 for computing these FST estimators is available at CRAN.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.