Reduced representation genome-sequencing approaches based on restriction digestion are enabling large-scale marker generation and facilitating genomic studies in a wide range of model and nonmodel systems. However, sampling chromosomes based on restriction digestion may introduce a bias in allele frequency estimation due to polymorphisms in restriction sites. To explore the effects of this nonrandom sampling and its sensitivity to different evolutionary parameters, we developed a coalescent-simulation framework to mimic the biased recovery of chromosomes in restriction-based short-read sequencing experiments (RADseq). We analysed simulated DNA sequence datasets and compared known values from simulations with those that would be estimated using a RADseq approach from the same samples. We compare these 'true' and 'estimated' values of commonly used summary statistics, π, θ(w), Tajima's D and F(ST). We show that loci with missing haplotypes have estimated summary statistic values that can deviate dramatically from true values and are also enriched for particular genealogical histories. These biases are sensitive to nonequilibrium demography, such as bottlenecks and population expansion. In silico digests with 102 completely sequenced Drosophila melanogaster genomes yielded results similar to our findings from coalescent simulations. Though the potential of RADseq for marker discovery and trait mapping in nonmodel systems remains undisputed, our results urge caution when applying this technique to make population genetic inferences.
Whole genome duplication (WGD) is a major factor in the evolution of multicellular eukaryotes, yet by doubling the number of homologs, WGD severely challenges reliable chromosome segregation [1, 2, 3], a process conserved across kingdoms [4]. Despite this, numerous genome-duplicated (polyploid) species persist in nature, indicating early problems can be overcome [1, 2]. Little is known about which genes are involved – only one has been molecularly characterized [5]. To gain new insights into the molecular basis of adaptation to polyploidy, we investigated genome-wide patterns of differentiation between natural diploids and tetraploids of Arabidopsis arenosa, an outcrossing relative of A. thaliana [6, 7]. We first show that diploids are not preadapted to polyploid meiosis. We then use a genome scanning approach to show that while polymorphism is extensively shared across ploidy levels, there is strong ploidy-specific differentiation in 39 regions spanning 44 genes. These are discrete, mostly single-gene peaks of sharply elevated differentiation. Among these peaks are eight meiosis genes whose encoded proteins coordinate a specific subset of early meiotic functions, suggesting these genes comprise a polygenic solution to WGD-associated chromosome segregation challenges. Our findings indicate that even conserved meiotic processes can be capable of nimble evolutionary shifts when required.
Ploidy-variable species allow direct inference of the effects of chromosome copy number on fundamental evolutionary processes. While an abundance of theoretical work suggests polyploidy should leave distinct population genomic signatures, empirical data remains sparse. We sequenced ~300 individuals from 39 populations of Arabidopsis arenosa, a naturally diploid-autotetraploid species. We find the impacts of polyploidy on population genomic processes are subtle yet pervasive, including reduced efficiency on linked and purifying selection as well as rampant gene flow from diploids. Initial masking of deleterious mutations, faster rates of nucleotide substitution, and interploidy introgression all conspire to shape the evolutionary potential of polyploids.
Genome duplication, which results in polyploidy, is disruptive to fundamental biological processes. Genome duplications occur spontaneously in a range of taxa and problems such as sterility, aneuploidy, and gene expression aberrations are common in newly formed polyploids. In mammals, genome duplication is associated with cancer and spontaneous abortion of embryos. Nevertheless, stable polyploid species occur in both plants and animals. Understanding how natural selection enabled these species to overcome early challenges can provide important insights into the mechanisms by which core cellular functions can adapt to perturbations of the genomic environment. Arabidopsis arenosa includes stable tetraploid populations and is related to well-characterized diploids A. lyrata and A. thaliana. It thus provides a rare opportunity to leverage genomic tools to investigate the genetic basis of polyploid stabilization. We sequenced the genomes of twelve A. arenosa individuals and found signatures suggestive of recent and ongoing selective sweeps throughout the genome. Many of these are at genes implicated in genome maintenance functions, including chromosome cohesion and segregation, DNA repair, homologous recombination, transcriptional regulation, and chromatin structure. Numerous encoded proteins are predicted to interact with one another. For a critical meiosis gene, ASYNAPSIS1, we identified a non-synonymous mutation that is highly differentiated by cytotype, but present as a rare variant in diploid A. arenosa, indicating selection may have acted on standing variation already present in the diploid. Several genes we identified that are implicated in sister chromatid cohesion and segregation are homologous to genes identified in a yeast mutant screen as necessary for survival of polyploid cells, and also implicated in genome instability in human diseases including cancer. This points to commonalities across kingdoms and supports the hypothesis that selection has acted on genes controlling genome integrity in A. arenosa as an adaptive response to genome doubling.
Whole-genome duplication, which leads to polyploidy, has been implicated in speciation and biological novelty. In plants, many species exhibit ploidy variation, which is likely representative of an early stage in the evolution of polyploid lineages. To understand the evolution of such multiploidy systems, we must address questions such as whether polyploid lineage(s) had a single or multiple origins, whether admixture occurs between ploidies, and the timescale over which ploidy variation affects the evolution of populations. Here we analyze three genomic data sets using nonparametric and parametric analyses, including coalescent-based methods, to study the evolutionary history of a geographically widespread autotetraploid variant of Arabidopsis arenosa, a new model system for understanding the molecular basis of autopolyploid evolution. Autotetraploid A. arenosa populations are widely distributed across much of Northern and Central Europe, whereas diploids occur in Eastern Europe and along the southern Baltic coast; the two ploidies overlap in the Carpathian Mountains. We find that the widespread autotetraploid populations we sampled likely arose from a single ancestral population approximately 11,000-30,000 generations ago in the Northern Carpathians, where its closest extant diploid relatives are found today. Afterward, the tetraploid population split into at least four major lineages that colonized much of Europe. Reconstructions of population history suggest that substantial interploidy admixture occurred in both directions, but only among geographically proximal populations. We find two cases in which selection likely acted on an introgressed locus, suggesting that persistent interploidy gene flow has a local influence on patterns of genetic variation in A. arenosa.
Serpentine barrens represent extreme hazards for plant colonists. These sites are characterized by high porosity leading to drought, lack of essential mineral nutrients, and phytotoxic levels of metals. Nevertheless, nature forged populations adapted to these challenges. Here, we use a population-based evolutionary genomic approach coupled with elemental profiling to assess how autotetraploid Arabidopsis arenosa adapted to a multichallenge serpentine habitat in the Austrian Alps. We first demonstrate that serpentine-adapted plants exhibit dramatically altered elemental accumulation levels in common conditions, and then resequence 24 autotetraploid individuals from three populations to perform a genome scan. We find evidence for highly localized selective sweeps that point to a polygenic, multitrait basis for serpentine adaptation. Comparing our results to a previous study of independent serpentine colonizations in the closely related diploid Arabidopsis lyrata in the United Kingdom and United States, we find the highest levels of differentiation in 11 of the same loci, providing candidate alleles for mediating convergent evolution. This overlap between independent colonizations in different species suggests that a limited number of evolutionary strategies are suited to overcome the multiple challenges of serpentine adaptation. Interestingly, we detect footprints of selection in A. arenosa in the context of substantial gene flow from nearby off-serpentine populations of A. arenosa, as well as from A. lyrata. In several cases, quantitative tests of introgression indicate that some alleles exhibiting strong selective sweep signatures appear to have been introgressed from A. lyrata. This finding suggests that migrant alleles may have facilitated adaptation of A. arenosa to this multihazard environment.adaptation | plant | gene flow | population genomics S erpentine barrens offer powerful venues for the study of multitrait adaptations. Soils at these sites feature dramatically skewed elemental contents, phytotoxic levels of heavy metals, drought risk, and very poor mineral nutrition (1-3). A defining characteristic of serpentine soils is a greatly reduced Ca:Mg ratio along with low K, N, and P, resulting in severe ion homeostasis challenges for plant colonists (4-6). Serpentine soils are also highly porous and thus chronically drought prone. As a result of these challenges, serpentine barrens are characterized by minimal ecosystem productivity and high rates of endemism (reviewed in refs. 2 and 3). Evolution has nevertheless repeatedly forged plant populations that overcome these hazards, making serpentine sites an important natural model for ecology, evolution, and physiology. Given the quantifiable challenges of serpentine adaptation presented by strongly skewed elemental levels and dehydration risk, adapted populations present a valuable opportunity to identify loci underlying adaptations important for understanding basic evolutionary processes, as well as candidate genes for rational crop design for tolerance of ...
Many bacterial species are composed of multiple lineages distinguished by extensive variation in gene content. These often co-circulate in the same habitat, but the evolutionary and ecological processes that shape these complex populations are poorly understood. Addressing these questions is particularly important for Streptococcus pneumoniae, a nasopharyngeal commensal and respiratory pathogen, as the changes in population structure associated with the recent introduction of partial-coverage vaccines have significantly reduced pneumococcal disease. Here we show pneumococcal lineages from multiple populations each have a distinct combination of intermediate frequency genes. Functional analysis suggested these loci were likely subject to negative frequency-dependent selection (NFDS) through interactions with other bacteria, hosts, or mobile elements. Correspondingly, these genes had similar frequencies in four populations with dissimilar lineage compositions. These frequencies were maintained following substantial alterations in lineage prevalences once vaccination programmes began. Fitting a multilocus NFDS model of post-vaccine population dynamics to three genomic datasets using Approximate Bayesian Computation generated reproducible estimates of the influence of NFDS on pneumococcal evolution, the strength of which varied between loci. Simulations replicated the stable frequency of lineages unperturbed by vaccination, patterns of serotype switching, and clonal replacement. This framework highlights how bacterial ecology affects the impact of clinical interventions.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.