We present version 6 of the DNA Sequence Polymorphism (DnaSP) software, a new version of the popular tool for performing exhaustive population genetic analyses on multiple sequence alignments. This major upgrade incorporates novel functionalities to analyze large data sets, such as those generated by high-throughput sequencing technologies. Among other features, DnaSP 6 implements: 1) modules for reading and analyzing data from genomic partitioning methods, such as RADseq or hybrid enrichment approaches, 2) faster methods scalable for high-throughput sequencing data, and 3) summary statistics for the analysis of multi-locus population genetics data. Furthermore, DnaSP 6 includes novel modules to perform single- and multi-locus coalescent simulations under a wide range of demographic scenarios. The DnaSP 6 program, with extensive documentation, is freely available at http://www.ub.edu/dnasp.
High quality reference genomes are crucial to understanding genome function, structure and evolution. The availability of reference genomes has allowed us to start inferring the role of genetic variation in biology, disease, and biodiversity conservation. However, analyses across organisms demonstrate that a single reference genome is not enough to capture the global genetic diversity present in populations. In this work, we generate 32 high-quality reference genomes for the well-known model species D. melanogaster and focus on the identification and analysis of transposable element variation as they are the most common type of structural variant. We show that integrating the genetic variation across natural populations from five climatic regions increases the number of detected insertions by 58%. Moreover, 26% to 57% of the insertions identified using long-reads were missed by short-reads methods. We also identify hundreds of transposable elements associated with gene expression variation and new TE variants likely to contribute to adaptive evolution in this species. Our results highlight the importance of incorporating the genetic variation present in natural populations to genomic studies, which is essential if we are to understand how genomes function and evolve.
How loss of genetic exchanges (recombination) evolves between sex chromosomes is a long-standing question. Suppressed recombination may evolve when a sexually antagonistic (SA) polymorphism occurs in a partially sex-linked 'pseudoautosomal' region (or 'PAR'), maintaining allele frequency differences between the two sexes, and creating selection for closer linkage with the fully sex-linked region of the Y chromosome in XY systems, or the W in ZW sex chromosome systems. Most evidence consistent with the SA polymorphism hypothesis is currently indirect, and more studies of the genetics and population genetics of PAR genes are clearly needed. The sex chromosomes of the plant Silene latifolia are suitable for such studies, as they evolved recently and the loss of recombination could still be ongoing. Here, we used RAD sequencing to genetically map sequences in this plant, which has a large genome (c. 3 gigabases) and no available whole-genome sequence. We mapped 83 genes on the sex chromosomes, and comparative mapping in the related species S. vulgaris supports previous evidence for additions to an ancestral PAR and identified at least 12 PAR genes. We describe evidence that recombination rates have been reduced in meiosis of both sexes, and differences in recombination between S. latifolia families suggest ongoing recombination suppression. Large allele frequency differences between the sexes were found at several loci closely linked to the PAR boundary, and genes in different regions of the PAR showed striking sequence diversity patterns that help illuminate the evolution of the PAR.
Transposable elements (TEs) are ubiquitous sequences in genomes of virtually all species. While TEs have been investigated for several decades, only recently we have the opportunity to study their genome‐wide population dynamics. Most of the studies so far have been restricted either to the analysis of the insertions annotated in the reference genome or to the analysis of a limited number of populations. Taking advantage of the European Drosophila population genomics consortium (DrosEU) sequencing data set, we have identified and measured the dynamics of TEs in a large sample of European Drosophila melanogaster natural populations. We showed that the mobilome landscape is population‐specific and highly diverse depending on the TE family. In contrast with previous studies based on SNP variants, no geographical structure was observed for TE abundance or TE divergence in European populations. We further identified de novo individual insertions using two available programs and, as expected, most of the insertions were present at low frequencies. Nevertheless, we identified a subset of TEs present at high frequencies and located in genomic regions with a high recombination rate. These TEs are candidates for being the target of positive selection, although neutral processes should be discarded before reaching any conclusion on the type of selection acting on them. Finally, parallel patterns of association between the frequency of TE insertions and several geographical and temporal variables were found between European and North American populations, suggesting that TEs can be potentially implicated in the adaptation of populations across continents.
Drosophila melanogaster is a leading model in population genetics and genomics, and a growing number of whole-genome datasets from natural populations of this species have been published over the last years. A major challenge is the integration of disparate datasets, often generated using different sequencing technologies and bioinformatic pipelines, which hampers our ability to address questions about the evolution of this species. Here we address these issues by developing a bioinformatics pipeline that maps pooled sequencing (Pool-Seq) reads from D. melanogaster to a hologenome consisting of fly and symbiont genomes and estimates allele frequencies using either a heuristic (PoolSNP) or a probabilistic variant caller (SNAPE-pooled). We use this pipeline to generate the largest data repository of genomic data available for D. melanogaster to date, encompassing 271 previously published and unpublished population samples from over 100 locations in > 20 countries on four continents. Several of these locations have been sampled at different seasons across multiple years. This dataset, which we call Drosophila Evolution over Space and Time (DEST), is coupled with sampling and environmental meta-data. A web-based genome browser and web portal provide easy access to the SNP dataset. We further provide guidelines on how to use Pool-Seq data for model-based demographic inference. Our aim is to provide this scalable platform as a community resource which can be easily extended via future efforts for an even more extensive cosmopolitan dataset. Our resource will enable population geneticists to analyze spatio-temporal genetic patterns and evolutionary dynamics of D. melanogaster populations in unprecedented detail.
DNA sequence diversity in genes in the partially sex-linked pseudoautosomal region (PAR) of the sex chromosomes of the plant Silene latifolia is higher than expected from within-species diversity of other genes. This could be the footprint of sexually antagonistic (SA) alleles that are maintained by balancing selection in a PAR gene (or genes) and affect polymorphism in linked genome regions. SA selection is predicted to occur during sex chromosome evolution, but it is important to test whether the unexpectedly high sequence polymorphism could be explained without it, purely by the combined effects of partial linkage with the sex-determining region and the population's demographic history, including possible introgression from Silene dioica. To test this, we applied approximate Bayesian computation-based model choice to autosomal sequence diversity data, to find the most plausible scenario for the recent history of S. latifolia and then to estimate the posterior density of the most relevant parameters. We then used these densities to simulate variation to be expected at PAR genes. We conclude that an excess of variants at high frequencies at PAR genes should arise in S. latifolia populations only for genes with strong associations with fully sex-linked genes, which requires closer linkage with the fully sex-linked region than that estimated for the PAR genes where apparent deviations from neutrality were observed. These results support the need to invoke selection to explain the S. latifolia PAR gene diversity, and encourage further work to test the possibility of balancing selection due to sexual antagonism.
The establishment of a region of suppressed recombination is a critical change during sex chromosome evolution, leading to such properties as Y (and W) chromosome genetic degeneration, accumulation of repetitive sequences and heteromorphism. Although chromosome inversions can cause large regions to have suppressed recombination, and inversions are sometimes involved in sex chromosome evolution, gradual expansion of the non-recombining region could potentially sometimes occur. We here test whether closer linkage has recently evolved between the sex-determining region and several genes that are partially sex-linked in Silene latifolia, using Silene dioica, a closely related dioecious plants whose XY sex chromosome system is inherited from a common ancestor. The S. latifolia pseudoautosomal region (PAR) includes several genes extremely closely linked to the fully Y-linked region. These genes were added to an ancestral PAR of the sex chromosome pair in two distinct events probably involving translocations of autosomal genome regions causing multiple genes to become partially sex-linked. Close linkage with the PAR boundary must have evolved since these additions, because some genes added in both events now show almost complete sex linkage in S. latifolia. We compared diversity patterns of five such S. latifolia PAR boundary genes with their orthologues in S. dioica, including all three regions of the PAR (one gene that was in the ancestral PAR and two from each of the added regions). The results suggest recent recombination suppression in S. latifolia, since its split from S. dioica.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.