The Drosophila Genome Nexus is a population genomic resource that provides D. melanogaster genomes from multiple sources. To facilitate comparisons across data sets, genomes are aligned using a common reference alignment pipeline which involves two rounds of mapping. Regions of residual heterozygosity, identity-by-descent, and recent population admixture are annotated to enable data filtering based on the user’s needs. Here, we present a significant expansion of the Drosophila Genome Nexus, which brings the current data object to a total of 1,121 wild-derived genomes. New additions include 305 previously unpublished genomes from inbred lines representing six population samples in Egypt, Ethiopia, France, and South Africa, along with another 193 genomes added from recently-published data sets. We also provide an aligned D. simulans genome to facilitate divergence comparisons. This improved resource will broaden the range of population genomic questions that can addressed from multi-population allele frequencies and haplotypes in this model species. The larger set of genomes will also enhance the discovery of functionally relevant natural variation that exists within and between populations.
The Drosophila Genome Nexus is a population genomic resource that provides D. melanogaster genomes from multiple sources. To facilitate comparisons across data sets, genomes are aligned using a common reference alignment pipeline which involves two rounds of mapping. Regions of residual heterozygosity, identity-by-descent, and recent population admixture are annotated to enable data filtering based on the user's needs. Here, we present a significant expansion of the Drosophila Genome Nexus, which brings the current data object to a total of 1,121 wild-derived genomes. New additions include 305 previously unpublished genomes from inbred lines representing six population samples in Egypt, Ethiopia, France, and South Africa, along with another 193 genomes added from recently-published data sets. We also provide an aligned D. simulans genome to facilitate divergence comparisons. This improved resource will broaden the range of population genomic questions that can addressed from multi-population allele frequencies and haplotypes in this model species. The larger set of genomes will also enhance the discovery of functionally relevant natural variation that exists within and between populations.
Unraveling the genetic architecture of adaptive phenotypic divergence is a fundamental quest in evolutionary biology. In Drosophila melanogaster, high-altitude melanism has evolved in separate mountain ranges in sub-Saharan Africa, potentially as an adaptation to UV intensity. We investigated the genetic basis of this melanism in three populations using a new bulk segregant analysis mapping method. We identified 19 distinct QTL regions from nine mapping crosses, with several QTL peaks overlapping between two or all populations, and yet different crosses involving the same melanic population commonly yielded distinct QTL. The strongest QTL often overlapped well-known pigmentation genes, but we typically did not find wide signals of genetic differentiation (FST) between lightly and darkly pigmented populations at these genes. Instead, we found small numbers of highly differentiated SNPs at the probable causative genes. A simulation analysis showed that these patterns of polymorphism were consistent with selection on standing genetic variation. Overall, our results suggest that, even for potentially simpler traits like pigmentation, the complexity of adaptive trait evolution poses important challenges for QTL mapping and population genetic analysis.
Identifying genomic targets of population-specific positive selection is a major goal in several areas of basic and applied biology. However, it is unclear how often such selection should act on new mutations versus standing genetic variation or recurrent mutation, and furthermore, favored alleles may either become fixed or remain variable in the population. Very few population genetic statistics are sensitive to all of these modes of selection. Here we introduce and evaluate the Comparative Haplotype Identity statistic (χMD), which assesses whether pairwise haplotype sharing at a locus in one population is unusually large compared with another population, relative to genome-wide trends. Using simulations that emulate human and Drosophila genetic variation, we find that χMD is sensitive to a wide range of selection scenarios, and for some very challenging cases (e.g. partial soft sweeps), it outperforms other two population statistics. We also find that, as with FST, our haplotype approach has the ability to detect surprisingly ancient selective sweeps. Particularly for the scenarios resembling human variation, we find that χMD outperforms other frequency and haplotype-based statistics for soft and/or partial selective sweeps. Applying χMD and other between-population statistics to published population genomic data from D. melanogaster, we find both shared and unique genes and functional categories identified by each statistic. The broad utility and computational simplicity of χMD will make it an especially valuable tool in the search for genes targeted by local adaptation.
Population genetics seeks to illuminate the forces shaping genetic variation, often based on a single snapshot of genomic variation. However, utilizing multiple sampling times to study changes in allele frequencies can help clarify the relative roles of neutral and non-neutral forces on short time scales. This study compares whole-genome sequence variation of recently collected natural population samples of Drosophila melanogaster against a collection made approximately 35 years prior from the same locality—encompassing roughly 500 generations of evolution. The allele frequency changes between these time points would suggest a relatively small local effective population size on the order of 10,000, significantly smaller than the global effective population size of the species. Some loci display stronger allele frequency changes than would be expected anywhere in the genome under neutrality—most notably the tandem paralogs Cyp6a17 and Cyp6a23, which are impacted by structural variation associated with resistance to pyrethroid insecticides. We find a genome-wide excess of outliers for high genetic differentiation between old and new samples, but a larger number of adaptation targets may have affected SNP-level differentiation versus window differentiation. We also find evidence for strengthening latitudinal allele frequency clines: northern-associated alleles have increased in frequency by an average of nearly 2.5% at SNPs previously identified as clinal outliers, but no such pattern is observed at random SNPs. This project underscores the scientific potential of using multiple sampling time points to investigate how evolution operates in natural populations, by quantifying how genetic variation has changed over ecologically relevant timescales.
In species with large population sizes such as Drosophila, natural selection may have substantial effects on genetic diversity and divergence. However, the implications of this widespread nonneutrality for standard population genetic assumptions and practices remain poorly resolved. Here, we assess the consequences of recurrent hitchhiking (RHH), in which selective sweeps occur at a given rate randomly across the genome. We use forward simulations to examine two published RHH models for D. melanogaster, reflecting relatively common/weak and rare/strong selection. We find that unlike the rare/strong RHH model, the common/weak model entails a slight degree of Hill–Robertson interference in high recombination regions. We also find that the common/weak RHH model is more consistent with our genome-wide estimate of the proportion of substitutions fixed by natural selection between D. melanogaster and D. simulans (19%). Finally, we examine how these models of RHH might bias demographic inference. We find that these RHH scenarios can bias demographic parameter estimation, but such biases are weaker for parameters relating recently diverged populations, and for the common/weak RHH model in general. Thus, even for species with important genome-wide impacts of selective sweeps, neutralist demographic inference can have some utility in understanding the histories of recently diverged populations.
A long-standing enigma concerns the geographic and ecological origins of the intensively studied vinegar fly, Drosophila melanogaster, a globally widespread species [1] which "has invariably appeared to be a strict human commensal" [2]. In spite of its sub-Saharan origins, this species has never been reported from undisturbed wilderness environments that might reflect its pre-commensal niche [3]. Here, we document the collection of 288 D. melanogaster individuals from African wilderness areas in Zambia, Zimbabwe, and Namibia. After sequencing the genomes of 17 flies collected from Kafue National Park, Zambia, we found reduced genetic diversity relative to town populations, elevated chromosomal inversion frequencies, and strong differences at specific genes including known insecticide targets. Combining these new genomes with prior data enabled us to gain novel insights into the history of this species' geographic expansion. Our demographic estimates indicated that an expansion from southern Africa began approximately 10,000 years ago, with a Saharan crossing soon after, but expansion from the Middle East into Europe did not begin until roughly 1,400 years ago. This improved model of demographic history will provide a critical resource for future evolutionary and genomic studies of this key model organism. Our results add historical context to the species' human association, and the opportunity to study wilderness populations opens the door for future studies on the biological basis of its adaptation to human environments. RESULTS AND DISCUSSIONDrosophila melanogaster persists in African wilderness D. melanogaster is among the most intensivelystudied species in the world, and yet it remains "among the commonest species whose exact place of origin and even ancestry have never been satisfactorily explained" [2]. Its relatives are distributed across sub-Saharan Africa and nearby islands, and a biogeographic analysis proposed an ancestral range in western and central Africa [3]. Despite considerable efforts to collect D. melanogaster from this equatorial region, it was never discovered in undisturbed wilderness, instead occurring only in human-settled areas and "seminatural habitats" [3]. However, a recent population genomic analysis suggested that D. melanogaster originated in southern Africa: populations from Zambia and Zimbabwe have the species' highest levels of genetic variation, whereas other populations may have lost diversity due to founder event bottlenecks during geographic expansion [4]. These findings raise the possibility that D. melanogaster originated (and might still persist) in wild environments of southern-central Africa, which are primarily characterized by seasonally dry Miombo and Mopane woodlands [5]. Although D. melanogaster has occasionally been sampled from human settlements near natural areas in Zimbabwe [6,7], its hypothesized persistence in wild Miombo/ Mopane forests [4] remains unconfirmed. Sprengelmeyer 1 ARTICLES PREPRINTHere, we report the collection of D. melanogaster in five distinct...
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.