A major challenge of biology is understanding the relationship between molecular genetic variation and variation in quantitative traits, including fitness. This relationship determines our ability to predict phenotypes from genotypes and to understand how evolutionary forces shape variation within and between species. Previous efforts to dissect the genotype-phenotype map were based on incomplete genotypic information. Here, we describe the Drosophila melanogaster Genetic Reference Panel (DGRP), a community resource for analysis of population genomics and quantitative traits. The DGRP consists of fully sequenced inbred lines derived from a natural population. Population genomic analyses reveal reduced polymorphism in centromeric autosomal regions and the X chromosome, evidence for positive and negative selection, and rapid evolution of the X chromosome. Many variants in novel genes, most at low frequency, are associated with quantitative traits and explain a large fraction of the phenotypic variance. The DGRP facilitates genotype-phenotype mapping using the power of Drosophila genetics.
Genome data have revealed great variation in the numbers of genes in different organisms, which indicates that there is a fundamental process of genome evolution: the origin of new genes. However, there has been little opportunity to explore how genes with new functions originate and evolve. The study of ancient genes has highlighted the antiquity and general importance of some mechanisms of gene origination, and recent observations of young genes at early stages in their evolution have unveiled unexpected molecular and evolutionary processes.
Experimental evolution systems allow the genomic study of adaptation, and so far this has been done primarily in asexual systems with small genomes, such as bacteria and yeast. Here we present whole-genome resequencing data from Drosophila melanogaster populations that have experienced over 600 generations of laboratory selection for accelerated development. Flies in these selected populations develop from egg to adult ∼20% faster than flies of ancestral control populations, and have evolved a number of other correlated phenotypes. On the basis of 688,520 intermediate-frequency, high-quality single nucleotide polymorphisms, we identify several dozen genomic regions that show strong allele frequency differentiation between a pooled sample of five replicate populations selected for accelerated development and pooled controls. On the basis of resequencing data from a single replicate population with accelerated development, as well as single nucleotide polymorphism data from individual flies from each replicate population, we infer little allele frequency differentiation between replicate populations within a selection treatment. Signatures of selection are qualitatively different than what has been observed in asexual species; in our sexual populations, adaptation is not associated with 'classic' sweeps whereby newly arising, unconditionally advantageous mutations become fixed. More parsimonious explanations include 'incomplete' sweep models, in which mutations have not had enough time to fix, and 'soft' sweep models, in which selection acts on pre-existing, common genetic variants. We conclude that, at least for life history characters such as development time, unconditionally advantageous alleles rarely arise, are associated with small net fitness gains or cannot fix because selection coefficients change over time.
Genome-wide scanning for signals of recent positive selection is essential for a comprehensive and systematic understanding of human adaptation. Here, we present a genomic survey of recent local selective sweeps, especially aimed at those nearly or recently completed. A novel approach was developed for such signals, based on contrasting the extended haplotype homozygosity (EHH) profiles between populations. We applied this method to the genome single nucleotide polymorphism (SNP) data of both the International HapMap Project and Perlegen Sciences, and detected widespread signals of recent local selection across the genome, consisting of both complete and partial sweeps. A challenging problem of genomic scans of recent positive selection is to clearly distinguish selection from neutral effects, given the high sensitivity of the test statistics to departures from neutral demographic assumptions and the lack of a single, accurate neutral model of human history. We therefore developed a new procedure that is robust across a wide range of demographic and ascertainment models, one that indicates that certain portions of the genome clearly depart from neutrality. Simulations of positive selection showed that our tests have high power towards strong selection sweeps that have undergone fixation. Gene ontology analysis of the candidate regions revealed several new functional groups that might help explain some important interpopulation differences in phenotypic traits.
We have sequenced the genome of a second Drosophila species, Drosophila pseudoobscura, and compared this to the genome sequence of Drosophila melanogaster, a primary model organism. Throughout evolution the vast majority of Drosophila genes have remained on the same chromosome arm, but within each arm gene order has been extensively reshuffled, leading to a minimum of 921 syntenic blocks shared between the species. A repetitive sequence is found in the D. pseudoobscura genome at many junctions between adjacent syntenic blocks. Analysis of this novel repetitive element family suggests that recombination between offset elements may have given rise to many paracentric inversions, thereby contributing to the shuffling of gene order in the D. pseudoobscura lineage. Based on sequence similarity and synteny, 10,516 putative orthologs have been identified as a core gene set conserved over 25-55 million years (Myr) since the pseudoobscura/melanogaster divergence. Genes expressed in the testes had higher amino acid sequence divergence than the genome-wide average, consistent with the rapid evolution of sex-specific proteins. Cis-regulatory sequences are more conserved than random and nearby sequences between the species-but the difference is slight, suggesting that the evolution of cis-regulatory elements is flexible. Overall, a pattern of repeat-mediated chromosomal rearrangement, and high coadaptation of both male genes and cis-regulatory sequences emerges as important themes of genome divergence between these species of Drosophila.
Uncertainty about the demographic history of populations can hamper genome-wide scans for selection based on population genetic models. To obtain a portrait of the effects of demographic history on genome variability patterns in Drosophila melanogaster populations, we surveyed noncoding DNA polymorphism at 10 X-linked loci in large samples from three African and two non-African populations. All five populations show significant departures from expectations under the standard neutral model. We detect weak but significant differentiation between East (Kenya and Zimbabwe) and West/Central sub-Saharan (Gabon) African populations. A skew toward high-frequency-derived polymorphisms, elevated levels of linkage disequilibrium (LD) and significant heterogeneity in levels of polymorphism and divergence in the Gabon sample suggest that this population is further from mutation-drift equilibrium than the two Eastern African populations. Both non-African populations harbor significantly higher levels of LD, a large excess of high-frequency-derived mutations and extreme heterogeneity among loci in levels of polymorphism and divergence. Rejections of the neutral model in D. melanogaster populations using these and similar features have been interpreted as evidence for an important role for natural selection in shaping genome variability patterns. Based on simulations, we conclude that simple bottleneck models are sufficient to account for most, if not all, polymorphism features of both African and non-African populations. In contrast, we show that a steady-state recurrent hitchhiking model fails to account for several aspects of the data. Demographic departures from equilibrium expectations in both ancestral and derived populations thus represent a serious challenge to detecting positive selection in genome-wide scans using current methodologies.
Genome-wide nucleotide variation in non-African populations of Drosophila melanogaster is a subset of variation found in East sub-Saharan African populations, suggesting a bottleneck in the history of the former. We implement an approximate Bayesian approach to infer the timing, duration, and severity of this putative bottleneck and ask whether this inferred model is sufficient to account for patterns of variability observed at 115 loci scattered across the X chromosome. We estimate a recent bottleneck 0.006N e generations ago, somewhat further in the past than suggested by biogeographical evidence. Using various proposed statistical tests, we find that this bottleneck model is able to predict the majority of observed features of diversity and linkage disequilibrium in the data. Thus, while precise estimates of bottleneck parameters (like inferences of selection) are sensitive to model assumptions, our results imply that it may be unnecessary to invoke frequent selective sweeps associated with the dispersal of D. melanogaster from Africa to explain patterns of variability in non-African populations.T HERE is considerable interest in using population genetic approaches to identify regions of the genome that underlie population-or species-specific adaptations. More generally, such scans of the genome can also answer basic evolutionary questions such as the relative importance of selection and nonadaptive factors (such as changes in the size and structure of populations over time) in shaping patterns of genome variability. The rapid increase in our ability to survey population-level nucleotide variability for larger and larger portions of the genome yields increasing statistical power to distinguish among alternative population genetic models.The most common approach to hypothesis testing has been model based, comparing statistics summarizing features of the observed data, such as the distribution of polymorphism frequencies, to the expectations of a null model (e.g., Tajima 1989;Fu and Li 1993;Fay and Wu 2000). The most commonly used null model assumes a large, panmictic, population with neutral mutations occurring according to the infinite-sites model, hereafter referred to as the standard neutral model (SNM) (see Hudson 1983). Significant departures from the SNM are usually interpreted as evidence for the recent action of natural selection (e.g., Harr et al. 2002;Glinka et al. 2003). However, natural selection is not a unique alternative hypothesis to the SNM. Deviations from the demographic assumptions of the SNM, such as population bottlenecks, can result in the observation of significant departures in the same direction as that expected under selective sweep models (e.g. In Drosophila melanogaster, the observation that variation is substantially reduced outside of Africa (Begun and Aquadro 1993) has focused much attention on understanding the cause of this difference between populations. The species is believed to have an African origin and to have colonized other continents only recently (David and ...
The three species of the Drosophila simulans clade-the cosmopolitan species, D. simulans, and the two island endemic species, D. mauritiana and D. sechellia-are important models in speciation genetics, but some details of their phylogenetic and speciation history remain unresolved. The order and timing of speciation are disputed, and the existence, magnitude, and timing of gene flow among the three species remain unclear. Here we report on the analysis of a whole-genome four-species sequence alignment that includes all three D. simulans clade species as well as the D. melanogaster reference sequence. The alignment comprises novel, paired short-read sequence data from a single highly inbred line each from D. simulans, D. mauritiana, and D. sechellia. We are unable to reject a species phylogeny with a basal polytomy; the estimated age of the polytomy is 242,000 yr before the present. However, we also find that up to 4.6% of autosomal and 2.2% of X-linked regions have evolutionary histories consistent with recent gene flow between the mainland species (D. simulans) and the two island endemic species (D. mauritiana and D. sechellia). Our findings thus show that gene flow has occurred throughout the genomes of the D. simulans clade species despite considerable geographic, ecological, and intrinsic reproductive isolation. Last, our analysis of lineage-specific changes confirms that the D. sechellia genome has experienced a significant excess of slightly deleterious changes and a dearth of presumed favorable changes. The relatively reduced efficacy of natural selection in D. sechellia is consistent with its derived, persistently reduced historical effective population size.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.