The histories of crop domestication and breeding are recorded in genomes. Although tomato is a model species for plant biology and breeding, the nature of human selection that altered its genome remains largely unknown. Here we report a comprehensive analysis of tomato evolution based on the genome sequences of 360 accessions. We provide evidence that domestication and improvement focused on two independent sets of quantitative trait loci (QTLs), resulting in modern tomato fruit ∼100 times larger than its ancestor. Furthermore, we discovered a major genomic signature for modern processing tomatoes, identified the causative variants that confer pink fruit color and precisely visualized the linkage drag associated with wild introgressions. This study outlines the accomplishments as well as the costs of historical selection and provides molecular insights toward further improvement.
Most fruits in our daily diet are the products of domestication and breeding. Here we report a map of genome variation for a major fruit that encompasses ~3.6 million variants, generated by deep resequencing of 115 cucumber lines sampled from 3,342 accessions worldwide. Comparative analysis suggests that fruit crops underwent narrower bottlenecks during domestication than grain crops. We identified 112 putative domestication sweeps; 1 of these regions contains a gene involved in the loss of bitterness in fruits, an essential domestication trait of cucumber. We also investigated the genomic basis of divergence among the cultivated populations and discovered a natural genetic variant in a β-carotene hydroxylase gene that could be used to breed cucumbers with enhanced nutritional value. The genomic history of cucumber evolution uncovered here provides the basis for future genomics-enabled breeding.
Using coalescent simulations, we study the impact of three different sampling schemes on patterns of neutral diversity in structured populations. Specifically, we are interested in two summary statistics based on the site frequency spectrum as a function of migration rate, demographic history of the entire substructured population (including timing and magnitude of specieswide expansions), and the sampling scheme. Using simulations implementing both finite-island and two-dimensional stepping-stone spatial structure, we demonstrate strong effects of the sampling scheme on Tajima's D (D T ) and Fu and Li's D (D FL ) statistics, particularly under specieswide (range) expansions. Pooled samples yield average D T and D FL values that are generally intermediate between those of local and scattered samples. Local samples (and to a lesser extent, pooled samples) are influenced by local, rapid coalescence events in the underlying coalescent process. These processes result in lower proportions of external branch lengths and hence lower proportions of singletons, explaining our finding that the sampling scheme affects D FL more than it does D T . Under specieswide expansion scenarios, these effects of spatial sampling may persist up to very high levels of gene flow (Nm . 25), implying that local samples cannot be regarded as being drawn from a panmictic population. Importantly, many data sets on humans, Drosophila, and plants contain signatures of specieswide expansions and effects of sampling scheme that are predicted by our simulation results. This suggests that validating the assumption of panmixia is crucial if robust demographic inferences are to be made from local or pooled samples. However, future studies should consider adopting a framework that explicitly accounts for the genealogical effects of population subdivision and empirical sampling schemes.A LTHOUGH most (if not all) species are characterized by various degrees of population subdivision, population geneticists continue to employ models of panmictic populations of constant size (the neutral equilibrium, NE, model) when testing for selection and/ or demographic changes through time. Motivated by empirical data sets that were found to be incompatible with expectations under the NE model, recent largescale studies have employed coalescent simulations of demographic changes such as bottlenecks and/or population expansions in attempts to estimate parameters of such (arguably) biologically more realistic scenarios (e.g., Marth et al. 2004;Nordborg et al. 2005;Ometto et al. 2005;Schmid et al. 2005;Heuertz et al. 2006;Li and Stephan 2006;Pyhäjärvi et al. 2007;Ross-Ibarra et al. 2008). However, such simulations still assume unstructured populations, while the empirical data may derive from subdivided species and a variety of sampling schemes. The data typically collected in such projects consist of multilocus DNA sequences from which genealogical information is extracted, e.g., on the number of segregating sites (S ), the average pairwise sequence diversity...
Ever since Darwin's pioneering research, the evolution of self-fertilisation (selfing) has been regarded as one of the most prevalent evolutionary transitions in flowering plants. A major mechanism to prevent selfing is the self-incompatibility (SI) recognition system, which consists of male and female specificity genes at the S-locus and SI modifier genes. Under conditions that favour selfing, mutations disabling the male recognition component are predicted to enjoy a relative advantage over those disabling the female component, because male mutations would increase through both pollen and seeds whereas female mutations would increase only through seeds. Despite many studies on the genetic basis of loss of SI in the predominantly selfing plant Arabidopsis thaliana, it remains unknown whether selfing arose through mutations in the female specificity gene (S-receptor kinase, SRK), male specificity gene (S-locus cysteine-rich protein, SCR; also known as S-locus protein 11, SP11) or modifier genes, and whether any of them rose to high frequency across large geographic regions. Here we report that a disruptive 213-base-pair (bp) inversion in the SCR gene (or its derivative haplotypes with deletions encompassing the entire SCR-A and a large portion of SRK-A) is found in 95% of European accessions, which contrasts with the genome-wide pattern of polymorphism in European A. thaliana. Importantly, interspecific crossings using Arabidopsis halleri as a pollen donor reveal that some A. thaliana accessions, including Wei-1, retain the female SI reaction, suggesting that all female components including SRK are still functional. Moreover, when the 213-bp inversion in SCR was inverted and expressed in transgenic Wei-1 plants, the functional SCR restored the SI reaction. The inversion within SCR is the first mutation disrupting SI shown to be nearly fixed in geographically wide samples, and its prevalence is consistent with theoretical predictions regarding the evolutionary advantage of mutations in male components.
We employed a multilocus approach to examine the effects of population subdivision and natural selection on DNA polymorphism in 2 closely related wild tomato species (Solanum peruvianum and Solanum chilense), using sequence data for 8 nuclear loci from populations across much of the species' range. Both species exhibit substantial levels of nucleotide variation. The species-wide level of silent nucleotide diversity is 18% higher in S. peruvianum (pi(sil) approximately 2.50%) than in S. chilense (pi(sil) approximately 2.12%). One of the loci deviates from neutral expectations, showing a clinal pattern of nucleotide diversity and haplotype structure in S. chilense. This geographic pattern of variation is suggestive of an incomplete (ongoing) selective sweep, but neutral explanations cannot be entirely dismissed. Both wild tomato species exhibit moderate levels of population differentiation (average F(ST) approximately 0.20). Interestingly, the pooled samples (across different demes) exhibit more negative Tajima's D and Fu and Li's D values; this marked excess of low-frequency polymorphism can only be explained by population (or range) expansion and is unlikely to be due to population structure per se. We thus propose that population structure and population/range expansion are among the most important evolutionary forces shaping patterns of nucleotide diversity within and among demes in these wild tomatoes. Patterns of population differentiation may also be impacted by soil seed banks and historical associations mediated by climatic cycles. Intragenic linkage disequilibrium (LD) decays very rapidly with physical distance, suggesting high recombination rates and effective population sizes in both species. The rapid decline of LD seems very promising for future association studies with the purpose of mapping functional variation in wild tomatoes.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.