The transitions from foraging to farming and later to pastoralism in Stone Age Eurasia (c. 11-3 thousand years before present, BP) represent some of the most dramatic lifestyle changes in human evolution. We sequenced 317 genomes of primarily Mesolithic and Neolithic individuals from across Eurasia combined with radiocarbon dates, stable isotope data, and pollen records. Genome imputation and co-analysis with previously published shotgun sequencing data resulted in >1600 complete ancient genome sequences offering fine-grained resolution into the Stone Age populations. We observe that: 1) Hunter-gatherer groups were more genetically diverse than previously known, and deeply divergent between western and eastern Eurasia. 2) We identify hitherto genetically undescribed hunter-gatherers from the Middle Don region that contributed ancestry to the later Yamnaya steppe pastoralists; 3) The genetic impact of the Neolithic transition was highly distinct, east and west of a boundary zone extending from the Black Sea to the Baltic. Large-scale shifts in genetic ancestry occurred to the west of this "Great Divide", including an almost complete replacement of hunter-gatherers in Denmark, while no substantial ancestry shifts took place during the same period to the east. This difference is also reflected in genetic relatedness within the populations, decreasing substantially in the west but not in the east where it remained high until c. 4,000 BP; 4) The second major genetic transformation around 5,000 BP happened at a much faster pace with Steppe-related ancestry reaching most parts of Europe within 1,000-years. Local Neolithic farmers admixed with incoming pastoralists in eastern, western, and southern Europe whereas Scandinavia experienced another near-complete population replacement. Similar dramatic turnover-patterns are evident in western Siberia; 5) Extensive regional differences in the ancestry components involved in these early events remain visible to this day, even within countries. Neolithic farmer ancestry is highest in southern and eastern England while Steppe-related ancestry is highest in the Celtic populations of Scotland, Wales, and Cornwall (this research has been conducted using the UK Biobank resource); 6) Shifts in diet, lifestyle and environment introduced new selection pressures involving at least 21 genomic regions. Most such variants were not universally selected across populations but were only advantageous in particular ancestral backgrounds. Contrary to previous claims, we find that selection on the FADS regions, associated with fatty acid metabolism, began before the Neolithisation of Europe. Similarly, the lactase persistence allele started increasing in frequency before the expansion of Steppe-related groups into Europe and has continued to increase up to the present. Along the genetic cline separating Mesolithic hunter-gatherers from Neolithic farmers, we find significant correlations with trait associations related to skin disorders, diet and lifestyle and mental health status, suggesting marked phenotypic differences between these groups with very different lifestyles. This work provides new insights into major transformations in recent human evolution, elucidating the complex interplay between selection and admixture that shaped patterns of genetic variation in modern populations.
The ancestral recombination graph (ARG) is a structure that describes the joint genealogies of sampled DNA sequences along the genome. Recent computational methods have made impressive progress towards scalably estimating whole-genome genealogies. In addition to inferring the ARG, some of these methods can also provide ARGs sampled from a defined posterior distribution. Obtaining good samples of ARGs is crucial for quantifying statistical uncertainty and for estimating population genetic parameters such as effective population size, mutation rate, and allele age. Here, we use standard neutral coalescent simulations to benchmark the estimates of pairwise coalescence times from three popular ARG inference programs: ARGweaver, Relate, and tsinfer+tsdate. We compare 1) the true coalescence times to the inferred times at each locus; 2) the distribution of coalescence times across all loci to the expected exponential distribution; 3) whether the sampled coalescence times have the properties expected of a valid posterior distribution. We find that inferred coalescence times at each locus are most accurate in ARGweaver, and often more accurate in Relate than in tsinfer+tsdate. However, all three methods tend to overestimate small coalescence times and underestimate large ones. Lastly, the posterior distribution of ARGweaver is closer to the expected posterior distribution than Relate’s, but this higher accuracy comes at a substantial trade-off in scalability. The best choice of method will depend on the number and length of input sequences and on the goal of downstream analyses, and we provide guidelines for the best practices.
Estimating fitness differences between allelic variants is a central goal of experimental evolution. Current methods for inferring such differences from allele frequency time series typically assume that the effects of selection can be described by a fixed selection coefficient. However, fitness is an aggregate of several components including mating success, fecundity, and viability. Distinguishing between these components could be critical in many scenarios. Here, we develop a flexible maximum likelihood framework that can disentangle different components of fitness from genotype frequency data, and estimate them individually in males and females. As a proof-of-principle, we apply our method to experimentally evolved cage populations of Drosophila melanogaster, in which we tracked the relative frequencies of a loss-of-function and wild-type allele of yellow. This X-linked gene produces a recessive yellow phenotype when disrupted and is involved in male courtship ability. We find that the fitness costs of the yellow phenotype take the form of substantially reduced mating preference of wild-type females for yellow males, together with a modest reduction in the viability of yellow males and females. Our framework should be generally applicable to situations where it is important to quantify fitness components of specific genetic variants, including quantitative characterization of the population dynamics of CRISPR gene drives.
Summary The Eurasian Holocene (beginning c. 12 thousand years ago) encompassed some of the most significant changes in human evolution, with far-reaching consequences for the dietary, physical and mental health of present-day populations. Using an imputed dataset of >1600 complete ancient genome sequences, and new computational methods for locating selection in time and space, we reconstructed the selection landscape of the transition from hunting and gathering, to farming and pastoralism across West Eurasia. We identify major selection signals related to metabolism, possibly associated with the dietary shift occurring in this period. We show that the selection on loci such as the FADS cluster, associated with fatty acid metabolism, and the lactase persistence locus, began earlier than previously thought. A substantial amount of selection is also found in the HLA region and other loci associated with immunity, possibly due to the increased exposure to pathogens during the Neolithic, which may explain the current high prevalence of auto-immune disease, such as psoriasis, due to genetic trade-offs. By using ancient populations to infer local ancestry tracks in hundreds of thousands of samples from the UK Biobank, we find strong genetic differentiation among ancient Europeans in loci associated with anthropometric traits and susceptibility to several diseases that contribute to present-day disease burden. These were previously thought to be caused by local selection, but in fact can be attributed to differential genetic contributions from various source populations that are ancestral to present-day Europeans. Thus, alleles associated with increased height seem to have increased in frequency following the Yamnaya migration into northwestern Europe around 5,000 years ago. Alleles associated with increased risk of some mood-related phenotypes are overrepresented in the farmer ancestry component entering Europe from Anatolia around 11,000 years ago, while western hunter-gatherers show a strikingly high contribution of alleles conferring risk of traits related to diabetes. Our results paint a picture of the combined contributions of migration and selection in shaping the phenotypic landscape of present-day Europeans that suggests a combination of ancient selection and migration, rather than recent local selection, is the primary driver of present-day phenotypic differences in Europe.
Epidemiological models for multi-host pathogen systems often classify individuals taxonomically and use species-specific parameter values, but in species-rich communities, that approach may require intractably many parameters. Trait-based epidemiological models offer a potential solution, but have not accounted for within-species trait variation or between-species trait overlap. Here, we propose and study trait-based models with host and vector communities represented as trait distributions without regard to species identity. To illustrate this approach, we develop SIS models for disease spread in plant-pollinator networks with continuous trait distributions. We model trait-dependent contact rates in two common scenarios: nested networks, and specialized plant-pollinator interactions based on trait matching. We find that disease spread in plant-pollinator networks is impacted the most by selective pollinators, universally attractive flowers, and co-specialized plant-pollinator pairs. When extreme pollinator traits are rare, pollinators with common traits are most important for disease spread, whereas when extreme flower traits are rare, flowers with uncommon traits impact disease spread the most. Greater nestedness and specialization both typically promote disease persistence. Given recent pollinator declines caused in part by pathogens, we discuss how trait-based models could inform conservation strategies for wild and managed pollinators. Furthermore, while we have applied our model to pollinators and pathogens, its framework is general and can be transferred to any kind of species interactions, in any community.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.