Abstract:Although a broad range of methods exists for reconstructing population history from genome-wide single nucleotide polymorphism data, just a few methods gained popularity in archaeogenetics: principal component analysis (PCA); ADMIXTURE, an algorithm that models individuals as mixtures of multiple ancestral sources represented by actual or inferred populations; formal tests for admixture such as f3-statistics and D-statistics; and qpAdm, a tool for fitting two-component and more complex admixture models to grou… Show more
“…e . after the time of the ancient individual(s) 49 . The most principled approach is thus to build ancestry models where source and ‘outgroup/reference’ populations are older than, or at least contemporary with, the target genome or group which we are trying to model 49 .…”
Section: Resultsmentioning
confidence: 99%
“…Modern populations provide ample power to detect differences, but their genetic affinity to ancient individuals may be confounded by later gene flow, i.e. after the time of the ancient individual(s) 49 . The most principled approach is thus to build ancestry models where source and 'outgroup/reference' populations are older than, or at least contemporary with, the target genome or group which we are trying to model 49 .…”
Section: Northern Europementioning
confidence: 99%
“…after the time of the ancient individual(s) 49 . The most principled approach is thus to build ancestry models where source and 'outgroup/reference' populations are older than, or at least contemporary with, the target genome or group which we are trying to model 49 . The improved statistical power of time-restricted ancestry in Twigstats thus offers an opportunity to develop principled ancestry models based solely on temporally appropriate source proxy groups.…”
Ancient DNA has unlocked new genetic histories and shed light on archaeological and historical questions, but many known and unknown historical events have remained below detection thresholds because subtle ancestry changes are challenging to reconstruct. Methods based on sharing of haplotypes and rare variants can improve power, but are not explicitly temporal and have not been adopted in unbiased ancestry models. Here, we develop Twigstats, a new approach of time-stratified ancestry analysis that can improve statistical power by an order of magnitude by focusing on coalescences in recent times, while remaining unbiased by population-specific drift. We apply this framework to 1,151 available ancient genomes, focussing on northern and central Europe in the historical period, and show that it allows modelling of individual-level ancestry using preceding genomes and provides previously unavailable resolution to detect broader ancestry transformations. In the first half of the first millennium ~1-500 CE (Common Era), we observe an expansion of Scandinavian-related ancestry across western, central, and southern Europe. However, in the second half of the millennium ~500-1000 CE, ancestry patterns suggest the regional disappearance or substantial admixture of these ancestries in multiple regions. Within Scandinavia itself, we document a major ancestry influx by ~800 CE, when a large proportion of Viking Age individuals carried ancestry from groups related to continental Europe. This primarily affected southern Scandinavia, and was differentially represented in the western and eastern directions of the wider Viking world. We infer detailed ancestry portraits integrated with historical, archaeological, and stable isotope evidence, documenting mobility at an individual level. Overall, our results are consistent with substantial mobility in Europe in the early historical period, and suggest that time-stratified ancestry analysis can provide a new lens for genetic history.
“…e . after the time of the ancient individual(s) 49 . The most principled approach is thus to build ancestry models where source and ‘outgroup/reference’ populations are older than, or at least contemporary with, the target genome or group which we are trying to model 49 .…”
Section: Resultsmentioning
confidence: 99%
“…Modern populations provide ample power to detect differences, but their genetic affinity to ancient individuals may be confounded by later gene flow, i.e. after the time of the ancient individual(s) 49 . The most principled approach is thus to build ancestry models where source and 'outgroup/reference' populations are older than, or at least contemporary with, the target genome or group which we are trying to model 49 .…”
Section: Northern Europementioning
confidence: 99%
“…after the time of the ancient individual(s) 49 . The most principled approach is thus to build ancestry models where source and 'outgroup/reference' populations are older than, or at least contemporary with, the target genome or group which we are trying to model 49 . The improved statistical power of time-restricted ancestry in Twigstats thus offers an opportunity to develop principled ancestry models based solely on temporally appropriate source proxy groups.…”
Ancient DNA has unlocked new genetic histories and shed light on archaeological and historical questions, but many known and unknown historical events have remained below detection thresholds because subtle ancestry changes are challenging to reconstruct. Methods based on sharing of haplotypes and rare variants can improve power, but are not explicitly temporal and have not been adopted in unbiased ancestry models. Here, we develop Twigstats, a new approach of time-stratified ancestry analysis that can improve statistical power by an order of magnitude by focusing on coalescences in recent times, while remaining unbiased by population-specific drift. We apply this framework to 1,151 available ancient genomes, focussing on northern and central Europe in the historical period, and show that it allows modelling of individual-level ancestry using preceding genomes and provides previously unavailable resolution to detect broader ancestry transformations. In the first half of the first millennium ~1-500 CE (Common Era), we observe an expansion of Scandinavian-related ancestry across western, central, and southern Europe. However, in the second half of the millennium ~500-1000 CE, ancestry patterns suggest the regional disappearance or substantial admixture of these ancestries in multiple regions. Within Scandinavia itself, we document a major ancestry influx by ~800 CE, when a large proportion of Viking Age individuals carried ancestry from groups related to continental Europe. This primarily affected southern Scandinavia, and was differentially represented in the western and eastern directions of the wider Viking world. We infer detailed ancestry portraits integrated with historical, archaeological, and stable isotope evidence, documenting mobility at an individual level. Overall, our results are consistent with substantial mobility in Europe in the early historical period, and suggest that time-stratified ancestry analysis can provide a new lens for genetic history.
“…Following results from our ADMIXTURE and PCA analyses, we included multiple Asian and Melanesian populations (limited to a subset of Papuan language speakers; see SI Methods) as potential ancestry sources, and added the Toalean forager and Onge population as proxies for founder ancestry. To improve qpAdm sensitivity and power, we rotated populations across 'left' and 'right' groupings to produce a large set of ancestry models for each Wallacean population that were evaluated for plausibility (see SI Methods; (39)).…”
Section: Evidence For Retention Of Amh Founder Ancestry In Wallaceamentioning
The tropical archipelago of Wallacea was first settled by anatomically modern humans (AMH) by 50 thousand years ago (kya), with descendent populations thought to have remained genetically isolated prior to the arrival of Austronesian seafarers around 3.5 kya. Modern Wallaceans exhibit a longitudinal countergradient of Papuan- and Asian-related ancestries widely considered as evidence for mixing between local populations and Austronesian seafarers, though converging multidisciplinary evidence suggests that the Papuan-related component instead comes primarily from back-migrations from New Guinea. Here, we reconstruct Wallacean population genetic history using more than 250 newly reported genomes from 12 Wallacean and three West Papuan populations and confirm that the vast majority of Papuan-related ancestry in Wallacea (∼75–100%) comes from prehistoric migrations originating in New Guinea and only a minor fraction is attributable to the founding AMH settlers. Mixing between Papuan and local Wallacean lineages appear to have been confined to the western and central parts of the archipelago and likely occurred contemporaneously with the widespread introduction of genes from Austronesian seafarers—which now comprise between ∼40–85% of modern Wallacean ancestry—though dating historical admixture events remains challenging due to mixing continuing into the Historical Period. In conjunction with archaeological and linguistic records, our findings point to a dynamic Wallacean population history that was profoundly reshaped by the spread of Papuan genes, languages, and culture in the past 3,500 years.
“…However, sifting through all possible proxy sources and the right populations for an admixture event can be overwhelming. In addition, a recent survey has shown that, depending on the approach and the quality of the genetic data analyzed, qpAdm may suffer from high false discovery rates, adding substantial uncertainty to the interpretation of the results of admixture inference (Eren Yüncü et al, 2023).…”
The history of human populations has been strongly shaped by admixture events, contributing to the patterns of observed genetic diversity across populations. Given its significance for evolutionary and medical studies, many algorithms focusing on the inference of the genetic composition of admixed populations have been developed. In particular, the recent development of new ancestry estimation methods that consider the fragmentary nature of ancient genotype data, such as the f-statistics family and its derivations, have radically changed our understanding of the past. F-statistics capture similar genetic similarity information as Principal Component Analysis (PCA), which is widely used in population genetics to quantify genetic affinity between populations or individuals. In this study, we introduce ASAP (ASsessing ancestry proportions through Principal component Analysis) method that leverages PCA and Non-Negative Least Square (NNLS) to assess the ancestral compositions of admixed individuals given a large set of populations. We tested ASAP on different simulated models, incorporating high levels of missingness. Our results show its ability to reliably estimate ancestry across numerous scenarios, even those with a significant proportion of missing genotypes, in a fraction of the time required when using other tools. When harnessed on Eurasia's genotype data, ASAP helped replicate and extend findings from previous studies proving to be a fast, efficient, and straightforward new ancestry estimation tool.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.