Highlights d Bronze Age (BA) Helladic, Cycladic, and Minoan genomes from the Aegean were sequenced d 3,000 BCE Aegeans are homogeneous and derive ancestry mainly from Neolithic farmers d Neolithic Caucasus-like and BA Pontic-Caspian Steppe-like gene flow shaped the Aegean d Present-day Greeks are genetically similar to 2,000 BCE Aegeans from Northern Greece
Although Brazil was inhabited by more than 3,000 Indigenous populations prior to European colonization, today's Indigenous peoples represent less than 1% of Brazil's census population. Some of the decimated communities belonged to the so-called "Botocudos" from central-eastern Brazil. These peoples are thought to represent a case of long-standing genetic continuity bearing a strong craniometric resemblance to that of the oldest Indigenous Americans ("Paleoamericans"). Yet, little is known about their origins and genetic relationship to other Native Americans, as only two "Botocudo" genomes have been sequenced so far and those were surprisingly of Polynesian ancestry. To deepen our knowledge on the genomic history of pre-contact Indigenous Americans and the pathogens they were exposed to, we carbon-dated and sequenced 24 ancient Brazilians (including 22 "Botocudos") whose remains were hosted at the National Museum of Rio de Janeiro and recovered prior to the tragic 2018 fire. The resulting genomes' depth of coverage ranged from 0.001x to 24x. Their genetic ancestry was found to be Indigenous American without gene flow from external populations such as Europeans, Africans or Polynesians. Unlike Mesoamericans, the "Botocudos" and Amazonians do not seem to have experienced a population expansion once in the Americas. Moreover, remarkably, their genomes exhibit amongst the lowest levels of heterozygosity worldwide and long runs of homozygosity, which could be explained by unique social practices or a very small effective size. Finally, whole genomes of likely ancient pathogens were recovered, including lineages of human parvovirus B19 that were possibly introduced after the European contact.
Owing to technological advances in ancient DNA, it is now possible to sequence viruses from the past to track down their origin and evolution. However, ancient DNA data is considerably more degraded and contaminated than modern data making the identification of ancient viral genomes particularly challenging. Several methods to characterise the modern microbiome (and, within this, the virome) have been developed; in particular, tools that assign sequenced reads to specific taxa in order to characterise the organisms present in a sample of interest. While these existing tools are routinely used in modern data, their performance when applied to ancient microbiome data to screen for ancient viruses remains unknown. In this work, we conducted an extensive simulation study using public viral sequences to establish which tool is the most suitable to screen ancient samples for human DNA viruses. We compared the performance of four widely used classifiers, namely Centrifuge, Kraken2, DIAMOND and MetaPhlAn2, in correctly assigning sequencing reads to the corresponding viruses. To do so, we simulated reads by adding noise typical of ancient DNA to a set of publicly available human DNA viral sequences and to the human genome. We fragmented the DNA into different lengths, added sequencing error and C to T and G to A deamination substitutions at the read termini. Then we measured the resulting sensitivity and precision for all classifiers. Across most simulations, more than 228 out of the 233 simulated viruses were recovered by Centrifuge, Kraken2 and DIAMOND, in contrast to MetaPhlAn2 which recovered only around one third. Overall, Centrifuge and Kraken2 had the best performance with the highest values of sensitivity and precision. We found that deamination damage had little impact on the performance of the classifiers, less than the sequencing error and the length of the reads. Since Centrifuge can handle short reads (in contrast to DIAMOND and Kraken2 with default settings) and since it achieve the highest sensitivity and precision at the species level across all the simulations performed, it is our recommended tool. Regardless of the tool used, our simulations indicate that, for ancient human studies, users should use strict filters to remove all reads of potential human origin. Finally, we recommend that users verify which species are present in the database used, as it might happen that default databases lack sequences for viruses of interest.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.