Evolution is typically thought to proceed through divergence of genes, proteins, and ultimately phenotypes1-3. However, similar traits might also evolve convergently in unrelated taxa due to similar selection pressures4,5. Adaptive phenotypic convergence is widespread in nature, and recent results from a handful of genes have suggested that this phenomenon is powerful enough to also drive recurrent evolution at the sequence level6-9. Where homoplasious substitutions do occur these have long been considered the result of neutral processes. However, recent studies have demonstrated that adaptive convergent sequence evolution can be detected in vertebrates using statistical methods that model parallel evolution9,10 although the extent to which sequence convergence between genera occurs across genomes is unknown. Here we analyse genomic sequence data in mammals that have independently evolved echolocation and show for the first time that convergence is not a rare process restricted to a handful of loci but is instead widespread, continuously distributed and commonly driven by natural selection acting on a small number of sites per locus. Systematic analyses of convergent sequence evolution in 805,053 amino acids within 2,326 orthologous coding gene sequences compared across 22 mammals (including four new bat genomes) revealed signatures consistent with convergence in nearly 200 loci. Strong and significant support for convergence among bats and the dolphin was seen in numerous genes linked to hearing or deafness, consistent with an involvement in echolocation. Surprisingly we also found convergence in many genes linked to vision: the convergent signal of many sensory genes was robustly correlated with the strength of natural selection. This first attempt to detect genome-wide convergent sequence evolution across divergent taxa reveals the phenomenon to be much more pervasive than previously recognised.
Molecular phylogenetics has rapidly established the evolutionary positions of most major mammal groups, yet analyses have repeatedly failed to agree on that of bats (order Chiroptera). Moreover, the relationship among the major bat lineages has proven equally contentious, with ongoing disagreements about whether echolocating bats are paraphyletic or a true group having profound implications for whether echolocation evolved once or possibly multiple times. By generating new bat genome data and applying model-based phylogenomic analyses designed to accommodate heterogeneous evolutionary processes, we show that-contrary to recent suggestions-bats are not closely related to odd-toed ungulates but instead have a more ancient origin as sister group to a large clade of carnivores, ungulates, and cetaceans. Additionally, we provide the first genome-scale support showing that laryngeal echolocating bats are not a true group and that this paraphyly is robust to their position within mammals. We suggest that earlier disagreements in the literature may reflect model misspecification, long-branch artifacts, poor taxonomic coverage, and differences in the phylogenetic markers used. These findings are a timely reminder of the relevance of experimental design and careful statistical analysis as we move into the phylogenomic era.
BackgroundHepatitis C virus (HCV) is a rapidly-evolving RNA virus that establishes chronic infections in humans. Despite the virus' public health importance and a wealth of sequence data, basic aspects of HCV molecular evolution remain poorly understood. Here we investigate three sets of whole HCV genomes in order to directly compare the evolution of whole HCV genomes at different biological levels: within- and among-hosts. We use a powerful Bayesian inference framework that incorporates both among-lineage rate heterogeneity and phylogenetic uncertainty into estimates of evolutionary parameters.ResultsMost of the HCV genome evolves at ~0.001 substitutions/site/year, a rate typical of RNA viruses. The antigenically-important E1/E2 genome region evolves particularly quickly, with correspondingly high rates of positive selection, as inferred using two related measures. Crucially, in this region an exceptionally higher rate was observed for within-host evolution compared to among-host evolution. Conversely, higher rates of evolution were seen among-hosts for functionally relevant parts of the NS5A gene. There was also evidence for slightly higher evolutionary rate for HCV subtype 1a compared to subtype 1b.ConclusionsUsing new statistical methods and comparable whole genome datasets we have quantified, for the first time, the variation in HCV evolutionary dynamics at different scales of organisation. This confirms that differences in molecular evolution between biological scales are not restricted to HIV and may represent a common feature of chronic RNA viral infection. We conclude that the elevated rate observed in the E1/E2 region during within-host evolution more likely results from the reversion of host-specific adaptations (resulting in slower long-term among-host evolution) than from the preferential transmission of slowly-evolving lineages.
Advances in DNA sequencing and informatics have revolutionised biology over the past four decades, but technological limitations have left many applications unexplored. Recently, portable, real-time, nanopore sequencing (RTnS) has become available. This offers opportunities to rapidly collect and analyse genomic data anywhere. However, generation of datasets from large, complex genomes has been constrained to laboratories. The portability and long DNA sequences of RTnS offer great potential for field-based species identification, but the feasibility and accuracy of these technologies for this purpose have not been assessed. Here, we show that a field-based RTnS analysis of closely-related plant species (Arabidopsis spp.) has many advantages over laboratory-based high-throughput sequencing (HTS) methods for species level identification and phylogenomics. Samples were collected and sequenced in a single day by RTnS using a portable, “al fresco” laboratory. Our analyses demonstrate that correctly identifying unknown reads from matches to a reference database with RTnS reads enables rapid and confident species identification. Individually annotated RTnS reads can be used to infer the evolutionary relationships of A. thaliana. Furthermore, hybrid genome assembly with RTnS and HTS reads substantially improved upon a genome assembled from HTS reads alone. Field-based RTnS makes real-time, rapid specimen identification and genome wide analyses possible.
The known genetic diversity of the hepaciviruses and pegiviruses has increased greatly in recent years through the discovery of viruses related to hepatitis C virus and human pegivirus in bats, bovines, equines, primates, and rodents. Analysis of these new species is important for research into animal models of hepatitis C virus infection and into the zoonotic origins of human viruses. Here, we provide the first systematic phylogenetic and evolutionary analysis of these two genera at the whole-genome level. Phylogenies confirmed that hepatitis C virus is most closely related to viruses from horses whereas human pegiviruses clustered with viruses from African primates. Within each genus, several well-supported lineages were identified and viral diversity was structured by both host species and location of sampling. Recombination analyses provided evidence of interspecific recombination in hepaciviruses, but none in the pegiviruses. Putative mosaic genome structures were identified in NS5B gene region and were supported by multiple tests. The identification of interspecific recombination in the hepaciviruses represents an important evolutionary event that could be clarified by future sampling of novel viruses. We also identified parallel amino acid changes shared by distantly related lineages that infect similar types of host. Notable parallel changes were clustered in the NS3 and NS4B genes and provide a useful starting point for experimental studies of the evolution of Hepacivirus host–virus interactions.
Canine parvovirus type 2 (CPV-2) is a severe enteric pathogen of dogs, causing high mortality in unvaccinated dogs. After emerging, CPV-2 spread rapidly worldwide. However, there is now some evidence to suggest that international transmission appears to be more restricted. In order to investigate the transmission and evolution of CPV-2 both nationally and in relation to the global situation, we have used a long-range PCR to amplify and sequence the full VP2 gene of 150 canine parvoviruses obtained from a large cross-sectional sample of dogs presenting with severe diarrhea to veterinarians in the United Kingdom, over a 2-year period. Among these 150 strains, 50 different DNA sequence types (S) were identified, and apart from one case, all appeared unique to the United Kingdom. Phylogenetic analysis provided clear evidence for spatial clustering at the international level and for the first time also at the national level, with the geographical range of some sequence types appearing to be highly restricted within the United Kingdom. Evolution of the VP2 gene in this data set was associated with a lack of positive selection. In addition, the majority of predicted amino acid sequences were identical to those found elsewhere in the world, suggesting that CPV VP2 has evolved a highly fit conformation. Based on typing systems using key amino acid mutations, 43% of viruses were CPV-2a, and 57% CPV-2b, with no type 2 or 2c found. However, phylogenetic analysis suggested complex antigenic evolution of this virus, with both type 2a and 2b viruses appearing polyphyletic. As such, typing based on specific amino acid mutations may not reflect the true epidemiology of this virus. The geographical restriction that we observed both within the United Kingdom and between the United Kingdom and other countries, together with the lack of CPV-2c in this population, strongly suggests the spread of CPV within its population may be heterogeneously subject to limiting factors. This cross-sectional study of national and global CPV phylogeographic segregation reveals a substantially more complex epidemic structure than previously described.Sequence analysis has revolutionized our knowledge of the spatial and temporal dynamics of infection, allowing a greater understanding of the evolution and molecular epidemiology of pathogens. This is particularly important for rapidly evolving pathogens, such as RNA viruses (e.g., feline calicivirus [9], foot and mouth disease virus [8], and influenza virus [33]) and also for certain single-stranded DNA viruses with high mutation rates, such as canine parvovirus (CPV) (37).CPV type 2 (CPV-2) consists of a 5.5-kb single-stranded linear DNA genome (5) encoding nonstructural (NS1 and -2) and capsid (VP1, -2, and -3) proteins at the 5Ј and 3Ј ends, respectively (1). The three structural proteins are derived from the same open reading frame (ORF) by proteolytic cleavage and alternate RNA splicing, with the full, infectious capsid consisting predominately of VP2 (34).The virus first emerged as a new causative ...
The impact of human-mediated environmental change on the evolutionary trajectories of wild organisms is poorly understood. In particular, species’ capacities to adapt rapidly (in hundreds of generations or less), reproducibly and predictably to extreme environmental change is unclear. Silene uniflora is predominantly a coastal species, but it has also colonised isolated, disused mines with phytotoxic, zinc-contaminated soils. To test whether rapid, parallel adaptation to anthropogenic pollution has taken place, we used reduced representation sequencing (ddRAD) to reconstruct the evolutionary history of geographically proximate mine and coastal population pairs and found largely independent colonisation of mines from different coastal sites. Furthermore, our results show that parallel evolution of zinc tolerance has occurred without gene flow spreading adaptive alleles between mine populations. In genomic regions where signatures of selection were detected across multiple mine-coast pairs, we identified genes with functions linked to physiological differences between the putative ecotypes, although genetic differentiation at specific loci is only partially shared between mine populations. Our results are consistent with a complex, polygenic genetic architecture underpinning rapid adaptation. This shows that even under a scenario of strong selection and rapid adaptation, evolutionary responses to human activities (and other environmental challenges) may be idiosyncratic at the genetic level and, therefore, difficult to predict from genomic data.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.