SARS-CoV-2 is a SARS-like coronavirus of likely zoonotic origin first identified in December 2019 in Wuhan, the capital of China's Hubei province. The virus has since spread globally, resulting in the currently ongoing COVID-19 pandemic. The first whole genome sequence was published on January 5 2020, and thousands of genomes have been sequenced since this date. This resource allows unprecedented insights into the past demography of SARS-CoV-2 but also monitoring of how the virus is adapting to its novel human host, providing information to direct drug and vaccine design. We curated a dataset of 7666 public genome assemblies and analysed the emergence of genomic diversity over time. Our results are in line with previous estimates and point to all sequences sharing a common ancestor towards the end of 2019, supporting this as the period when SARS-CoV-2 jumped into its human host. Due to extensive transmission, the genetic diversity of the virus in several countries recapitulates a large fraction of its worldwide genetic diversity. We identify regions of the SARS-CoV-2 genome that have remained largely invariant to date, and others that have already accumulated diversity. By focusing on mutations which have emerged independently multiple times (homoplasies), we identify 198 filtered recurrent mutations in the SARS-CoV-2 genome. Nearly 80% of the recurrent mutations produced non-synonymous changes at the protein level, suggesting possible ongoing adaptation of SARS-CoV-2. Three sites in Orf1ab in the regions encoding Nsp6, Nsp11, Nsp13, and one in the Spike protein are characterised by a particularly large number of recurrent mutations (>15 events) which may signpost convergent evolution and are of particular interest in the context of adaptation of SARS-CoV-2 to the human host. We additionally provide an interactive user-friendly web-application to query the alignment of the 7666 SARS-CoV-2 genomes.
Individuals with potential exposure to severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) do not necessarily develop PCR or antibody positivity, suggesting that some individuals may clear subclinical infection before seroconversion. T cells can contribute to the rapid clearance of SARS-CoV-2 and other coronavirus infections1–3. Here we hypothesize that pre-existing memory T cell responses, with cross-protective potential against SARS-CoV-2 (refs. 4–11), would expand in vivo to support rapid viral control, aborting infection. We measured SARS-CoV-2-reactive T cells, including those against the early transcribed replication–transcription complex (RTC)12,13, in intensively monitored healthcare workers (HCWs) who tested repeatedly negative according to PCR, antibody binding and neutralization assays (seronegative HCWs (SN-HCWs)). SN-HCWs had stronger, more multispecific memory T cells compared with a cohort of unexposed individuals from before the pandemic (prepandemic cohort), and these cells were more frequently directed against the RTC than the structural-protein-dominated responses observed after detectable infection (matched concurrent cohort). SN-HCWs with the strongest RTC-specific T cells had an increase in IFI27, a robust early innate signature of SARS-CoV-2 (ref. 14), suggesting abortive infection. RNA polymerase within RTC was the largest region of high sequence conservation across human seasonal coronaviruses (HCoV) and SARS-CoV-2 clades. RNA polymerase was preferentially targeted (among the regions tested) by T cells from prepandemic cohorts and SN-HCWs. RTC-epitope-specific T cells that cross-recognized HCoV variants were identified in SN-HCWs. Enriched pre-existing RNA-polymerase-specific T cells expanded in vivo to preferentially accumulate in the memory response after putative abortive compared to overt SARS-CoV-2 infection. Our data highlight RTC-specific T cells as targets for vaccines against endemic and emerging Coronaviridae.
COVID-19 is caused by the coronavirus SARS-CoV-2, which jumped into the human population in late 2019 from a currently uncharacterised animal reservoir. Due to this recent association with humans, SARS-CoV-2 may not yet be fully adapted to its human host. This has led to speculations that SARS-CoV-2 may be evolving towards higher transmissibility. The most plausible mutations under putative natural selection are those which have emerged repeatedly and independently (homoplasies). Here, we formally test whether any homoplasies observed in SARS-CoV-2 to date are significantly associated with increased viral transmission. To do so, we develop a phylogenetic index to quantify the relative number of descendants in sister clades with and without a specific allele. We apply this index to a curated set of recurrent mutations identified within a dataset of 46,723 SARS-CoV-2 genomes isolated from patients worldwide. We do not identify a single recurrent mutation in this set convincingly associated with increased viral transmission. Instead, recurrent mutations currently in circulation appear to be evolutionary neutral and primarily induced by the human immune system via RNA editing, rather than being signatures of adaptation. At this stage we find no evidence for significantly more transmissible lineages of SARS-CoV-2 due to recurrent mutations.
SARS-CoV-2, the causative agent of the COVID-19 pandemic, can infect a wide range of mammals. Since its spread in humans, secondary host jumps of SARS-CoV-2 from humans to multiple domestic and wild populations of mammals have been documented. Understanding the extent of adaptation to these animal hosts is critical for assessing the threat that the spillback of animal-adapted SARS-CoV-2 into humans poses. We compare the genomic landscapes of SARS-CoV-2 isolated from animal species to that in humans, profiling the mutational biases indicative of potentially different selective pressures in animals. We focus on viral genomes isolated from mink (Neovison vison) and white-tailed deer (Odocoileus virginianus) for which multiple independent outbreaks driven by onward animal-to-animal transmission have been reported. We identify five candidate mutations for animal-specific adaptation in mink (NSP9_G37E, Spike_F486L, Spike_N501T, Spike_Y453F, ORF3a_L219V), and one in deer (NSP3a_L1035F), though they appear to confer a minimal advantage for human-to-human transmission. No considerable changes to the mutation rate or evolutionary trajectory of SARS-CoV-2 has resulted from circulation in mink and deer thus far. Our findings suggest that minimal adaptation was required for onward transmission in mink and deer following human-to-animal spillover, highlighting the ‘generalist’ nature of SARS-CoV-2 as a mammalian pathogen.
Severe acute respiratory coronavirus 2 (SARS-CoV-2), the agent of the ongoing COVID-19 pandemic, jumped into humans from an unknown animal reservoir in late 2019. In line with other coronaviruses, SARS-CoV-2 has the potential to infect a broad range of hosts. SARS-CoV-2 genomes have now been isolated from cats, dogs, lions, tigers and minks. SARS-CoV-2 seems to transmit particularly well in mink farms with outbreaks reported in Spain, Sweden, the Netherlands, Italy, the USA and Denmark. Genomic data from SARS-CoV-2 isolated from infected minks provides a natural case study of a secondary host jump of the virus, in this case from humans to animals, and occasionally back again. We screened published SARS-CoV-2 genomes isolated from minks for the presence of recurrent mutations common in mink but infrequent in SARS-CoV-2 genomes isolated from human infections. We identify 23 recurrent mutations including three nonsynonymous mutations in the Receptor Binding Domain of the SARS-CoV-2 spike protein that independently emerged at least four times but are only rarely observed in human lineages. The repeat emergence of mutations across phylogenetically distinct lineages of the virus isolated from minks points to ongoing adaptation of SARS-CoV-2 to a new host. The rapid acquisition and spread of SARS-CoV-2 mutations in minks suggests that if a similar phenomenon of host adaptation had occurred upon its jump into humans, those human-specific mutations would likely have reached fixation already before the first SARS-CoV-2 genomes were generated.Data SummaryAll genome assemblies considered in this manuscript are openly available on registration with GISAID (https://www.gisaid.org). Information on the included assemblies, including the accessions used in the global analysis are provided in Tables S1-S2.
Individuals with likely exposure to the highly infectious SARS-CoV-2 do not necessarily develop PCR or antibody positivity, suggesting some may clear sub-clinical infection before seroconversion. T cells can contribute to the rapid clearance of SARS-CoV-2 and other coronavirus infections1–5. We hypothesised that pre-existing memory T cell responses, with cross-protective potential against SARS-CoV-26–12, would expand in vivo to mediate rapid viral control, potentially aborting infection. We studied T cells against the replication transcription complex (RTC) of SARS-CoV-2 since this is transcribed first in the viral life cycle13–15and should be highly conserved. We measured SARS-CoV-2-reactive T cells in a cohort of intensively monitored healthcare workers (HCW) who remained repeatedly negative by PCR, antibody binding, and neutralisation for SARS-CoV-2 (exposed seronegative, ES). 16-weeks post-recruitment, ES had memory T cells that were stronger and more multispecific than an unexposed pre-pandemic cohort, and more frequently directed against the RTC than the structural protein-dominated responses seen post-detectable infection (matched concurrent cohort). The postulate that HCW with the strongest RTC-specific T cells had an abortive infection was supported by a low-level increase in IFI27 transcript, a robust early innate signature of SARS-CoV-2 infection16. We showed that the RNA-polymerase within RTC was the largest region of high sequence conservation across human seasonal coronaviruses (HCoV) and was preferentially targeted by T cells from UK and Singapore pre-pandemic cohorts and from ES. RTC epitope-specific T cells capable of cross-recognising HCoV variants were identified in ES. Longitudinal samples from ES and an additional validation cohort, showed pre-existing RNA-polymerase-specific T cells expanded in vivo following SARS-CoV-2 exposure, becoming enriched in the memory response of those with abortive compared to overt infection. In summary, we provide evidence of abortive seronegative SARS-CoV-2 infection with expansion of cross-reactive RTC-specific T cells, highlighting these highly conserved proteins as targets for future vaccines against endemic and emerging Coronaviridae.
SARS-CoV-2, the agent of the COVID-19 pandemic, emerged in late 2019 in China, and rapidly spread throughout the world to reach all continents. As the virus expanded in its novel human host, viral lineages diversified through the accumulation of around two mutations a month on average. Different viral lineages have replaced each other since the start of the pandemic, with the most successful Alpha, Delta and Omicron Variants of Concern (VoCs) sequentially sweeping through the world to reach high global prevalence. Neither Alpha nor Delta was characterised by strong immune escape, with their success coming mainly from their higher transmissibility. Omicron is far more prone to immune evasion and spread primarily due to its increased ability to (re-)infect hosts with prior immunity. As host immunity reaches high levels globally through vaccination and prior infection, the epidemic is expected to transition from a pandemic regime to an endemic one where seasonality and waning host immunisation are anticipated to become the primary forces shaping future SARS-CoV-2 lineage dynamics. In this review, we consider a body of evidence on the origins, host tropism, epidemiology, genomic and immunogenetic evolution of SARS-CoV-2 including an assessment of other coronaviruses infecting humans. Considering what is known so far, we conclude by delineating scenarios for the future dynamic of SARS-CoV-2, ranging from the good—circulation of a fifth endemic ‘common cold’ coronavirus of potentially low virulence, the bad—a situation roughly comparable to seasonal flu, and the ugly—extensive diversification into serotypes with long-term high-level endemicity. Lay Summary SARS-CoV-2, the agent of the COVID-19 pandemic, emerged in late 2019 and spread globally causing the COVID-19 pandemic. When SARS-CoV-2 jumped into humans, it had essentially no genetic diversity, but as the viral population expanded, it diversified into myriad genetically distinct lineages through the acumulation of approximately two mutations a month. The vast majority of those lineages are now extinct. The most successful lineages so far were the Alpha, Delta and Omicron variants which all reached global prevalence and replaced each other in the process. Alpha and Delta emerged in late 2020 and spread at a time when immunisation of the human population was still low and reached global prevalence primarily by being more transmissible. Omicron started expanding in late 2021, at a time when a large proportion of the human population had already been vaccinated and/or exposed to SARS-CoV-2. Omicron was so successful mainly due to its ability to (re-)infect hosts with prior immunity. In this review, we provide an overview of the evolution of SARS-CoV-2 and put it in context of what is known about other coronaviruses infecting humans. We conclude by delineating plausible scenarios for the ‘endemic’ future of SARS-CoV-2.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.