The rapid accumulation of mutations in the SARS-CoV-2 Omicron variant that enabled its outbreak raises questions as to whether its proximal origin occurred in humans or another mammalian host. Here, we identified 45 point mutations that Omicron acquired since divergence from the B.1.1 lineage. We found that the Omicron spike protein sequence was subjected to stronger positive selection than that of any reported SARS-CoV-2 variants known to evolve persistently in human hosts, suggesting a possibility of host-jumping. The molecular spectrum of mutations ( i.e. , the relative frequency of the 12 types of base substitutions) acquired by the progenitor of Omicron was significantly different from the spectrum for viruses that evolved in human patients, but resembled the spectra associated with virus evolution in a mouse cellular environment. Furthermore, mutations in the Omicron spike protein significantly overlapped with SARS-CoV-2 mutations known to promote adaptation to mouse hosts, particularly through enhanced spike protein binding affinity for the mouse cell entry receptor. Collectively, our results suggest that the progenitor of Omicron jumped from humans to mice, rapidly accumulated mutations conducive to infecting that host, then jumped back into humans, indicating an inter-species evolutionary trajectory for the Omicron outbreak.
The rapid accumulation of mutations in the SARS-CoV-2 Omicron variant that enabled its outbreak raises questions as to whether its proximal origin occurred in humans or another mammalian host. Here, we identified 45 point mutations that Omicron acquired since divergence from the B.1.1 lineage. We found that the Omicron spike protein sequence was subjected to stronger positive selection than that of any reported SARS-CoV-2 variants known to evolve persistently in human hosts, suggesting the possibility of host-jumping. The molecular spectrum (i.e., the relative frequency of the twelve types of base substitutions) of mutations acquired by the progenitor of Omicron was significantly different from the spectrum for viruses that evolved in human patients, but was highly consistent with spectra associated with evolution in a mouse cellular environment. Furthermore, mutations in the Omicron spike protein significantly overlapped with SARS-CoV-2 mutations known to promote adaptation to mouse hosts, particularly through enhanced spike protein binding affinity for the mouse cell entry receptor. Collectively, our results suggest that the progenitor of Omicron jumped from humans to mice, rapidly accumulated mutations conducive to infecting that host, then jumped back into humans, indicating an inter-species evolutionary trajectory for the Omicron outbreak.
The recurrent coronavirus outbreaks in China (SARS-CoV and its relative, SARS-CoV-2) have raised speculations that perhaps Asians are somehow more susceptible to these coronaviruses. Here, we test this possibility based on an analysis of the lung-specific expression of ACE2, which encodes the known cell-entry receptor of both SARS-CoV and SARS-CoV-2. We show that ACE2 expression is not affected during tumorigenesis, supporting that the abundant transcriptomes in cancer genomic studies can be informatively used to study ACE2 expression among diverse individuals without cancer. We find that ACE2 expression in the lung increases with age, but is not associated with sex. Further, Asians do not differ from other populations for ACE2 expression and do not harbor unique genetic polymorphisms in the ACE2 locus. Thus, beyond illustrating an innovative method for assessing the potential impacts of demographic factors for non-cancer diseases from large-scale cancer sample datasets, our statistically robust findings emphasize that individuals of all races require the same level of personal protection against SARS-CoV-2.
The capacity of RNA viruses to adapt to new hosts and rapidly escape the host immune system is largely attributable to de novo genetic diversity that emerges through mutations in RNA. Although the molecular spectrum of de novo mutations—the relative rates at which various base substitutions occur—are widely recognized as informative towards understanding the evolution of a viral genome, little attention has been paid to the possibility of using molecular spectra to infer the host origins of a virus. Here, we characterize the molecular spectrum of de novo mutations for SARS-CoV-2 from transcriptomic data obtained from virus-infected cell lines, enabled by the use of sporadic junctions formed during discontinuous transcription as molecular barcodes. We find that de novo mutations are generated in a replication-independent manner, typically on the genomic strand, and highly dependent on mutagenic mechanisms specific to the host cellular environment. De novo mutations will then strongly influence the types of base substitutions accumulated during SARS-CoV-2 evolution, in an asymmetric manner favoring specific mutation types. Consequently, similarities between the mutation spectra of SARS-CoV-2 and the bat coronavirus RaTG13 which have accumulated since their divergence strongly suggest that SARS-CoV-2 evolved in a host cellular environment highly similar to that of bats before its zoonotic transfer into humans. Collectively, our findings provide data-driven support for the natural origin of SARS-CoV-2.
The spread of Coronavirus Disease 2019 (COVID-19), caused by the SARS-CoV-2 coronavirus, has progressed into a global pandemic. To date, thousands of genetic variants have been identified across SARS-CoV-2 isolates from patients. Sequence analysis reveals that the codon usage of viral sequences decreased over time but fluctuated from time to time. In this study, through evolution modeling, we found that this phenomenon might result from the virus' preference for mutations during transmission. Using dual luciferase assays, we further discovered that the deoptimization of codons on viruses might weaken protein expression during the virus evolution, indicating that the choice of codon usage might play important role in virus fitness. Finally, given the importance of codon usage in protein expression and particularly for mRNA vaccine, we designed several omicron BA.2.12.1 and BA.4/5 spike mRNA vaccine candidates based on codon optimization, and experimentally validated their high levels of expression. Our study highlights the importance of codon usage in virus evolution and mRNA vaccine development.
The spread of coronavirus disease 2019 (COVID‐19), caused by severe respiratory syndrome coronavirus 2 (SARS‐CoV‐2), has progressed into a global pandemic. To date, thousands of genetic variants have been identified among SARS‐CoV‐2 isolates collected from patients. Sequence analysis reveals that the codon adaptation index (CAI) values of viral sequences have decreased over time but with occasional fluctuations. Through evolution modeling, it is found that this phenomenon may result from the virus's mutation preference during transmission. Using dual‐luciferase assays, it is further discovered that the deoptimization of codons in the viral sequence may weaken protein expression during virus evolution, indicating that codon usage may play an important role in virus fitness. Finally, given the importance of codon usage in protein expression and particularly for mRNA vaccines, it is designed several codon‐optimized Omicron BA.2.12.1, BA.4/5, and XBB.1.5 spike mRNA vaccine candidates and experimentally validated their high levels of expression. This study highlights the importance of codon usage in virus evolution and provides guidelines for codon optimization in mRNA and DNA vaccine development.
The monkeypox virus (mpox virus, MPXV) epidemic in 2022 poses a significant public health risk. However, the evolutionary principles of MPXV are largely unknown. Here, we examined the evolutionary patterns of protein sequences and codon usage in MPXV. We first showed the signal of positive selection in OPG027 specifically in Clade I. We then discovered accelerated protein sequence evolution over time in 2022 outbreak-causing variants. We also found strong epistasis between amino acid substitutions located in different genes. Codon adaptation index (CAI) analysis revealed that MPXV tended to use more unpreferred codons than human genes and that the CAI decreased with time and diverged between clades, with Clade I > IIa and IIb-A > IIb-B. Although the decrease in fatality rate in the three groups matched the CAI pattern, it is unclear whether this is a coincidence or if the deoptimization of codon usage in MPXV induced a decrease in fatality. This study sheds new light on the processes influencing the evolution of MPXV in human populations.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.