RNA viruses are proficient at switching to novel host species due to their fast mutation rates. Implicit in this assumption is the need to evolve adaptations in the new host species to exploit their cells efficiently. However, SARS-CoV-2 has required no significant adaptation to humans since the pandemic began, with no observed selective sweeps to date. Here we contrast the role of positive selection and recombination in the Sarbecoviruses in horseshoe bats to SARS-CoV-2 evolution in humans. While methods can detect some evidence for positive selection in SARS-CoV-2, we demonstrate these are mostly due to recombination and sequencing artefacts.Purifying selection is also substantially weaker in SARS-CoV-2 than in the related bat Sarbecoviruses. In comparison, our results show evidence for positive, specifically episodic selection, acting on the bat virus lineage SARS-CoV-2 emerged from. This signature of selection can also be observed among synonymous substitutions, for example, linked to ancestral CpG depletion on this bat lineage. We show the bat virus RmYN02 has recombinant CpG content in Spike pointing to coinfection and evolution in bats without involvement of other species. Our results suggest the non-human progenitor of SARS-CoV-2 was capable of humanhuman transmission as a consequence of its natural evolution in bats.
Main textIn December 2019, a novel coronavirus emerged in the city of Wuhan, China, causing coronavirus disease-2019 (COVID-19) characterised by respiratory or gastrointestinal viral symptoms, and in severe cases, additionally, acute respiratory distress syndrome, cardiovascular dysfunction, thrombosis and other symptoms 1 . Evolutionary analysis placed this new human virus in the same subgenus of Betacoronavirus, the Sarbecoviruses (Figure 1A), that SARS emerged from 2 , and it was named SARS-CoV-2 3 -the seventh known humaninfecting member of the Coronaviridae. The initial outbreak of human cases of the virus was connected to the Huanan Seafood Wholesale Market in Wuhan 4 , and while related viruses have been found in horseshoe bats 5 and pangolins 6 , their divergence represents decades of evolution 7 leaving the direct origin of the pandemic unknown. In addition to the importance of understanding the route from animals to humans, key questions for assessing future risk of emergence are: what is the extent of evolution required to permit a bat virus to transmit to humans, and what subsequent evolution needs to occur for efficient transmission once the virus is established within the human population?While both the first SARS virus outbreak in 2002/2003, causing approximately 8,000 infections, and a re-emergence in late 2003, causing four infections, were linked to Himalayan palm civets and raccoon dogs in marketplaces in Guangdong province 8,9 , it became clear that these animals were conduits for spillover to humans and not true viral reservoirs 10 . Extensive surveillance work subsequently identified related viruses circulating in horseshoe bats in China some of which can replicate rea...