2020
DOI: 10.1101/2020.05.03.074567
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Large scale genomic analysis of 3067 SARS-CoV-2 genomes reveals a clonal geo-distribution and a rich genetic variations of hotspots mutations

Abstract: Large scale genomic analysis of 3067 SARS-1 CoV-2 genomes reveals a clonal geo-distribution 2 and a rich genetic variations of hotspots 3 mutations 4 Abstract 33In late December 2019, an emerging viral infection COVID-19 was identified in Wuhan, 34China, and became a global pandemic. Characterization of the genetic variants of SARS-35CoV-2 is crucial in following and evaluating it spread across countries. In this study, we 36 collected and analyzed 3,067 SARS-CoV-2 genomes isolated from 55 countries during the… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

5
72
1

Year Published

2020
2020
2021
2021

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 53 publications
(78 citation statements)
references
References 37 publications
5
72
1
Order By: Relevance
“…In addition to their influence on phylogenetic inference, recurrent systematic errors can also lead to erroneous inferences about viral mutation processes, recombination and selection. For example, artefactual biases in mutational processes could confound signatures of mutational hotspots [28][29][30][31][32][33] . The issue of whether or not recombination has occurred during the outbreak is critical to the immunological battle against the virus and is under intense debate [6,[34][35][36][37][38][39][40] .…”
Section: Figure 1: Effect Of Recurrent Sequencing Mutations On Phylogmentioning
confidence: 99%
See 1 more Smart Citation
“…In addition to their influence on phylogenetic inference, recurrent systematic errors can also lead to erroneous inferences about viral mutation processes, recombination and selection. For example, artefactual biases in mutational processes could confound signatures of mutational hotspots [28][29][30][31][32][33] . The issue of whether or not recombination has occurred during the outbreak is critical to the immunological battle against the virus and is under intense debate [6,[34][35][36][37][38][39][40] .…”
Section: Figure 1: Effect Of Recurrent Sequencing Mutations On Phylogmentioning
confidence: 99%
“…Because many tests of recombination assume that all mutations can only occur once at each site, recurrent mutation and systematic errors can confound signatures of recombination [6,26,35] . Finally, recurrent mutations have been identified as a possible signatures of elevated mutation rates and natural selection in SARS-CoV-2 [8,13,[24][25][26]29,33,35,41] , but some of these apparent instances of selection may be due systematic errors in the sequences. Confusion about recurrent mutations and recombination affects our understanding of host response and influences our decisions about which viral molecular processes or specific immune epitopes we might want to target in vaccine development.…”
Section: Figure 1: Effect Of Recurrent Sequencing Mutations On Phylogmentioning
confidence: 99%
“…These polyproteins are cleaved into five structural structural proteins, including spike protein (S), membrane protein (M), envelope protein (E), nucleocapsid protein (N) and 26 non-structural proteins. There are also nine putative ORFs (ORF3a, ORF3b, ORF4, ORF5, ORF6, ORF7a, ORF7b, ORF8 and ORF10) predicted as hypothetical proteins [1].Characterization of viral mutations can provide valuable information for assessing the mechanisms linked to pathogenesis, immune evasion and viral drug resistance. In addition, viral mutation studies can be crucial for the design of new vaccines, antiviral drugs and diagnostic tests.…”
mentioning
confidence: 99%
“…These polyproteins are cleaved into five structural structural proteins, including spike protein (S), membrane protein (M), envelope protein (E), nucleocapsid protein (N) and 26 non-structural proteins. There are also nine putative ORFs (ORF3a, ORF3b, ORF4, ORF5, ORF6, ORF7a, ORF7b, ORF8 and ORF10) predicted as hypothetical proteins [1].…”
mentioning
confidence: 99%
“…Other key variants observed in present report includes 25563G>T (Q57H) in ORF3a, along with consecutive series of three variants at position 28881 (G>A), 28882 (G>A) and 28883(G>C) in N proteins. The triple site mutation 28881-28883 that brings change in two amino acid 203-204:RG>KR, is known to play a critical role in virion assembly and structure and had been abundantly seen in US strain previously [14], but also observed recently in Spain, Greece, Vietnam and South America, Mexico, Australia, New Zealand, Belgium, Brazil and Peru [15,16]. According to GISAID repository, these mutations in N protein were rst described in isolates from Netherlands (EPI_ISL_413565) with travel history to Italy, whereas the same mutation was detected in one Italian sample from Abruzzo region (EPI_ISL_436718).…”
Section: Frequent Mutations In Turkish Isolatesmentioning
confidence: 99%