2020
DOI: 10.1101/2020.06.09.082834
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

SARS-CoV-2 sequence typing, evolution and signatures of selection using CoVa, a Python-based command-line utility

Abstract: The current global pandemic COVID-19, caused by SARS-CoV-2, has resulted in millions of infections worldwide in a few months. Global efforts to tackle this situation have produced a tremendous body of genomic data, which can be used for tracing transmission routes, characterization of isolates, and monitoring variants with potential for unusual virulence. Several groups have analyzed these genomes using different approaches. However, as new data become available, the research community needs a pipeline to perf… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
2
1

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 40 publications
0
2
0
Order By: Relevance
“…Multiple sequences alignments were performed using the progressive method (FFT-NS-2) implemented in MAFFT (version 7.4) (Katoh et al, 2002). The whole genome mutation analysis was carried out used the pipeline provided by CoVa (version 0.2) software (Ali et al 2020). For substitution analysis, we first deleted the gap generated in the multi-sequence alignments based on the ref NC_045512.2 and then used the CoVa pipeline for substitution calculation (Figure S1).…”
Section: Sequence Processingmentioning
confidence: 99%
“…Multiple sequences alignments were performed using the progressive method (FFT-NS-2) implemented in MAFFT (version 7.4) (Katoh et al, 2002). The whole genome mutation analysis was carried out used the pipeline provided by CoVa (version 0.2) software (Ali et al 2020). For substitution analysis, we first deleted the gap generated in the multi-sequence alignments based on the ref NC_045512.2 and then used the CoVa pipeline for substitution calculation (Figure S1).…”
Section: Sequence Processingmentioning
confidence: 99%
“…To understand temporal and geographic patterns in the development of various genetic variants across India, we used a set of ~1,700 genomes combining those sequences by this consortium and others available in GISAID. We first defined sequence types based on the nucleotide sequence of 10 sites as defined by Guan et al (16), and numbered according to the date of their first emergence globally, as described earlier for the CoVa pipeline (17). The most predominant sequence type among Indian samples was ST4 ( Figure 13) which is characterized by two non-synonymous and 1 synonymous mutations (S D614G + RdRp P323L + nsp3 F106F).…”
Section: Mutation Analysismentioning
confidence: 99%