2021
DOI: 10.1038/s41587-021-01069-1
|View full text |Cite
|
Sign up to set email alerts
|

Ready-to-use public infrastructure for global SARS-CoV-2 monitoring

Abstract: To the Editor -The COVID-19 pandemic is the first health crisis characterized by large amounts of genomic data 1 . Computational infrastructure can be a bottleneck for data analysis, amplifying global inequalities in ability to track SARS-CoV-2 evolution. This is an issue even in developed countries, as computational infrastructure requires expertise in resource procurement, configuration and maintenance. Commercial computational clouds do not fully address the problem because these resources must still be con… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
18
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
7
1
1

Relationship

3
6

Authors

Journals

citations
Cited by 22 publications
(18 citation statements)
references
References 9 publications
0
18
0
Order By: Relevance
“…Global genomic databases for emergent variants have greatly improved since the onset of COVID-19 pandemics ( 31 ). Before COVID-19, influenza virus sequences are archived in GISAID.…”
Section: Discussionmentioning
confidence: 99%
“…Global genomic databases for emergent variants have greatly improved since the onset of COVID-19 pandemics ( 31 ). Before COVID-19, influenza virus sequences are archived in GISAID.…”
Section: Discussionmentioning
confidence: 99%
“… Intrapatient allelic variation seen at BA.1 amino acid mutation sites in a subset of SARS-CoV-2 raw sequencing data since March 2020 analyzed using a standardized variant calling pipeline ( Maier et al 2021 ). The areas of the circles indicate the proportions of raw sequence data sets (per 1,000 samples) where a mutation away from the Wuhan-Hu-1 consensus sequence was called.…”
Section: Resultsmentioning
confidence: 99%
“…Intrahost allelic variation seen at BA.1 amino acid mutation sites was analyzed in 282,788 annotated (i.e., with detailed associated metadata) publically available SARS-CoV-2 raw sequencing data sets from the UK, Greece, Estonia, Ireland, and South Africa between March 2020 and September 2021 all of which were processed and analyzed using the standardized variant calling pipeline described in Maier et al (2021) . All variant calling data for genomic sites where BA.1, 2, and 3 lineage-defining mutations occur were extracted from processed data sets available via ftp://xfer13.crg.eu/ and https://covid19.galaxyproject.org/genomics/global_platform/#processed-cog-uk-data can be explored using the observable notebook at https://observablehq.com/@spond/intrahost-dashboard .…”
Section: Methodsmentioning
confidence: 99%
“…Galaxy is traditionally accessed via a graphical interface in the web browser, and features such as Galaxy collections already provide a high level of parallelization to users of the graphical interface. Nonetheless, there are important scenarios in which a user might need to run individual workflows hundreds or thousands of times, in which the data cannot be grouped into collections ahead of time-for example, for variant calling of SARS-CoV-2 genomic data, in which a huge amount of new data is published continuously [28]. As a convenient alternative to the graphical interface, Planemo allows workflow execution to be scheduled programmatically using the `run` subcommand, either on a local machine or a larger Galaxy server.…”
Section: Executionmentioning
confidence: 99%