2021
DOI: 10.1101/2021.12.03.470766
Preprint

Pandemic-scale phylogenetics

Abstract: Phylogenetics has been central to the genomic surveillance, epidemiology and contact tracing efforts during the COVID-19 pandemic. But the massive scale of genomic sequencing has rendered the pre-pandemic tools inadequate for comprehensive phylogenetic analyses. Here, we discuss the phylogenetic package that we developed to address the needs imposed by this pandemic. The package incorporates several pandemic-specific optimization and parallelization techniques and comprises four programs: UShER, matOptimize, RI…

Cited by 6 publications (6 citation statements)
References: 48 publications

“…Using our method, we traced transmission clusters in 102 countries from across the world (Figure 2A) using the global parsimony phylogenetic tree, built from 5,563,847 available sequences on GISAID 21, GenBank 19, and COG-UK 25 on 11-28-2021 (see Methods). Cluster size is highly skewed (Figure 2C), with approximately 20% of distinct regional clusters containing 89% of samples.…”
Section: Results (mentioning)
confidence: 99%
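
The skew described in the statement above (a small fraction of clusters holding most of the samples) can be measured with a short calculation. The following is a minimal sketch, not the cited authors' method: the function name `top_share` and the toy cluster sizes are hypothetical and chosen only for illustration.

```python
import math
from typing import Sequence


def top_share(cluster_sizes: Sequence[int], top_fraction: float = 0.2) -> float:
    """Return the fraction of all samples held by the largest clusters.

    `top_fraction` is the share of clusters (by count) treated as "top".
    """
    if not cluster_sizes:
        raise ValueError("cluster_sizes must not be empty")
    ordered = sorted(cluster_sizes, reverse=True)
    # Always include at least one cluster, rounding the cutoff up.
    n_top = max(1, math.ceil(top_fraction * len(ordered)))
    return sum(ordered[:n_top]) / sum(ordered)


# Hypothetical toy data: ten clusters, 920 samples in total.
sizes = [500, 300, 40, 30, 20, 10, 10, 5, 3, 2]
share = top_share(sizes)  # the two largest clusters hold 800 of 920 samples
print(f"Top 20% of clusters hold {share:.1%} of samples")
```

Applied to real cluster assignments, a value near 0.89 for `top_fraction=0.2` would correspond to the figure quoted above.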
“…At UCSC we maintain a large phylogeny of all GISAID 21, GenBank 19, and COG-UK 25 sequences using the script https://github.com/ucscGenomeBrowser/kent/blob/master/src/hg/utils/otto/sarscov2phylo/updatePublic.sh and the UShER online phylogenetics suite 12,21. Updates are performed daily by obtaining all newly uploaded sequences from each database and placing them on the previous day’s global phylogenetic tree with UShER (see McBroome et al) 13. Starting with our phylogeny updated on 11-28-2021, we pruned all samples with long branch lengths and path lengths using the matUtils parameters --max-branch-length 45 and --max-path-length 100 and performed a round of optimization with an SPR radius of 8.…”
Section: Methods (mentioning)
confidence: 99%
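
To make the pruning step in the statement above concrete, here is a minimal sketch on a toy tree. It is not matUtils code and does not call matUtils. It assumes one plausible reading of the two thresholds: a sample is dropped if any branch on its root-to-sample path exceeds the branch limit, or if the summed branch lengths exceed the path limit. The function `filter_samples`, the tree encoding, and the toy values are all hypothetical.

```python
from typing import Dict, List, Optional, Tuple

# Hypothetical encoding: node -> (parent, mutations on the branch leading to node).
Tree = Dict[str, Tuple[Optional[str], int]]


def filter_samples(
    tree: Tree,
    samples: List[str],
    max_branch_length: int,
    max_path_length: int,
) -> List[str]:
    """Keep samples whose root-to-sample path passes both thresholds."""
    kept = []
    for sample in samples:
        path_length = 0
        longest_branch = 0
        node: Optional[str] = sample
        # Walk from the sample up to the root, accumulating branch lengths.
        while node is not None:
            parent, branch = tree[node]
            path_length += branch
            longest_branch = max(longest_branch, branch)
            node = parent
        if longest_branch <= max_branch_length and path_length <= max_path_length:
            kept.append(sample)
    return kept


# Toy tree (assumed values, for illustration only).
toy_tree: Tree = {
    "root": (None, 0),
    "A": ("root", 30),
    "s1": ("A", 5),    # path 35, longest branch 30
    "s2": ("A", 50),   # longest branch 50 exceeds 45
    "B": ("root", 40),
    "C": ("B", 40),
    "s3": ("C", 30),   # path 110 exceeds 100
}

# Thresholds taken from the quoted statement: 45 and 100.
kept = filter_samples(toy_tree, ["s1", "s2", "s3"], 45, 100)
print(kept)
```

The exact semantics of the real matUtils options may differ from this reading, so the tool's own documentation should be checked before relying on the sketch.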
“…A phylogenetic representation of this database is believed to be the largest ever constructed (Turakhia, Thornlow, Hinrichs, De Maio, et al 2021). Existing phylogenetic methods, which were developed and tested on datasets orders of magnitude smaller, are inadequate for pandemic-scale analysis, resulting in missed opportunities to improve our surveillance and response capabilities (Hodcroft et al 2021; Morel et al 2021; Ye et al 2021).…”
Section: Introduction (mentioning)
confidence: 99%
“…A phylogenetic representation of this database is believed to be the largest ever constructed (Turakhia et al, 2021a). Existing phylogenetic methods, which were developed and tested on datasets orders of magnitude smaller, are inadequate for pandemic-scale analysis, resulting in missed opportunities to improve our surveillance and response capabilities (Hodcroft et al, 2021; Ye et al, 2021; Morel et al, 2021).…”
Section: Introduction (mentioning)
confidence: 99%