2021
DOI: 10.1101/2021.02.14.431150
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Fast and Accurate Distance-based Phylogenetic Placement using Divide and Conquer

Abstract: Phylogenetic placement of query samples on an existing phylogeny is increasingly used in microbiome analyses and other biological studies. As the size of available reference trees used in microbiome analyses continues to grow, there is a growing need for methods that place sequences on ultra-large trees with high accuracy.In this paper, we build on our previous work and introduce APPLES-2, a distance-based phylogenetic placement method. In extensive studies, we show that APPLES-2 is more accurate and much mor… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
24
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 7 publications
(24 citation statements)
references
References 70 publications
(144 reference statements)
0
24
0
Order By: Relevance
“…We use the published FastTree-2 results (tree and numeric parameters estimated under Minimum Evolution) on the nt78 datasets [21]. The scripts to calculate delta error was adapted from a published script [1], and borrow utilities from newick_utils. Further details are provided in the published scripts here.…”
Section: A Summarymentioning
confidence: 99%
See 2 more Smart Citations
“…We use the published FastTree-2 results (tree and numeric parameters estimated under Minimum Evolution) on the nt78 datasets [21]. The scripts to calculate delta error was adapted from a published script [1], and borrow utilities from newick_utils. Further details are provided in the published scripts here.…”
Section: A Summarymentioning
confidence: 99%
“…The most accurate of the current phylogenetic placement methods operate using maximum likelihood, and so require the query sequences to already be aligned to the backbone sequences; examples of such methods include pplacer [9] and EPA-ng [3]. Other types of methods have been developed for phylogenetic placement that are not based on maximum likelihood; a recent example is APPLES-2 [1], which uses branch lengths and distances computed between the query sequence and the leaves in the backbone tree to place each query sequence. APPLES-2 is much faster than pplacer and EPA-ng and has lower memory requirements, but is not as accurate as pplacer.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Other phylogenetic placement methods have been developed that seek to improve scalability to larger trees or reduce running time (e.g., UShER (Turakhia et al, 2021), EPA-ng (Barbera et al, 2019), APPLES (Balaban et al, 2020), and APPLES-2 (Balaban et al, 2021)). EPA-ng is likelihood-based and has been optimized for "batch processing" of query sequences (so that the cost of performing phylogenetic placement of a large number of query sequences is much less than the cost of placing them one-by-one).…”
Section: Adding Sequences To Gene Treesmentioning
confidence: 99%
“…EPA-ng has slightly reduced accuracy compared to pplacer. APPLES is a very fast distance-based method; recent studies (Balaban et al, 2020(Balaban et al, , 2021Wedell et al, 2021) showed that APPLES can run on trees with 200,000 leaves and is much faster than both pplacer and EPA-ng. APPLES-2 is an improvement on APPLES with respect to accuracy and running time, and also scales to at least 200,000 sequences.…”
Section: Adding Sequences To Gene Treesmentioning
confidence: 99%