2021
DOI: 10.1038/s41467-021-22073-8
|View full text |Cite
|
Sign up to set email alerts
|

Harnessing machine learning to guide phylogenetic-tree search algorithms

Abstract: Inferring a phylogenetic tree is a fundamental challenge in evolutionary studies. Current paradigms for phylogenetic tree reconstruction rely on performing costly likelihood optimizations. With the aim of making tree inference feasible for problems involving more than a handful of sequences, inference under the maximum-likelihood paradigm integrates heuristic approaches to evaluate only a subset of all potential trees. Consequently, existing methods suffer from the known tradeoff between accuracy and running t… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
22
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
7
1
1
1

Relationship

1
9

Authors

Journals

citations
Cited by 28 publications
(23 citation statements)
references
References 53 publications
1
22
0
Order By: Relevance
“…These works reveal that MSAs contain crucial underlying relationships that couple sequences with numerous plausible conformations ( Wang and Barth, 2015 ), which could be leveraged for expanded functional capacity design. A sequence generation approach that also harnesses MSA data could recognise the divergence of structural states from phylogenetic trees of extensive protein families constructed using existing modern methods ( Azouri et al, 2021 ). Here, the goal would be to learn the general motif changes in key sites that lead to the adoption of multiple states, and exploit that in design.…”
Section: Accounting For Flexibility In Deep Learning Protein Designmentioning
confidence: 99%
“…These works reveal that MSAs contain crucial underlying relationships that couple sequences with numerous plausible conformations ( Wang and Barth, 2015 ), which could be leveraged for expanded functional capacity design. A sequence generation approach that also harnesses MSA data could recognise the divergence of structural states from phylogenetic trees of extensive protein families constructed using existing modern methods ( Azouri et al, 2021 ). Here, the goal would be to learn the general motif changes in key sites that lead to the adoption of multiple states, and exploit that in design.…”
Section: Accounting For Flexibility In Deep Learning Protein Designmentioning
confidence: 99%
“…Alternatively, more complex Bayesian methods and mixture models may be used where the associated probabilities for events may vary for different sequence regions. The topology of a phylogenetic tree may be initialized randomly from the MSA and is often inferred by hierarchical clustering with a maximum likelihood estimator (MLE) [401]. In practice, phylogenetic tree methods are central to evolutionary biology [53] and understanding the evolutionary dynamics of clonal populations in cancer [402] (among a multitude of other applications), the latter of which has significant clinical relevance to the targeting of precision therapeutics and characterization of treatment resistance.…”
Section: Bioinformaticsmentioning
confidence: 99%
“…Furthermore, it was previously shown that it is beneficial not to compute the likelihood of neighboring trees whose estimated sum of branch lengths highly deviates from that of the current tree ( Hordijk and Gascuel, 2005 ). We have recently shown that machine-learning algorithms can be efficiently utilized to accurately rank neighboring trees without computing their likelihood, thus potentially increasing the computational efficiency of tree inference ( Azouri et al , 2021 ).…”
Section: Introductionmentioning
confidence: 99%