2008
DOI: 10.1109/tcbb.2008.26
|View full text |Cite
|
Sign up to set email alerts
|

Mixed Integer Linear Programming for Maximum-Parsimony Phylogeny Inference

Abstract: Reconstruction of phylogenetic trees is a fundamental problem in computational biology. While excellent heuristic methods are available for many variants of this problem, new advances in phylogeny inference will be required if we are to be able to continue to make effective use of the rapidly growing stores of variation data now being gathered. In this paper, we present two integer linear programming (ILP) formulations to find the most parsimonious phylogenetic tree from a set of binary variation data. One met… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
35
0

Year Published

2009
2009
2023
2023

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 51 publications
(35 citation statements)
references
References 31 publications
0
35
0
Order By: Relevance
“…Similarly, if the input haplotype set H satisfies the perfect phylogeny condition i.e., the requirement that each allele changes only once throughout the optimal phylogeny (see [19]), then the MPPEP-SNP can be still solved in polynomial time [26][27][28]. Unfortunately, it is possible to prove that in the general case the MPPEP-SNP is N P-hard (see [1,22]). In fact, the binary nature of the SNP haplotypes allows us to interpret a generic haplotype h i ∈ H as a vertex of a m-dimensional unit hypercube, its s-th allele as the s-th coordinate of the vertex h i , and the set H as the set of Steiner vertices of the unit hypercube.…”
Section: Problem 2 the Most Parsimonious Phylogeny Estimation Problementioning
confidence: 99%
See 4 more Smart Citations
“…Similarly, if the input haplotype set H satisfies the perfect phylogeny condition i.e., the requirement that each allele changes only once throughout the optimal phylogeny (see [19]), then the MPPEP-SNP can be still solved in polynomial time [26][27][28]. Unfortunately, it is possible to prove that in the general case the MPPEP-SNP is N P-hard (see [1,22]). In fact, the binary nature of the SNP haplotypes allows us to interpret a generic haplotype h i ∈ H as a vertex of a m-dimensional unit hypercube, its s-th allele as the s-th coordinate of the vertex h i , and the set H as the set of Steiner vertices of the unit hypercube.…”
Section: Problem 2 the Most Parsimonious Phylogeny Estimation Problementioning
confidence: 99%
“…Variables u i , i ∈ Q, do not need to be declared as their value will be always 1 any feasible solution to the problem. Finally, variables x s i , i ∈ V H , can be removed as their value is univocally assigned by the input haplotype set H. The reduction process can be further combined with the preprocessing strategies described in [1] to obtain even smaller formulations. Such strategies allow one to remove alleles from the input haplotype set H without altering the optimal solution to the problem.…”
Section: Reducing Model Sizementioning
confidence: 99%
See 3 more Smart Citations