2016
DOI: 10.1093/bioinformatics/btw537
|View full text |Cite
|
Sign up to set email alerts
|

H-PoP and H-PoPG: heuristic partitioning algorithms for single individual haplotyping of polyploids

Abstract: Motivation: Some economically important plants including wheat and cotton have more than two copies of each chromosome. With the decreasing cost and increasing read length of next-generation sequencing technologies, reconstructing the multiple haplotypes of a polyploid genome from its sequence reads becomes practical. However, the computational challenge in polyploid haplotyping is much greater than that in diploid haplotyping, and there are few related methods. Results: This paper models the polyploid haploty… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
104
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 56 publications
(104 citation statements)
references
References 32 publications
0
104
0
Order By: Relevance
“…We calculate this criterion for each haplotype block and report the average. The vector error rate is calculated by finding the minimum number of switches needed in haplotype segments in order to match • to ; this number is then divided by the haplotype length [7,15].…”
Section: Performance Assessmentmentioning
confidence: 99%
See 1 more Smart Citation
“…We calculate this criterion for each haplotype block and report the average. The vector error rate is calculated by finding the minimum number of switches needed in haplotype segments in order to match • to ; this number is then divided by the haplotype length [7,15].…”
Section: Performance Assessmentmentioning
confidence: 99%
“…SDhaP [6] solves a correlation clustering problem using a gradient method to estimate the haplotypes. H-PoP [7], a heuristic algorithm, solves a combinatorial optimization problem called "polyploid balanced optimal partition". Another approach is to use the minimum fragment removal (MFR) model in which conflicting fragments (due to erroneous reads) are removed.…”
Section: Introductionmentioning
confidence: 99%
“…The vast majority of existing haplotype assembly methods attempt to remove the aforementioned ambiguity by altering or even discarding the data, leading to minimum SNP removal (Lancia 2001), maximum fragments cut (Duitama 2010), and minimum error correction (MEC) score optimization criteria. Majority of haplotype assembly methods developed in recent years are focused on optimizing the MEC score, i.e., determining the smallest possible number of nucleotides in sequencing reads that should be altered such that the resulting dataset is consistent with having originated from k haplotypes (k denotes the ploidy of an organism) (Xie 2016;Pirola 2015;Kuleshov 2014;Patterson 2015;Bonizzoni 2016). These include the branch-and-bound scheme (Wang 2005), an integer linear programming formulation in (Chen 2013), and a dynamic programming framework in (Kuleshov 2014).…”
Section: Introductionmentioning
confidence: 99%
“…Among the aforementioned methods, only HapCompass (Aguiar 2012), SD-haP (Das 2015) and BP (Puljiz 2016) are capable of solving the haplotype assembly problem for k > 2. Other techniques that can handle reconstruction of haplotypes for both diploid and polyploid genomes include a Bayesian method HapTree (Berger 2014), a dynamic programming method H-PoP (Xie 2016) shown to be more accurate than the techniques in (Aguiar 2012;Berger 2014;Das 2015), and the matrix factorization schemes in (Cai 2016;Hashemi 2018).…”
Section: Introductionmentioning
confidence: 99%
“…Examples for diploid haplotype assembly are WhatsHap (Patterson et al, 2015), Phaser (Castel et al, 2016), Hap-Cut2 (Edge et al, 2017), ProbHap (Kuleshov, 2014) and HapCol (Pirola et al, 2016). Examples for polyploid haplotype assembly are Hap-Compass (Aguiar and Istrail, 2012), HapTree (Berger et al, 2014), SDhaP (Das and Vikalo, 2015), and H-PoP (Xie et al, 2016). The disadvantage of haplotype assembly programs is that they depend on high-quality reference sequence as a backbone, and, in addition, also on external variant call sets, which are major external factors that can introduce non-negligible biases.…”
Section: Introductionmentioning
confidence: 99%