2019
DOI: 10.1101/837674
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

A Graph Auto-Encoder for Haplotype Assembly and Viral Quasispecies Reconstruction

Abstract: Reconstructing components of a genomic mixture from data obtained by means of DNA sequencing is a challenging problem encountered in a variety of applications including single individual haplotyping and studies of viral communities. Highthroughput DNA sequencing platforms oversample mixture components to provide massive amounts of reads whose relative positions can be determined by mapping the reads to a known reference genome; assembly of the components, however, requires discovery of the reads' origin -an NP… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
13
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
3
1
1

Relationship

2
3

Authors

Journals

citations
Cited by 5 publications
(13 citation statements)
references
References 37 publications
0
13
0
Order By: Relevance
“…Note that no method can correctly reconstruct the strains in 4 genes (vpu, gp120, gp41 and nef). As noted in (Ke and Vikalo, 2020), this may be due to translocations of short segments in those genes, causing mismatch between the 5 HIV-1 strains reconstructed by (Di Giallonardo et al, 2014) and the actual ground truth.…”
Section: Performance Comparison On Semi-experimental Solanum Tuberosumentioning
confidence: 99%
See 4 more Smart Citations
“…Note that no method can correctly reconstruct the strains in 4 genes (vpu, gp120, gp41 and nef). As noted in (Ke and Vikalo, 2020), this may be due to translocations of short segments in those genes, causing mismatch between the 5 HIV-1 strains reconstructed by (Di Giallonardo et al, 2014) and the actual ground truth.…”
Section: Performance Comparison On Semi-experimental Solanum Tuberosumentioning
confidence: 99%
“…While early methods explored a variety of metrics including minimum single nucleotide polymorphism (SNP) removal (Lancia et al, 2001) and maximum fragments cut (Duitama et al, 2010), the vast majority of more recent techniques is focused on minimum error correction (MEC) optimization (Lippert et al, 2002), i.e. determining the smallest number of inconsistencies between reads and the reconstructed haplotypes (Ke and Vikalo, 2020;Edge, Bafna, and Bansal, 2017;Xie et al, 2016;Bonizzoni et al, 2016;Patterson et al, 2015;Pisanti et al, 2015;Kuleshov, 2014). Existing MEC score optimization methods can be divided into two categories: those in pursuit of exact solutions to the MEC optimization problem, and computationally efficient heuristics.…”
Section: Related Workmentioning
confidence: 99%
See 3 more Smart Citations