2019
DOI: 10.1109/tcbb.2017.2779509
|View full text |Cite
|
Sign up to set email alerts
|

Theory and A Heuristic for the Minimum Path Flow Decomposition Problem

Abstract: Motivated by multiple genome assembly problems and other applications, we study the following minimum path flow decomposition problem: given a directed acyclic graph with source and sink and a flow , compute a set of paths and assign weight for such that , and is minimized. We develop some fundamental theory for this problem, upon which we design an efficient heuristic. Specifically, we prove that the gap between the optimal number of paths and a known upper bound is determined by the nontrivial equations with… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
41
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
5
2
1

Relationship

1
7

Authors

Journals

citations
Cited by 26 publications
(41 citation statements)
references
References 37 publications
(46 reference statements)
0
41
0
Order By: Relevance
“…Many approximation algorithms have been developed for finding a minimum path flow decomposition, e.g. [40,41], but these algorithms could not even handle our smallest data set (a mixture of 2 haplotypes of length 2500 bp). Therefore, we resort to other, more efficient means for obtaining a set of haplotypes from the given flow solution [42].…”
Section: Greedy Path Extractionmentioning
confidence: 99%
“…Many approximation algorithms have been developed for finding a minimum path flow decomposition, e.g. [40,41], but these algorithms could not even handle our smallest data set (a mixture of 2 haplotypes of length 2500 bp). Therefore, we resort to other, more efficient means for obtaining a set of haplotypes from the given flow solution [42].…”
Section: Greedy Path Extractionmentioning
confidence: 99%
“…Finding a flow decomposition with the minimum number of weighted paths is a well-studied problem in computer science. Even when restricted to DAGs, the minimum FD problem is NP-hard [11], and thus various practical approaches to it exist: approximation algorithms [12], [13], [14], [15], [16], [17], FPT algorithms [9], greedy algorithms [11], [18]. By taking the set of subpaths constraints to be empty (or to correspond to all edges of the graph with non-zero flow), it follows that also finding a solution to the FDSC problem with a minimum number of paths is NP-hard.…”
Section: Related Workmentioning
confidence: 99%
“…We implement both algorithms for FDSC, and perform a proof-of-concept study of their usefulness in RNA assembly. We experiment on a dataset developed by Shao et al [18] to study their heuristic for the minimum FD problem. The same dataset was then used by Kloster et al [9], who focused on studying the usefulness of standard minimum flow decompositions in RNA assembly, as explained above.…”
Section: Contributionsmentioning
confidence: 99%
See 1 more Smart Citation
“…StringTie [ 6 ] iteratively finds the heaviest path of a flow network constructed from splice graphs. A theoretical work by Shao et al [ 10 ] studies the minimum path decomposition of splice graphs when the edge abundances satisfy flow balance constraints. Better network flow estimation on splice graphs inspires improvement of transcriptome assembly methods.…”
Section: Introductionmentioning
confidence: 99%