2022
DOI: 10.1089/cmb.2022.0257
|View full text |Cite
|
Sign up to set email alerts
|

Efficient Minimum Flow Decomposition via Integer Linear Programming

Abstract: Minimum flow decomposition (MFD) is an NP-hard problem asking to decompose a network flow into a minimum set of paths (together with associated weights). Variants of it are powerful models in multiassembly problems in Bioinformatics, such as RNA assembly. Owing to its hardness, practical multiassembly tools either use heuristics or solve simpler, polynomial time-solvable versions of the problem, which may yield solutions that are not minimal or do not perfectly decompose the flow. Here, we provide the first fa… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
10
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
3
2

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(10 citation statements)
references
References 62 publications
0
10
0
Order By: Relevance
“…In case 3, we have more complex phage components where there are more than two unitigs forming branching paths, and we model them as a minimum flow decomposition (MFD) problem. The MFD problem decomposes a directed acyclic graph (DAG) into a minimum number of source-to-sink ( s − t ) paths that explain the flow values of the edges of the graph (Vatinlen et al 2008; Dias et al 2022). The most prominent applications of the MFD problem in bioinformatics include reconstructing RNA transcripts (Shao and Kingsford 2017; Tomescu et al 2013; Gatter and Stadler 2019) and viral quasispecies assembly (Baaijens, Stougie, and Schönhuth 2020).…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…In case 3, we have more complex phage components where there are more than two unitigs forming branching paths, and we model them as a minimum flow decomposition (MFD) problem. The MFD problem decomposes a directed acyclic graph (DAG) into a minimum number of source-to-sink ( s − t ) paths that explain the flow values of the edges of the graph (Vatinlen et al 2008; Dias et al 2022). The most prominent applications of the MFD problem in bioinformatics include reconstructing RNA transcripts (Shao and Kingsford 2017; Tomescu et al 2013; Gatter and Stadler 2019) and viral quasispecies assembly (Baaijens, Stougie, and Schönhuth 2020).…”
Section: Methodsmentioning
confidence: 99%
“…Hence, Phables models each of the remaining phage components as an MFD problem and uses the MFD-ILP implementation from Dias et al . (Dias et al 2022). MFD-ILP finds a FD (𝒫, w ) with a set of s − t flow paths 𝒫 and associated weights w such that the number of flow paths is minimized.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…On DAGs, a FD into paths always exists ( Ahuja et al 1988 ), but in general graphs, cycles may be necessary to decompose the flow [see e.g. Dias et al (2022a) for different possible formulations of the problem].…”
Section: Methodsmentioning
confidence: 99%
“…Computational methods are increasingly used to identify novel isoforms to complement annotations of model species and to construct transcriptomes for non-model species, thanks to the availability of largescale reposited RNA-seq data. The process of reconstructing full-length transcripts from RNA-seq reads, known as transcript assembly, has been extensively studied, with significant progress made in advancing the theory [20,21,22] and in developing practical assemblers, including Cufflinks [23], CLASS2 [24],…”
Section: Introductionmentioning
confidence: 99%