2016
DOI: 10.1007/s00453-016-0165-4

An External-Memory Algorithm for String Graph Construction

Abstract: Some recent results [13,25] have introduced external-memory algorithms to compute self-indexes of a set of strings, mainly via computing the Burrows-Wheeler Transform (BWT) of the input strings. The motivations for those results stem from Bioinformatics, where a large number of short strings (called reads) are routinely produced and analyzed. In that field, a fundamental problem is to assemble a genome from a large set of much shorter samples extracted from the unknown genome. The approaches that are currently …

Cited by 10 publications (4 citation statements)
References 34 publications (65 reference statements)

“…Furthermore, fingerprints are numerical sequences shorter than the represented character sequences and we also expect that they are resilient to errors occurring in the reads (especially in long reads). The first application is related to genome assembly based on the use of an overlap graph which is constructed by detecting the overlaps between genomic reads [11,12]. When dealing with long reads this task is further complicated by the length of the reads and the high sequencing error rate.…”
Section: Some Applications: Representing and Querying Read Sequences (mentioning)
confidence: 99%
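
The overlap detection that the statement above refers to can be illustrated with a minimal sketch. It is a naive quadratic comparison written only for illustration, not the external-memory or FM-index based method of the cited papers. The names `longest_overlap` and `overlap_graph`, the `min_len` threshold, and the sample reads are all hypothetical.

```python
from itertools import permutations


def longest_overlap(a: str, b: str, min_len: int = 1) -> int:
    """Length of the longest proper suffix of `a` that is a prefix of `b`.

    Returns 0 when no overlap of at least `min_len` characters exists.
    Containments (one read entirely inside the overlap) are excluded,
    which is an assumption of this sketch.
    """
    max_k = min(len(a), len(b)) - 1
    for k in range(max_k, min_len - 1, -1):
        if a.endswith(b[:k]):
            return k
    return 0


def overlap_graph(reads: list[str], min_len: int = 2) -> dict[tuple[int, int], int]:
    """Map each ordered pair (i, j) of read indices to its overlap length."""
    arcs = {}
    for i, j in permutations(range(len(reads)), 2):
        k = longest_overlap(reads[i], reads[j], min_len)
        if k:
            arcs[(i, j)] = k
    return arcs


# Hypothetical error-free reads; real long reads would need approximate matching.
example_reads = ["ACGTA", "GTACC", "ACCGT"]
example_arcs = overlap_graph(example_reads, min_len=2)
```

The sketch compares every ordered pair of reads exactly, so it neither scales to large read sets nor tolerates sequencing errors, which is the difficulty the quoted statement points out for long reads.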
“…Their algorithm for transitive reduction is similar to that of Myers'. Bonizzoni et al [15] propose a similar approach using only the FM-index of the input sequence to create a string graph and perform a transitive reduction with a different but equivalent formulation than Myers' [19].…”
Section: Related Work (mentioning)
confidence: 99%
“…In (Myers, 2005) redundant arcs are those arcs (r, s) labeled by γ, for γ containing as prefix the label of an arc (r, t). In (Bonizzoni et al, 2016b) we state an equivalent characterization of string graphs (given below) which is a direct consequence of the fact that an arc (r_i, r_j) is labeled by the left extension α and its assembly is αr_j. An arc e_1 = (r_i, r_j) of G_O labeled by α is transitive (or reducible) if there exists another arc e_2 = (r_k, r_j) labeled by δ where δ is a suffix of α.…”
Section: Preliminaries (mentioning)
confidence: 99%
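
The characterization of transitive arcs quoted above can be sketched as a small check over labeled arcs. This is an illustrative reading of the quoted definition, not the algorithm of the cited paper. The helper names, the dictionary representation of arcs, and the sample reads are hypothetical. The sketch also assumes that δ must be a proper suffix of α, so that two arcs with identical labels do not eliminate each other; the quoted text says only "suffix".

```python
def left_extension(source_read: str, overlap_len: int) -> str:
    """Label of an arc: the part of the source read that precedes the overlap."""
    return source_read[: len(source_read) - overlap_len]


def transitive_arcs(arcs: dict[tuple[int, int], str]) -> set[tuple[int, int]]:
    """Return arcs (i, j) labeled alpha for which another arc (k, j) has a
    label delta that is a proper suffix of alpha."""
    reducible = set()
    for (i, j), alpha in arcs.items():
        for (k, target), delta in arcs.items():
            same_target = target == j and k != i
            if same_target and len(delta) < len(alpha) and alpha.endswith(delta):
                reducible.add((i, j))
                break
    return reducible


# Hypothetical reads: r0 overlaps r1 by 4 characters, r1 overlaps r2 by 4,
# and r0 overlaps r2 by 2.
r0, r1, r2 = "ACGTAC", "GTACGG", "ACGGTT"
labeled_arcs = {
    (0, 1): left_extension(r0, 4),  # "AC"
    (1, 2): left_extension(r1, 4),  # "GT"
    (0, 2): left_extension(r0, 2),  # "ACGT"
}
reducible = transitive_arcs(labeled_arcs)
```

In this example the arc (0, 2) has label "ACGT" and the arc (1, 2) has label "GT", which is a suffix of "ACGT", so (0, 2) is the only arc reported as transitive; the path through read 1 already spells the same assembly.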