2019
DOI: 10.1101/651638
Preprint

Accelerating Sequence Alignment to Graphs

Abstract: Aligning DNA sequences to an annotated reference is a key step for genotyping in biology. Recent scientific studies have demonstrated improved inference by aligning reads to a variation graph, i.e., a reference sequence augmented with known genetic variations. Given a variation graph in the form of a directed acyclic string graph, the sequence to graph alignment problem seeks to find the best matching path in the graph for an input query sequence. Solving this problem exactly using a sequential dynamic program…

Citations: cited by 19 publications (27 citation statements)
References: 48 publications (45 reference statements)
“…We measured performance using GCUPS (giga cell updates per second), which is the number of cells in the dynamic programming matrix computed per second. This standard metric used extensively in the literature normalizes performance to enable comparisons between experiments conducted using reads and references of different lengths (Rahn et al, 2018; Daily, 2016; Jain et al, 2019; Liu and Schmidt, 2014; Liu et al, 2013).…”
Section: Results (mentioning)
confidence: 99%
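
To make the metric in the statement above concrete, here is a minimal sketch of how a GCUPS figure could be computed. It assumes the dynamic programming matrix has one row per query character and one column per reference (or graph) character, so the cell count is their product summed over all queries. The function name `gcups` and its parameters are illustrative and are not taken from any of the cited tools.

```python
def gcups(query_len: int, ref_len: int, seconds: float, num_queries: int = 1) -> float:
    """Return giga cell updates per second for an alignment run.

    Assumption: every query has length `query_len` and fills a full
    query_len x ref_len dynamic programming matrix.
    """
    if seconds <= 0:
        raise ValueError("seconds must be positive")
    # Total number of DP cells computed across all queries.
    cells = query_len * ref_len * num_queries
    # Cells per second, scaled to units of 10^9 ("giga").
    return cells / seconds / 1e9


if __name__ == "__main__":
    # 1,000 reads of length 100 against a 1,000,000-character reference in 2 s.
    print(gcups(100, 1_000_000, 2.0, num_queries=1000))  # 50.0
```

Because the cell count grows with both read and reference length, dividing it by run time gives a figure that can be compared across experiments with different input sizes, which is the normalization the quoted passage describes.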
“…For a particular row and column position in the matrix, SIMD instructions are used to compute the value for 8-64 queries simultaneously, depending on the vector capacity. The query-parallel strategy has been used for alignment to linear references in SeqAn (Rahn et al, 2018) and to directed acyclic graphs in PaSGAL (Jain et al, 2019).…”
Section: Vectorization Strategies (mentioning)
confidence: 99%
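
The query-parallel idea in the statement above can be sketched in plain Python, with a list standing in for a SIMD register so that each "lane" holds the value for one query at the same row and column. This is only an illustration under simplifying assumptions: it computes unit-cost edit distance against a linear reference, requires equal-length queries, and is not the SeqAn or PaSGAL implementation. The function name `query_parallel_edit_distance` is hypothetical.

```python
def query_parallel_edit_distance(queries, reference):
    """Edit distance of each query to `reference`, one lane per query.

    Assumption: all queries have the same length, so every lane is at
    the same (row, column) position of its own DP matrix.
    """
    if not queries:
        return []
    lanes = len(queries)
    m = len(queries[0])
    if any(len(q) != m for q in queries):
        raise ValueError("this sketch assumes equal-length queries")

    # prev[i] is a "vector": the DP value at row i of the previous
    # column, for every lane. Column 0 is D[i][0] = i in all lanes.
    prev = [[i] * lanes for i in range(m + 1)]
    for j, ref_char in enumerate(reference, start=1):
        curr = [[j] * lanes]  # row 0 of this column: D[0][j] = j
        for i in range(1, m + 1):
            # Each list comprehension mimics one vector instruction:
            # the same operation applied to every lane at once.
            sub = [prev[i - 1][k] + (queries[k][i - 1] != ref_char)
                   for k in range(lanes)]
            dele = [prev[i][k] + 1 for k in range(lanes)]
            ins = [curr[i - 1][k] + 1 for k in range(lanes)]
            curr.append([min(a, b, c) for a, b, c in zip(sub, dele, ins)])
        prev = curr
    return prev[m]


if __name__ == "__main__":
    print(query_parallel_edit_distance(["ACGT", "ACGA", "TTTT"], "ACGT"))
    # [0, 1, 3]
```

With real SIMD hardware the number of lanes would be fixed by the vector width and score type, which is where the 8-64 range in the quoted passage comes from.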