2019
DOI: 10.12688/f1000research.20233.1
|View full text |Cite
|
Sign up to set email alerts
|

A Sequence Distance Graph framework for genome assembly and analysis

Abstract: The Sequence Distance Graph (SDG) framework works with genome assembly graphs and raw data from paired, linked and long reads. It includes a simple deBruijn graph module, and can import graphs using the graphical fragment assembly (GFA) format. It also maps raw reads onto graphs, and provides a Python application programming interface (API) to navigate the graph, access the mapped and raw data and perform interactive or scripted analyses. Its complete workspace can be dumped to and loaded from disk, decoupling… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
2
1
1

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(3 citation statements)
references
References 17 publications
0
3
0
Order By: Relevance
“…A canonical collapsed De Bruijn Graph (DBG) is first constructed in Sequence Distance Graph Threader (SDG-Threader)(14) from the short reads, which is then cleaned and simplified. Further repeat resolution is completed by finding unique paths between nodes representing haploid content, followed by canonical repeat resolution using long reads.…”
Section: Methodsmentioning
confidence: 99%
“…A canonical collapsed De Bruijn Graph (DBG) is first constructed in Sequence Distance Graph Threader (SDG-Threader)(14) from the short reads, which is then cleaned and simplified. Further repeat resolution is completed by finding unique paths between nodes representing haploid content, followed by canonical repeat resolution using long reads.…”
Section: Methodsmentioning
confidence: 99%
“…This has resulted in the development of the FASTG format ( http://fastg.sourceforge.net , accessed 25 May 2021) and, more recently and with wider adoption, the graphical fragment assembly (GFA) format for assembly graph files ( https://github.com/GFA-spec/GFA-spec , accessed 25 May 2021). These have been implemented in a number of tools including recent versions of ABySS ( Jackman, 2017 ), SPAdes ( Bankevich, 2012 ), metaSPAdes and SDG ( Yanes, 2019 ). Other tools have addressed the problem of analysing the quantity of sequencing data that is currently available, particularly in metagenomics.…”
Section: Introductionmentioning
confidence: 99%
“…This has resulted in the development of the FASTG format [12] and, more recently and with wider adoption, the graphical fragment assembly (GFA) format for assembly graph files [13]. These have been implemented in a number of tools including recent versions of ABySS [14], SPAdes [15] and SDG [16]. However, as yet, specialist metagenomic assembly tools lack the ability to capture variation through GFA output.…”
Section: Introductionmentioning
confidence: 99%