2018
DOI: 10.1186/s12859-018-2243-x
|View full text |Cite
|
Sign up to set email alerts
|

ARKS: chromosome-scale scaffolding of human genome drafts with linked read kmers

Abstract: BackgroundThe long-range sequencing information captured by linked reads, such as those available from 10× Genomics (10xG), helps resolve genome sequence repeats, and yields accurate and contiguous draft genome assemblies. We introduce ARKS, an alignment-free linked read genome scaffolding methodology that uses linked reads to organize genome assemblies further into contiguous drafts. Our approach departs from other read alignment-dependent linked read scaffolders, including our own (ARCS), and uses a kmer-bas… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
52
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
6
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 91 publications
(57 citation statements)
references
References 24 publications
(36 reference statements)
0
52
0
Order By: Relevance
“…The total scaffold length is 943,793,727 bp distributed in 7830 scaffolds with an N50 of 421 kb (with gaps). The resulting assembly was further scaffolded by ARCS v1.0 ( Coombe et al, 2018 ) + LINKS v1.8.5 ( Warren et al, 2015 ) with the following parameters: arcs -e 50000 c 3 r 0.05 s 98 and LINKS -m -d 4000 k 20 -e 0.1 l 3 -a 0.3 t 2 -o 0 -z 500 r -p 0.001 -x 0. This increased the scaffold N50 to 1.527 Mb.…”
Section: Methodsmentioning
confidence: 99%
“…The total scaffold length is 943,793,727 bp distributed in 7830 scaffolds with an N50 of 421 kb (with gaps). The resulting assembly was further scaffolded by ARCS v1.0 ( Coombe et al, 2018 ) + LINKS v1.8.5 ( Warren et al, 2015 ) with the following parameters: arcs -e 50000 c 3 r 0.05 s 98 and LINKS -m -d 4000 k 20 -e 0.1 l 3 -a 0.3 t 2 -o 0 -z 500 r -p 0.001 -x 0. This increased the scaffold N50 to 1.527 Mb.…”
Section: Methodsmentioning
confidence: 99%
“…The total scaffold length is 943,793,727bp distributed in 7830 scaffolds with an N50 of 421kb (with gaps). The resulting assembly was further scaffolded by ARCS v1.0 48 + LINKS v1.8.5 49 with the following parameters: arcs -e 50000 -c 3 -r 0.05 -s 98 and LINKS -m -d 4000 -k 20 -e 0.1 -l 3 -a 0.3 -t 2 –o 0 -z 500 -r -p 0.001 -x 0. This increased the scaffold N50 to 1.527 Mb.…”
Section: Methodsmentioning
confidence: 99%
“…Analysis of 17-mer sequences revealed a high level of heterozygosity of the alkaligrass genome (1.64%) ( Figure S3, Table S2 in Supporting Information); therefore, we also used the genome sequence of Aegilops tauschii (2n=14; a closely related species of alkaligrass) to remove potential heterozygous contig sequences using ALLMAPS (Tang et al, 2015). Afterwards, the remaining contigs were assembled into scaffolds by ARKS (Coombe et al, 2018) and LINKS (Warren et al, 2015) using 10× Genomics data, and finally a total of 1,715 scaffolds were generated with a total size of 1.50 and an N50 size of 1.46 Mb (Table S3 in Supporting Information).…”
Section: Genome Sequencing and De Novo Assemblymentioning
confidence: 99%
“…The redundancy in the polished contig sequences were removed based on gene synteny compared with the genome of Aegilops tauschii (a closely related species of alkaligrass) using ALLMAPS (v0.8.12;Tang et al, 2015). 10× Genomics data was used to scaffold the assembled contigs using the ARKS-LINKS pipeline with the parameters of ARKS (Coombe et al, 2018) set to '-c 5 -j 0.55 -m 50-10000 -k 30 -r 0.05 -e 3000 -z 500 -d 0' and the parameters of LINKS (Warren et al, 2015) set to '-l 5 -a 0.9 -z 500'. Based on high-throughput chromosome conformation capture (Hi-C) data, scaffolds were anchored to seven pseudo-molecules using LACHESIS (Burton et al, 2013)…”
Section: Genome Assemblymentioning
confidence: 99%