2018
DOI: 10.1101/306902
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

ARKS: chromosome-scale scaffolding of human genome drafts with linked read kmers

Abstract: Background. The long-range sequencing information captured by linked reads, such as those available from 10x Genomics (10xG), helps resolve genome sequence repeats, and yields accurate and contiguous draft genome assemblies. We introduce ARKS, an alignment-free linked read genome scaffolding methodology that uses linked reads to organize genome assemblies further into contiguous drafts. Our approach departs from other read alignment-dependent linked read scaffolders, including our own (ARCS), and uses a kmer-b… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
46
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 38 publications
(46 citation statements)
references
References 26 publications
0
46
0
Order By: Relevance
“…Integration of 10x and Hi-C reads was done differently. For the 10x scaffolding, we used tigmint v1.1.2 50 to detect and cut sequences at positions with few spanning molecules, arks v1.0.3 51 to generate graphs of scaffolds with connection evidence, and LINKS 52 for a second step of scaffolding. For the Hi-C data, we used BWA 53 to map the reads against the previous scaffolds and juicer tools v1.5 54 for the super-scaffolding.…”
Section: Methodsmentioning
confidence: 99%
“…Integration of 10x and Hi-C reads was done differently. For the 10x scaffolding, we used tigmint v1.1.2 50 to detect and cut sequences at positions with few spanning molecules, arks v1.0.3 51 to generate graphs of scaffolds with connection evidence, and LINKS 52 for a second step of scaffolding. For the Hi-C data, we used BWA 53 to map the reads against the previous scaffolds and juicer tools v1.5 54 for the super-scaffolding.…”
Section: Methodsmentioning
confidence: 99%
“…The scaffolds were extended with 10× Genomics data by using ARKS 43 with the following parameters: "-c 5 -j 0.55 -m 50-10000 -k 30 -r 0.05 -e 3000 -z 500 -d 0", and LINKS 44 with the following parameters: "-l 5 -a 0.9 -z 500". Thereafter, redundancy in the assembly was removed using redundans 0-13c 45 with the following parameters: "-identity 0.7 -overlap 0.7".…”
Section: Genome Assemblymentioning
confidence: 99%
“…70 to increase the contiguity of each of our non-redundant assemblies. ARKS uses a k-mer approach to infer graph edges by determining the Chromium barcodes associated with the best-matching contig end for each read and selecting the contig end with the largest fraction of k-mer overlap.…”
mentioning
confidence: 99%