2018
DOI: 10.1186/s12859-018-2425-6
|View full text |Cite
|
Sign up to set email alerts
|

Tigmint: correcting assembly errors using linked reads from large molecules

Abstract: BackgroundGenome sequencing yields the sequence of many short snippets of DNA (reads) from a genome. Genome assembly attempts to reconstruct the original genome from which these reads were derived. This task is difficult due to gaps and errors in the sequencing data, repetitive sequence in the underlying genome, and heterozygosity. As a result, assembly errors are common. In the absence of a reference genome, these misassemblies may be identified by comparing the sequencing data to the assembly and looking for… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
102
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
5
3
1

Relationship

1
8

Authors

Journals

citations
Cited by 116 publications
(103 citation statements)
references
References 29 publications
0
102
0
Order By: Relevance
“…The final Hi-C-assisted genome assembly was commissioned by Annoroad Gene Technology. Tigmint (version 1.1.2) 53 was used to detect errors using linked reads from 10× Genomics Chromium. The reads were initially aligned to the Hi-C scaffolds, and the extents of the large DNA molecules were inferred from alignments of the reads.…”
Section: Methodsmentioning
confidence: 99%
“…The final Hi-C-assisted genome assembly was commissioned by Annoroad Gene Technology. Tigmint (version 1.1.2) 53 was used to detect errors using linked reads from 10× Genomics Chromium. The reads were initially aligned to the Hi-C scaffolds, and the extents of the large DNA molecules were inferred from alignments of the reads.…”
Section: Methodsmentioning
confidence: 99%
“…After processing, 408 million read pairs remained with 95.6% whitelisted barcodes and a barcode diversity of 752,722. We used tigmint (Jackman et al 2018) with settings as=100, depth_treshold=65, minsize=2000, number of mismatches=5 to break contigs at suspected misassemblies and low-quality regions. Next, arcs (Yeo et al 2017) was used to scaffold the contigs using settings c = 5, e=30000 and r=0.05, followed by links v1.8.5 (also from the arcs pipeline)…”
Section: Misassembly Detection and Scaffolding With Linked-read Datamentioning
confidence: 99%
“…In the second scaffolding stage, the Tigmint -arcs -links pipeline was utilized from BC Genome Sciences Centre. Tigmint v1.1.2 (Jackman et al, 2018) was run using the "arcs" pipeline to run all three stages. The Tigmint portion of the pipeline was run with default parameters.…”
Section: Subsequent Library Preparation and Sequencing Steps Were Permentioning
confidence: 99%
“…First, Hi-C sequencing data was utilized with the SALSA2 pipeline (Ghurye et al, 2019) (Supplementary Figure 1) to generate an assembly with scaffold N50 of 18.8 Mbp across a total of 941 scaffolds. Second, 10X Chromium data was introduced with the Tigmint/ARCS pipeline (Jackman et al, 2018;Yeo et al, 2018), to both break suspect contigs and to further scaffold the genome. Error correction reduced contig N50 to 3.40 Mbp (1,395 contigs) while scaffold N50 increased to 23.3 Mbp.…”
Section: Genome Assemblymentioning
confidence: 99%