2018
DOI: 10.1093/gigascience/giy157
|View full text |Cite
|
Sign up to set email alerts
|

LR_Gapcloser: a tiling path-based gap closer that uses long reads to complete genome assembly

Abstract: BackgroundCompleting a genome is an important goal of genome assembly. However, many assemblies, including reference assemblies, are unfinished and have a number of gaps. Long reads obtained from third-generation sequencing (TGS) platforms can help close these gaps and improve assembly contiguity. However, current gap-closure approaches using long reads require extensive runtime and high memory usage. Thus, a fast and memory-efficient approach using long reads is needed to obtain complete genomes.FindingsWe de… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
165
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
5
5

Relationship

0
10

Authors

Journals

citations
Cited by 186 publications
(166 citation statements)
references
References 43 publications
(65 reference statements)
1
165
0
Order By: Relevance
“…If an error is found over a gap, the scaffold was broken into two using the same software to remove potential structural errors in the assembly. The broken assemblies were concatenated using SALSA2 60 or 3D-DNA 61 , followed by gap filling with the 80X Illumina reads using SOAP-denovo2 Gap-Closer 62 and with the PacBio reads using LR_GapCloser v1.1 63 . We observed that 3D-DNA generated a slightly more correct assembly than SALSA2 from BUSCO analysis (Table S1).…”
Section: Genome Assemblymentioning
confidence: 99%
“…If an error is found over a gap, the scaffold was broken into two using the same software to remove potential structural errors in the assembly. The broken assemblies were concatenated using SALSA2 60 or 3D-DNA 61 , followed by gap filling with the 80X Illumina reads using SOAP-denovo2 Gap-Closer 62 and with the PacBio reads using LR_GapCloser v1.1 63 . We observed that 3D-DNA generated a slightly more correct assembly than SALSA2 from BUSCO analysis (Table S1).…”
Section: Genome Assemblymentioning
confidence: 99%
“…This assembly had a total of 3593 contigs (N50: 683 kb) from across the genome. We closed gaps in the v. 4 threespine stickleback reference assembly using these contigs and LR_Gapcloser with the parameter -a 1 (Xu et al 2019). We increased the allowed deviation between gap length and the inserted sequence length in order to provide additional flexibility for gap size that was not inferred accurately in the v. 4 genome assembly.…”
Section: Closing Gaps In the Reference Assemblymentioning
confidence: 99%
“…8 We scaffolded the polished contigs using the corrected nanopore sequences with SSPACE-LongRead (version 1-1) (Boetzer and Pirovano 2014), requiring three overlaps to connect contigs ('-l 3'). We used the corrected nanopore reads to extend contigs and fill scaffold gaps using LR_Gapcloser (Xu et al 2019) with default settings and a total of ten iterative rounds. Finally, we aligned the corrected nanopore reads to the scaffolds with Minimap2 and used these alignments to remove redundant scaffolds from the assembly using Purge Haplotigs (version 1.0.0) (Roach et al 2018).…”
Section: Genome Assembly Error Correction and Scaffoldingmentioning
confidence: 99%