2020
DOI: 10.1101/2020.03.16.992933
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Comparison of long read methods for sequencing and assembly of a plant genome

Abstract: Sequencing technologies have advanced to the point where it is possible to generate high accuracy, haplotype resolved, chromosome scale assemblies. Several long read sequencing technologies are available on the market and a growing number of algorithms have been developed over the last years to assemble the reads generated by those technologies. When starting a new genome project, it is therefore challenging to select the most cost-effective sequencing technology as well as the most appropriate software for as… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
11
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
4
3

Relationship

3
4

Authors

Journals

citations
Cited by 13 publications
(11 citation statements)
references
References 34 publications
0
11
0
Order By: Relevance
“…Although this benchmark gave essential information on the performance of these assemblers, eukaryote genomes represent a completely different challenge. Recently, a publication compared different long-read sequencing technologies to assemble a plant genome, Macadamia jansenii 31 . They included statistics for different assemblers and obtained, depending on the tool, oversized assemblies combined with heightened numbers of duplicated BUSCO features on an 80X PacBio dataset, while they did not observe such differences on a 30X Nanopore dataset.…”
Section: Discussionmentioning
confidence: 99%
“…Although this benchmark gave essential information on the performance of these assemblers, eukaryote genomes represent a completely different challenge. Recently, a publication compared different long-read sequencing technologies to assemble a plant genome, Macadamia jansenii 31 . They included statistics for different assemblers and obtained, depending on the tool, oversized assemblies combined with heightened numbers of duplicated BUSCO features on an 80X PacBio dataset, while they did not observe such differences on a 30X Nanopore dataset.…”
Section: Discussionmentioning
confidence: 99%
“…Accession numbers are as follows: BGI (SRR11191908), PacBio (SRR11191909), ONT PromethION (SRR11191910), ONT MinION (SRR11191911), and Illumina (SRR11191912). Assemblies and other supporting data are available from the GigaScience GigaDB repository [ 56 ].…”
Section: Data Availabilitymentioning
confidence: 99%
“…It demonstrates a possibility of combining SLR data with other data type, for instance, PacBio and ONT long reads, as well as other bioinformatics tools, for instance, single-nucleotide-polymorphism or structuralvariation callers. A typical association with long reads relates to the extension of scaffolds 48 and contigs 49,50 . It is necessary to take full advantages of different sequencing platforms and comprehensively utilize genomic information in short range (short reads, <1kb), medium range (long reads, 1k~10kb) and long range (ultra-long reads, SLR, Hi-C, BioNano, 10kb~Mb) with various error rates and balance their costs.…”
Section: Discussionmentioning
confidence: 99%