2022
DOI: 10.1093/bioadv/vbac054
|View full text |Cite
|
Sign up to set email alerts
|

Nanopore quality score resolution can be reduced with little effect on downstream analysis

Abstract: Motivation The use of high precision for representing quality scores in nanopore sequencing data makes these scores hard to compress and, thus, responsible for most of the information stored in losslessly compressed FASTQ files. This motivates the investigation of the effect of quality score information loss on downstream analysis from nanopore sequencing FASTQ files. Results We polished de novo assemblies for a mock microbia… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
1
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(1 citation statement)
references
References 28 publications
0
1
0
Order By: Relevance
“…Note that while quality scores occupy a significant amount of space even after compression, we focus on read sequences due to the relative lack of research in this area, and since quality scores are often ignored by downstream tools like minimap2 7 . Quality scores have also been compressed lossily without an impact on the downstream performance for short-read technologies 8 , 9 and more recently for nanopore itself 10 , 11 . RENANO 12 is a recent reference-based compressor that achieves significantly better compression for read sequences, but is limited to aligned data with a reference available.…”
Section: Introductionmentioning
confidence: 99%
“…Note that while quality scores occupy a significant amount of space even after compression, we focus on read sequences due to the relative lack of research in this area, and since quality scores are often ignored by downstream tools like minimap2 7 . Quality scores have also been compressed lossily without an impact on the downstream performance for short-read technologies 8 , 9 and more recently for nanopore itself 10 , 11 . RENANO 12 is a recent reference-based compressor that achieves significantly better compression for read sequences, but is limited to aligned data with a reference available.…”
Section: Introductionmentioning
confidence: 99%