2014
DOI: 10.1101/006551
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Compression of short-read sequences using path encoding

Abstract: Storing, transmitting, and archiving the amount of data produced by next generation sequencing is becoming a significant computational burden. For example, large-scale RNA-seq meta-analyses may now routinely process tens of terabytes of sequence. We present here an approach to biological sequence compression that reduces the difficulty associated with managing the data produced by large-scale transcriptome sequencing. Our approach offers a new direction by sitting between pure reference-based compression and r… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2015
2015
2015
2015

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 43 publications
0
1
0
Order By: Relevance
“…Currently, MFCompress [23], PathEnc [16], SCALCE [14], and fastqz [1] are some of the most efficient reads compression algorithms available in the literature. Every algorithm we have compared against, except for PathEnc, is a de novo compression algorithm.…”
Section: Datasets and Algorithms Used For Comparisonsmentioning
confidence: 99%
“…Currently, MFCompress [23], PathEnc [16], SCALCE [14], and fastqz [1] are some of the most efficient reads compression algorithms available in the literature. Every algorithm we have compared against, except for PathEnc, is a de novo compression algorithm.…”
Section: Datasets and Algorithms Used For Comparisonsmentioning
confidence: 99%