Compression of short-read sequences using path encoding

Kingsford, Carl; Patro, Rob

doi:10.1101/006551

Search citation statements

Order By: Relevance

Paper Sections

Select...

Datasets and Algorithms Used For Comparisons1

Citation Types

Supporting

Mentioning

Contrasting

Year Published

2015

Publication Types

Select...

Book1

Relationship

Self Cite0

Independent1

Authors

Journals

Cited by 1 publication

(1 citation statement)

References 43 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Currently, MFCompress [23], PathEnc [16], SCALCE [14], and fastqz [1] are some of the most efficient reads compression algorithms available in the literature. Every algorithm we have compared against, except for PathEnc, is a de novo compression algorithm.…”

Section: Datasets and Algorithms Used For Comparisonsmentioning

confidence: 99%

NRRC: A Non-referential Reads Compression Algorithm

Saha

Rajasekaran

2015

Bioinformatics Research and Applications

View full text Add to dashboard Cite

Abstract. In the era of modern sequencing technology, we are collecting a vast amount of biological sequence data. The technology to store, process, and analyze the data is not as cheap as to generate the sequencing data. As a result, the need for devising efficient data compression and data reduction techniques is growing by the day. Although there exist a number of sophisticated general purpose compression algorithms, they are not efficient to compress biological data. As a result, we need specialized compression algorithms targeting biological data. Five different NGS data compression problems have been identified and studied. In this article we propose a novel algorithm for one of these problems. We have done extensive experiments using real sequencing reads of various lengths. The simulation results reveal that our proposed algorithm is indeed competitive and performs better than the best known algorithms existing in the current literature.

show abstract