2011
DOI: 10.6026/97320630005350
|View full text |Cite
|
Sign up to set email alerts
|

DNABIT Compress-Genome Compression Algorithm

Abstract: Data compression is concerned with how information is organized in data. Efficient storage means removal of redundancy from the data being stored in the DNA molecule. Data compression algorithms remove redundancy and are used to understand biologically important molecules. We present a compression algorithm, “DNABIT Compress” for DNA sequences based on a novel algorithm of assigning binary bits for smaller segments of DNA bases to compress both repetitive and non repetitive DNA sequence. Our proposed algori… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
22
0

Year Published

2013
2013
2019
2019

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 35 publications
(22 citation statements)
references
References 17 publications
0
22
0
Order By: Relevance
“…[35] describes a particular run-length encoding scheme. Finally, [36] focuses on the analysis of how to store repeats with variable-size codes.…”
Section: Naive Bit Manipulation Algorithmsmentioning
confidence: 99%
“…[35] describes a particular run-length encoding scheme. Finally, [36] focuses on the analysis of how to store repeats with variable-size codes.…”
Section: Naive Bit Manipulation Algorithmsmentioning
confidence: 99%
“…Existing short-read compression approaches generally fall into categories: reference-based schemes (Campagne et al, 2013; Fritz et al, 2011; Li et al, 2014) attempt to compress reads by aligning them to one or more known reference sequences and recording edits between the read and its mapped location in the reference. De novo compression schemes (Adjeroh et al, 2002; Bhola et al, 2011; Bonfield and Mahoney, 2013; Brandon et al, 2009; Burriesci et al, 2012; Cox et al, 2012; Deorowicz and Grabowski, 2011; Hach et al, 2012; Jones et al, 2012; Kozanitis et al, 2011; Popitsch and von Haeseler, 2013; Rajarajeswari and Apparao, 2011; Tembe et al, 2010) attempt to compress without appeal to a reference. SCALCE (Hach et al, 2012) is one of the most effective, and works by reordering reads within the FASTA file to boost the compression of general purpose compressors.…”
Section: Introductionmentioning
confidence: 99%
“…DNABIT (Rajarajeswari and Apparao, 2011) has two phases, first even bit technique which assigns two bits for every nucleotide of non-repeat regions; second phase is odd bit technique which assigns 3, 5, 7 or 9 bits based on the size of repeat regions.…”
Section: Related Workmentioning
confidence: 99%
“…General purpose compression algorithms expand the sequences rather than compressing (Rajarajeswari and Apparao, 2011), so they cannot achieve the same compression ratio as specialized DNA sequences compression algorithms.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation