2008 Fifth IEEE International Workshop on Storage Network Architecture and Parallel I/Os 2008
DOI: 10.1109/snapi.2008.11
|View full text |Cite
|
Sign up to set email alerts
|

ADMAD: Application-Driven Metadata Aware De-duplication Archival Storage System

Abstract: There is a huge amount of duplicated or redundant data in current storage systems. So Data Deduplication, which uses lossless data compression schemes to minimize the duplicated data at the interfile level, has been receiving broad attention in recent years. But there are still research challenges in current approaches and storage systems, such as: how to chunking the files more efficiently and better leverage potential similarity and identity among dedicated applications; how to store the chunks effectively a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
28
0

Year Published

2009
2009
2020
2020

Publication Types

Select...
3
3
2

Relationship

0
8

Authors

Journals

citations
Cited by 43 publications
(28 citation statements)
references
References 13 publications
0
28
0
Order By: Relevance
“…The are a few works [25,27] that attempt to adjust the chunking algorithm to the content type for improved redundancy exposure. [27] employs specialized chunking for flash videos (FLV) and MP3 files and improves the data compression rate by up to 27% compared with that of Rabin's fingerprinting.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…The are a few works [25,27] that attempt to adjust the chunking algorithm to the content type for improved redundancy exposure. [27] employs specialized chunking for flash videos (FLV) and MP3 files and improves the data compression rate by up to 27% compared with that of Rabin's fingerprinting.…”
Section: Related Workmentioning
confidence: 99%
“…[27] employs specialized chunking for flash videos (FLV) and MP3 files and improves the data compression rate by up to 27% compared with that of Rabin's fingerprinting. [25] improves the NRE performance by adjusting the chunk size to the characteristics of the content.…”
Section: Related Workmentioning
confidence: 99%
“…LBFS [25] is the first file system that breaks files into variable sized chunks using content defined chunking algorithm instead of fixed chunk size, which overcomes the shifting problem and can identify much more redundant data. To achieve a higher compression ratio, other chunking methods, such as fingerdiff [26], TTTD [27], ADMAD [28], provide more flexibility on the variability of chunk sizes. Fuzzy block matching [29] combines shingling and error-correcting to reduce file transmission costs.…”
Section: Related Workmentioning
confidence: 99%
“…Even if this chunking method is not restricted to Rabin Fingerprints, CDC and Rabin Fingerprints are sometimes used interchangeably [14].…”
Section: Introductionmentioning
confidence: 99%
“…by splitting email archives exactly at mail boundaries. The ADMAD deduplication system incorporates application-specific chunking methods, which are automatically selected based on file metadata [14]. It achieves higher compression ratios for MP3 files (43.9%), Flash video files (27%), HTML files (48%), and Email (14.3%) in comparison to Content-defined Chunking using Rabin Fingerprints.…”
Section: Introductionmentioning
confidence: 99%