2014
DOI: 10.1145/2611778

A Survey and Classification of Storage Deduplication Systems

Abstract: The automatic elimination of duplicate data in a storage system, commonly known as deduplication, is increasingly accepted as an effective technique to reduce storage costs. Thus, it has been applied to different storage types, including archives and backups, primary storage, within solid-state drives, and even to random access memory. Although the general approach to deduplication is shared by all storage types, each poses specific challenges and leads to different trade-offs and solutions. This diversity is …

Cited by 111 publications (41 citation statements)
References 51 publications (69 reference statements)
“…In contrast, a distributed deduplication includes multiple nodes to perform deduplication, thus parallelizing the deduplication process. Some distributed deduplication systems have separate indexing structures [22]. Some deduplication systems have a centralized index structure.…”
Section: Research Background
Citation type: mentioning
confidence: 99%
“…This policy allows separate file sets for applications with different performance norms, as some may not allow the performance consequence introduced by deduplication. And performing effective lookup and update operations [9].…”
Section: Deduplication Types 41 Offline Deduplication
Citation type: mentioning
confidence: 99%
“…On the other side, offline deduplication may introduce extra reads from the storage, it requires more storage space, and increases concurrency issues, and increases the complexity of the deduplication process. These problems driven to the development [9]. …”
Section: Inline Deduplication
Citation type: mentioning
confidence: 99%
“…Paulo and Pereira present a broad survey of different deduplication systems [20]. The authors set the systems in relation to each other and show common properties.…”
Section: Related Work
Citation type: mentioning
confidence: 99%