2016
DOI: 10.1109/jproc.2016.2571298
|View full text |Cite
|
Sign up to set email alerts
|

A Comprehensive Study of the Past, Present, and Future of Data Deduplication

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
73
0
1

Year Published

2017
2017
2024
2024

Publication Types

Select...
9
1

Relationship

1
9

Authors

Journals

citations
Cited by 235 publications
(85 citation statements)
references
References 118 publications
0
73
0
1
Order By: Relevance
“…Common approaches to reduce the data footprint are data compression (see, e.g., Iverson et al 2012;Storer 1988;Xia et al 2016b), sampling (see, e.g., Herrmann 2010Lohr 2009;Mendlik and Gobiet 2016) as well as statistical methods such as cluster analysis and principal component analysis (see, e.g., Rogerson 2015).…”
Section: Related Workmentioning
confidence: 99%
“…Common approaches to reduce the data footprint are data compression (see, e.g., Iverson et al 2012;Storer 1988;Xia et al 2016b), sampling (see, e.g., Herrmann 2010Lohr 2009;Mendlik and Gobiet 2016) as well as statistical methods such as cluster analysis and principal component analysis (see, e.g., Rogerson 2015).…”
Section: Related Workmentioning
confidence: 99%
“…Optimizing deduplication: Existing deduplication studies (see a complete survey [64] on deduplication) exploit workload characteristics (e.g., chunk locality [38,45,65,67] and file similarity [14,65]) to mitigate indexing overhead. For example, DDFS [67] prefetches the fingerprints of nearby chunks that are likely to be accessed together.…”
Section: Related Workmentioning
confidence: 99%
“…Next challenge worth mentioning here is about choosing appropriate type of deduplication process. Empirical studies have shown that source deduplication solution works well, when servers have adequate resources allocated for deduplication process and key management [28] mechanism. But, normally host machines rarely can allocate large amount of resources to deduplication process.…”
Section: Challenges Foundmentioning
confidence: 99%