2011
DOI: 10.1007/978-3-642-25821-3_5

Live Deduplication Storage of Virtual Machine Images in an Open-Source Cloud

Abstract: Deduplication is an approach of avoiding storing data blocks with identical content, and has been shown to effectively reduce the disk space for storing multi-gigabyte virtual machine (VM) images. However, it remains challenging to deploy deduplication in a real system, such as a cloud platform, where VM images are regularly inserted and retrieved. We propose LiveDFS, a live deduplication file system that enables deduplication storage of VM images in an open-source cloud that is deployed under low-co…

Citations: cited by 51 publications (39 citation statements)
References: 9 publications
“…Finally, we think it is worthwhile to investigate data compression and deduplication techniques that have been developed for VMI storage (e.g. [7,14]) in the context of VMI caches to gain even more storage efficacy.…”
Section: Discussion (mentioning)
confidence: 99%
“…Another solution for increasing deduplication throughput and reducing I/O latency is to use a Bloom filter and explore spatial locality by preserving the disk layout, then prefetching contiguous chunk signatures to cache as in DDFS. These two improvements were presented for an inline centralized deduplication system along with a novel fault-tolerant journaling mechanism for tracking system transactions, and recovering data and corresponding signatures in failure scenarios [Ng et al 2011].…”
Section: Primary Storage (mentioning)
confidence: 99%
“…It takes significant memory resource for filtering and caching. NG et al [12] use a related filtering technique for integrating deduplication in Linux file system and the memory consumed is up to 2GB for a single machine. That is still too big in our context discussed below.…”
Section: Background and Related Work (mentioning)
confidence: 99%