2019
DOI: 10.48550/arxiv.1910.09727
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Mitigating the Performance-Efficiency Tradeoff in Resilient Memory Disaggregation

Abstract: Memory disaggregation has received attention in recent years as a promising idea to reduce the total cost of ownership (TCO) of memory in modern datacenters. However, relying on remote memory expands an application's failure domain and makes it susceptible to tail latency variations. In attempts to making disaggregated memory resilient, stateof-the-art solutions face the classic tradeoff between performance and efficiency: some double the memory overhead of disaggregation by replicating to remote memory, while… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(5 citation statements)
references
References 22 publications
0
5
0
Order By: Relevance
“…DDA and SR-IOV require the whole address range of a VM to be statically pinned in hypervisor-level page tables [15,[75][76][77][78][79]. As a consequence, memory disaggregation cannot rely on hypervisor page faults, which would be needed to deploy existing RDMA-based systems at the platform level [29][30][31][32][33][34][35][36][37]. For the same reason, we cannot migrate pages between local memory and a pool as used in existing two-tier memory systems [31,[49][50][51][52].…”
Section: Design Goals and Requirementsmentioning
confidence: 99%
See 2 more Smart Citations
“…DDA and SR-IOV require the whole address range of a VM to be statically pinned in hypervisor-level page tables [15,[75][76][77][78][79]. As a consequence, memory disaggregation cannot rely on hypervisor page faults, which would be needed to deploy existing RDMA-based systems at the platform level [29][30][31][32][33][34][35][36][37]. For the same reason, we cannot migrate pages between local memory and a pool as used in existing two-tier memory systems [31,[49][50][51][52].…”
Section: Design Goals and Requirementsmentioning
confidence: 99%
“…ThymesisFlow advocates application changes for performance, while we focuse on platform-level MLdriven pool memory management that is transparent to users. Hypervisor/OS level disaggregation: Hypervisor/OS level approaches [11,[28][29][30][31][32][33][34][35][36][37] rely on page faults and access monitoring to maintain the working set in local DRAM. Such OSbased approaches bring significant overhead, jitter, and are incompatible with virtualization acceleration (e.g., DDA).…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…In our prototype, we do not assume that we are given a reliable disaggregated memory [54,91], but rather show how to implement a reliable disaggregated memory using RDMA. 2 To do so, we assume 2𝑓 𝑚 +1 memory nodes out of which 𝑓 𝑚 can fail.…”
Section: Modelmentioning
confidence: 99%
“…Carbink [91] and Hydra [54] build reliable disaggregated memory to improve memory utilization in a cluster, albeit without support for concurrent shared access. MIND [53], GAM [21] and Clover [85], on the other hand, provide reliable shared memory, but they do not tolerate Byzantine writers.…”
Section: Related Workmentioning
confidence: 99%