2022
DOI: 10.1016/j.future.2021.12.016
|View full text |Cite
|
Sign up to set email alerts
|

gShare: A centralized GPU memory management framework to enable GPU memory sharing for containers

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3

Citation Types

0
2
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
4
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(3 citation statements)
references
References 30 publications
0
2
0
Order By: Relevance
“…Space sharing is suitable when a single application cannot efficiently use the entire GPU, which is addressed by Gavel [25] and Gslice [69] in MPS sharing mode. Many other works have addressed various areas in GPU sharing including communication, memory allocation, and latency sensitivity [70][71][72][73][74][75]. However, none of the above works addresses the challenges and limitations of using MIG-enabled GPU sharing which, as we discussed in Sec.…”
Section: Related Workmentioning
confidence: 99%
“…Space sharing is suitable when a single application cannot efficiently use the entire GPU, which is addressed by Gavel [25] and Gslice [69] in MPS sharing mode. Many other works have addressed various areas in GPU sharing including communication, memory allocation, and latency sensitivity [70][71][72][73][74][75]. However, none of the above works addresses the challenges and limitations of using MIG-enabled GPU sharing which, as we discussed in Sec.…”
Section: Related Workmentioning
confidence: 99%
“…Space sharing is suitable when a single application cannot efficiently use the entire GPU, which is addressed by Gavel [27] and Gslice [71] in MPS sharing mode. Many other works have addressed various areas in GPU sharing including communication, memory allocation, and latency sensitivity [72][73][74][75][76]. However, none of the above works addresses the challenges and limitations of using MIG-enabled GPU sharing which, as we discussed in Sec.…”
Section: Related Workmentioning
confidence: 99%
“…Space sharing is suitable when a single application cannot efficiently use the entire GPU, which is addressed by Gavel [65] and Gslice [95] in MPS sharing mode. Many other works have addressed various areas in GPU sharing including communication, memory allocation, and latency sensitivity [96,97,98,99,100]. This dissertation explores a newly introduced feature of Multi-Instance GPU (MIG) sharing to improve system throughput and reduce carbon emissions.…”
Section: Related Workmentioning
confidence: 99%