2022
DOI: 10.1109/tc.2021.3064352
|View full text |Cite
|
Sign up to set email alerts
|

Toward QoS-Awareness and Improved Utilization of Spatial Multitasking GPUs

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
22
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
3
3

Relationship

1
5

Authors

Journals

citations
Cited by 22 publications
(22 citation statements)
references
References 28 publications
0
22
0
Order By: Relevance
“…Several prior works target similar problems. Laius [80] manages the SM allocation to ensure the QoS of a single application, while the performance of the co-located low priority applications can be sacri ced. It is not applicable for GPU microservices through simple adaption for three reasons.…”
Section: De Ciencies Of Prior Workmentioning
confidence: 99%
See 4 more Smart Citations
“…Several prior works target similar problems. Laius [80] manages the SM allocation to ensure the QoS of a single application, while the performance of the co-located low priority applications can be sacri ced. It is not applicable for GPU microservices through simple adaption for three reasons.…”
Section: De Ciencies Of Prior Workmentioning
confidence: 99%
“…The shared resource usage is used to quantify the runtime contention between microservices on a GPU since only the SMs can be explicitly allocated. We also use the process pool technique [80] to enable dynamic SM allocation.…”
Section: Astraea Runtime Systemmentioning
confidence: 99%
See 3 more Smart Citations