2018 51st Annual IEEE/ACM International Symposium on Microarchitecture (MICRO) 2018
DOI: 10.1109/micro.2018.00036
|View full text |Cite
|
Sign up to set email alerts
|

Neighborhood-Aware Address Translation for Irregular GPU Applications

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
12
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
4
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 23 publications
(18 citation statements)
references
References 23 publications
0
12
0
Order By: Relevance
“…Sensitivity of Valkyrie to L1-TLB size: While 128-entry L1-TLBs have been shown to provide the best performance [37], 64entry L1-TLBs for GPUs are also quite common [9,41,42]. In a spirit similar to how we propose to split the L1-TLB area into a smaller L1-TLB and a prefetch buer, we evaluated a design with 52entry L1-TLB and a 12-entry prefetch buer (we scaled down both the L1-TLB size and prefetch buer size).…”
Section: Evaluation Resultsmentioning
confidence: 99%
See 4 more Smart Citations
“…Sensitivity of Valkyrie to L1-TLB size: While 128-entry L1-TLBs have been shown to provide the best performance [37], 64entry L1-TLBs for GPUs are also quite common [9,41,42]. In a spirit similar to how we propose to split the L1-TLB area into a smaller L1-TLB and a prefetch buer, we evaluated a design with 52entry L1-TLB and a 12-entry prefetch buer (we scaled down both the L1-TLB size and prefetch buer size).…”
Section: Evaluation Resultsmentioning
confidence: 99%
“…To further improve the performance of Valkyrie, we plan to integrate it with other approaches that improve the page-walk performance [41,42]. We also plan to study eective communication mechanisms and hierarchical network designs such as fat-trees [32] to support more ecient inter-L1 TLB communication schemes.…”
Section: Resultsmentioning
confidence: 99%
See 3 more Smart Citations