Proceedings of the Twenty-Third International Conference on Architectural Support for Programming Languages and Operating Systems, 2018
DOI: 10.1145/3173162.3173178

Blasting through the Front-End Bottleneck with Shotgun

Abstract: ASPLOS is the premier forum for multidisciplinary systems research spanning computer architecture and hardware, programming languages and compilers, and operating systems and networking. ASPLOS 2018 will be held in Williamsburg, Virginia, a town that combines a rich slice of American Colonial and Revolutionary history with a modern college atmosphere.

Citations: cited by 38 publications (62 citation statements)
References: 16 publications
“…For each trace, we first warmup for 50M instructions and then report performance for the next 50M instructions. BTB designs: Based on the performance improvement and storage overhead, we use the following recent works for our evaluation: (i) Shotgun [3] (ii) SN4L+Dis+BTB [4], (iii) Skewed BTB [12] and (iv) FDIP-X [11]. Table II provides the details about these techniques.…”
Section: Evaluation, A. Evaluation Methodology
Citation type: mentioning
confidence: 99%
“…Prefetch or demand blocks which are filled into the L1I cache are predecoded, and the branches are stored in the Air BTB. Shotgun BTB [3]: Shotgun BTB proposes a new BTB design segregated by the type of control flow jumps. It uses three kinds of BTB: (i) C-BTB for conditional branches, (ii) U-BTB for unconditional branches, and (iii) RIB for return.…”
Section: Related Work
Citation type: mentioning
confidence: 99%
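
The statement above describes a BTB split by branch type. As a rough illustration only, the following Python sketch routes each branch to one of three small tables. The class names (`SplitBTB`, `SmallTable`), the table sizes, the fully associative organization, and the LRU replacement are all assumptions made for this sketch; the quoted text specifies only that there are three structures, one each for conditional branches, unconditional branches, and returns.

```python
from collections import OrderedDict
from enum import Enum


class BranchKind(Enum):
    CONDITIONAL = "conditional"      # would live in the C-BTB
    UNCONDITIONAL = "unconditional"  # would live in the U-BTB
    RETURN = "return"                # would live in the RIB


class SmallTable:
    """Fully associative table with LRU replacement (assumed for this sketch)."""

    def __init__(self, capacity):
        self.capacity = capacity
        self.entries = OrderedDict()

    def insert(self, pc, payload):
        if pc in self.entries:
            self.entries.move_to_end(pc)
        self.entries[pc] = payload
        if len(self.entries) > self.capacity:
            self.entries.popitem(last=False)  # evict the least recently used entry

    def lookup(self, pc):
        if pc not in self.entries:
            return None
        self.entries.move_to_end(pc)  # refresh recency on a hit
        return self.entries[pc]


class SplitBTB:
    """Hypothetical three-way split: one table per branch kind."""

    def __init__(self, c_size=4, u_size=4, rib_size=4):
        self.tables = {
            BranchKind.CONDITIONAL: SmallTable(c_size),
            BranchKind.UNCONDITIONAL: SmallTable(u_size),
            BranchKind.RETURN: SmallTable(rib_size),
        }

    def update(self, pc, kind, target=None):
        # Each branch only ever occupies the table that matches its kind.
        self.tables[kind].insert(pc, target)

    def predict(self, pc):
        # Probe every table; return (kind, stored target) on the first hit.
        for kind, table in self.tables.items():
            if pc in table.entries:
                return kind, table.lookup(pc)
        return None


btb = SplitBTB(c_size=2, u_size=2, rib_size=2)
btb.update(0x100, BranchKind.CONDITIONAL, 0x140)
btb.update(0x200, BranchKind.UNCONDITIONAL, 0x800)
btb.update(0x300, BranchKind.RETURN)
```

One consequence of the split that this sketch makes visible: filling the conditional table with new branches can evict other conditional branches, but never an unconditional branch or a return, because each kind has its own capacity.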
“…To mitigate the high latency cost of page walks, Translation Lookaside Buffers (TLBs) cache the most recently used virtual-to-physical translations. Despite the use of multi-level TLB hierarchies and other hardware and software schemes for accelerating address translation, frequent data TLB misses still cause significant performance degradation due to long miss penalties [30,32,40,47,54,58,63]. In response, the research community has proposed many techniques for reducing the overhead of address translation associated with data accesses [36,38,53,56,60,66,68,69,73,74,79,82].…”
Section: Introduction
Citation type: mentioning
confidence: 99%
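
To make the caching behavior described in the statement above concrete, here is a minimal Python sketch of a single-level TLB that keeps the most recently used translations and falls back to a page-table lookup on a miss. The 4 KiB page size, the LRU policy, the dictionary standing in for the page table, and the name `TinyTLB` are assumptions made for illustration; none of them come from the cited work.

```python
from collections import OrderedDict

PAGE_SHIFT = 12  # assumes 4 KiB pages


class TinyTLB:
    """Hypothetical single-level TLB with LRU replacement."""

    def __init__(self, capacity, page_table):
        self.capacity = capacity
        self.page_table = page_table  # dict: virtual page number -> physical frame number
        self.entries = OrderedDict()
        self.hits = 0
        self.misses = 0

    def translate(self, vaddr):
        vpn = vaddr >> PAGE_SHIFT
        offset = vaddr & ((1 << PAGE_SHIFT) - 1)
        if vpn in self.entries:
            self.hits += 1
            self.entries.move_to_end(vpn)  # mark as most recently used
        else:
            self.misses += 1
            # The dictionary lookup stands in for a (slow) page walk.
            self.entries[vpn] = self.page_table[vpn]
            if len(self.entries) > self.capacity:
                self.entries.popitem(last=False)  # evict the least recently used entry
        return (self.entries[vpn] << PAGE_SHIFT) | offset


page_table = {0: 5, 1: 7, 2: 9}
tlb = TinyTLB(capacity=2, page_table=page_table)
results = [hex(tlb.translate(a)) for a in (0x0010, 0x0020, 0x1000, 0x2004, 0x0010)]
```

With a capacity of two entries, touching a third page evicts the oldest translation, so the final access to page 0 misses again. This is the kind of repeated miss penalty the quoted statement refers to.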