2018
DOI: 10.1002/cpe.4666
|View full text |Cite
|
Sign up to set email alerts
|

A survey of techniques for dynamic branch prediction

Abstract: Branch predictor (BP) is an essential component in modern processors since high BP accuracy can improve performance and reduce energy by decreasing the number of instructions executed on wrong-path. However, reducing the latency and storage overhead of BP while maintaining high accuracy presents significant challenges. In this paper, we present a survey of dynamic branch prediction techniques. We classify the works based on key features to underscore their differences and similarities. We believe this paper wi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
13
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
5
2
1

Relationship

1
7

Authors

Journals

citations
Cited by 33 publications
(17 citation statements)
references
References 112 publications
(360 reference statements)
0
13
0
Order By: Relevance
“…The overhead of masking is visible in the boundary column of the Table 7. To conclude, Intel released for its latest product native conditional SIMD instructions that should improve the performance, and a full investigation of branching and the performance of the predictor is a tough task, as it is demonstrated in the last survey of the technology [Mittal 2018], or the microprocessor branching unit description of Fog [2016].…”
Section: Boundary Limitsmentioning
confidence: 99%
“…The overhead of masking is visible in the boundary column of the Table 7. To conclude, Intel released for its latest product native conditional SIMD instructions that should improve the performance, and a full investigation of branching and the performance of the predictor is a tough task, as it is demonstrated in the last survey of the technology [Mittal 2018], or the microprocessor branching unit description of Fog [2016].…”
Section: Boundary Limitsmentioning
confidence: 99%
“…The lower performance of Atom is due to the higher latency of divide operation, smaller cache, and higher branch misprediction rate due to the complex control flow of search operation . Yet, the data path resources of Xeon are over‐provisioned, which leads to higher area.…”
Section: Processor Architecture Level Techniquesmentioning
confidence: 99%
“…The lower performance of Atom is due to the higher latency of divide operation, smaller cache, and higher branch misprediction rate due to the complex control flow of search operation. 92 Yet, the data path resources of Xeon are over-provisioned, which leads to higher area. Overall, since different functions/phases of the search operation have different bottlenecks, a variety of strategies are required, eg, application tuning, architectural improvements, etc.…”
Section: Getting Insights Into Browser Applications' Characteristicsmentioning
confidence: 99%
“…This information, referred to as global branch history, is then used to predict the incoming branch's direction. Branch predictor proposals in literature track specific branches in the global history [27] or only use the specific branch's prior history (referred to as local history) to provide a prediction. TAgged GEometric history length predictor (TAGE) [35] is one of the most successful branch predictor proposals.…”
Section: Introductionmentioning
confidence: 99%