0% found this document useful (0 votes)

12 views17 pages

Cs146-Lecture7 2

The document outlines Lecture 7 of Computer Science 146 at Harvard University, focusing on Dynamic Branch Prediction and related concepts such as Tomasulo's Algorithm, register renaming, and control hazards. It discusses various strategies for reducing control hazards, including dynamic hardware branch prediction, and details different branch prediction methods and their implementations. The lecture also covers the importance of predicting branch directions and targets to enhance processor performance.

Uploaded by

srivadeepanshu

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

12 views17 pages

Cs146-Lecture7 2

Uploaded by

srivadeepanshu

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 17

Computer Science 146

Computer Architecture
Fall 2019
Harvard University

Instructor: Prof. David Brooks

[email protected]

Lecture 7: Dynamic Branch Prediction

Computer Science 146

David Brooks

Lecture Outline
• Tomasulo’s Algorithm Review (3.1-3.3)
• Pointer-Based Renaming (MIPS R10000)
• Dynamic Branch Prediction (3.4)
– Yeh + Patt Paper
• Other Front-end Optimizations (3.5)
– Branch Target Buffers/Return Address Stack

Computer Science 146

David Brooks

1
Tomasulo Review
• Reservation Stations
– Distribute RAW hazard detection
– Renaming eliminates WAW hazards
– Buffering values in Reservation Stations removes WARs
– Tag match in CDB requires many associative compares
• Common Data Bus
– Achilles heal of Tomasulo
– Multiple writebacks (multiple CDBs) expensive
• Load/Store reordering
– Load address compared with store address in store buffer

Computer Science 146

David Brooks

Tomasulo Organization
From Mem FP Op FP Registers
Queue
Load Buffers
Load1
Load2
Load3
Load4
Load5 Store
Load6
Buffers

Add1
Add2 Mult1
Add3 Mult2

Reservation To Mem
Stations
FP
FP adders
adders FP
FP multipliers
multipliers

Common Data Bus (CDB)

Computer Science 146
David Brooks

2
Tomasulo Review
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20
LD F0, 0(R1) Iss M1 M2 M3 M4 M5 M6 M7 M8 Wb

MUL F4, F0, F2 Iss Iss Iss Iss Iss Iss Iss Iss Iss Ex Ex Ex Ex Wb

SD 0(R1), F0 Iss Iss Iss Iss Iss Iss Iss Iss Iss Iss Iss Iss Iss M1 M2 M3 Wb

SUBI R1, R1, 8 Iss Ex Wb

BNEZ R1, Loop Iss Ex Wb

LD F0, 0(R1) Iss Iss Iss Iss M Wb

MUL F4, F0, F2 Iss Iss Iss Iss Iss Ex Ex Ex Ex Wb

SD 0(R1), F0 Iss Iss Iss Iss Iss Iss Iss Iss Iss M1 M2 M3 Wb

SUBI R1, R1, 8 Iss Ex Wb

BNEZ R1, Loop Iss Ex Wb

LD F0, 0(R1) Iss M1 M2 M3 M4 M5 M6 M7 M8 Wb

MUL F4, F0, F2 Iss Iss Iss Iss Iss

SD 0(R1), F0 Iss Iss Iss Iss

Computer Science 146

David Brooks

Register Renaming: Pointer-Based

• MIPS R10K, Alpha 21264, Pentium 4, POWER4
• Mapper/Map Table: Hardware to hold these
mappings
– Register Writes: Allocate new location, note mapping in
table
– Register Reads: Look in map table, find location of most
recent write
• Deallocate mappings when done

Computer Science 146

David Brooks

3
Register Renaming: Example
– Mapper/Map Table: Hardware to hold these mappings
• Register Writes: Allocate new location, note mapping in table
• Register Reads: Look in map table, find location of most recent write
– Deallocate mappings when done
• Assume
– 4 Architected/Logical Registers (F1,F2,F3,F4) “names”
– 8 Physical/Rename Registers (P1—P8) “locations”
• Code – Lots of Potential WAR/WAW, also RAWs
ADD R1, R2, R4
SUB R4, R1, R2
ADD R3, R1, R3
ADD R1, R3, R2

Computer Science 146

David Brooks

Register Renaming: Example

Map Table
Initial Mapping R1 R2 R3 R4
P1 P2 P3 P4
ADD R1, R2, R4 P5 P2 P3 P4 ADD P5, P2, P4
SUB R4, R1, R2 P5 P2 P3 P6 SUB P6, P5, P2
ADD R3, R1, R3 P5 P2 P7 P6 ADD P7, P5, P3
ADD R1, R3, R2 P8 P2 P7 P6 ADD P8, P7, P2

Computer Science 146

David Brooks

4
Control Hazards
• Key to performance in current microprocessors
• Almost every design decision changes if we
assume “perfect” rather than realistic branch
prediction

Computer Science 146

David Brooks

Strategies to reduce control hazards

• Compiler techniques reduce branch frequency
• Hardwired strategies for responding to branches –
“assume not taken”
• Delayed branches
• Nullifying branches
• Compiler hints to suggest likely outcomes
• Dynamic hardware branch prediction

Computer Science 146

David Brooks

5
Compiler techniques to reduce
branch frequency
• Loop unrolling
– Will discuss in detail in Chapter 4
• Constant propagation • Procedure inlining/cloning
N=0; foo(int i) {
… return(2*i);
A=b*N; A=0; }
… a=foo(b);
If(A==0) { Inlining => a=2*b;
}

Computer Science 146

David Brooks

Branch prediction methods

• When is information about branches
gathered/applied?
– When the machine is designed
– When the program is compiled (“compile-time”) (ch.4)
– When a “training run” of the program is executed
(“profile-based”)
– As the program is executing (“dynamic”)

Computer Science 146

David Brooks

6
Why predict? Speculative Execution
• Execute beyond branch boundaries before the
branch is resolved
• Correct Speculation
– Avoid stall, result is computed early, performance++
• Incorrect Speculation
– Abort/squash incorrect instructions, complexity+
– Undo any incorrect state changes, complexity++
• Performance gain is weighed vs. penalty
• Speculation accuracy = branch prediction accuracy
Computer Science 146
David Brooks

Dynamic Hardware Branch

Prediction
• Branch behavior is monitored during program execution
– History data can influence prediction of future executions of
the branch instruction
• Branches instruction execution has two tasks/predictions
– Condition evaluation (taken or not-taken)
– Target address calculation (where to go when taken)
• Target prediction also applies to unconditional branches
• Branch Direction Prediction: 3 levels of complexity
– Branch history tables, Two-level tables, hybrid predictors

Computer Science 146

David Brooks

7
Branch Direction Prediction
• Basic idea: Hope that future behavior of the
branch is correlated to past behavior
– Loops
– Error-checking conditionals
• For a single branch PC
– Simplest possible idea: Keep 1 bit around to indicate
taken or not-taken
– 2nd simplest idea: Keep 2 bits around, saturating counter

Computer Science 146

David Brooks

Two-bit Saturating Counters

Taken
Not Taken
“strongly
taken” Predict Taken Predict Taken
11 10
Taken

Taken Not Taken

Not Taken
“strongly
Predict Not Taken Predict Not Taken not taken”
01 00
Taken
Not Taken
• 2-bit FSMs mean prediction must miss twice before change
• N-bit predictors are possible, but after 2-bits not much benefit
Computer Science 146
David Brooks

8
Example: Two-bit Vs. 1-bit
Branch Prediction
Branch Outcome T T T N T T T N T T T N % predict rate

1-bit Prediction N T T T N T T T N T T T

1-bit Mis-Predict? Y Y Y Y Y Y ~50%

2-bit Prediction n T T t T T T t T T T t

2-bit Mis-Predict? Y Y Y Y ~75%

• 2-bit “hysterisis” helps

Computer Science 146
David Brooks

Branch Prediction Buffer

(branch history table, BHT)
PC • Small memory indexed with low bits of the
12-bits branch instruction’s address
– Why the low bits?
• Implementation
– Separate memory accessed during IF phase
– 2-bits attached to each block in the Instruction
Taken or
Cache
Not-taken?
• Caveats: Cannot separately size I-Cache and BHT
• What about multiple branches in a cache line?
– Does this help our simple 5-stage pipeline?

212 = 4K Entries

Computer Science 146

David Brooks

9
Correlating Predictors
• 2-bit scheme only looks at branch’s own history to
predict its behavior
• What if we use other branches to predict it as well?
if (aa==2)aa=0; // Branch #1
if (bb==2)bb=0; // Branch #2
if (aa!=bb){..} // Branch #3

• Clearly branch #3 depends on outcome of #1 and #2

• Prediction must be a function of own branch as well as
recent outcomes of other branches

Computer Science 146

David Brooks

Two-level Adaptive Branch

Prediction (Correlating Predictor)
PC • Two-level BP requires to main
12-bits
components
2-bit BHR
0 0 – Branch history register (BHR):
recent outcomes of branches (last
k branches encountered)
– Pattern History Table (PHT):
branch behavior for last s
Taken or occurrences of the specific pattern
Not-taken? of these k branches
– In effect, we concatenate BHR
with Branch PC bits
• Can also XOR (GSHARE), etc
212 = 4K Entries each (PHTs)
Computer Science 146
David Brooks

10
Branch History Register
• Simple shift register
– Shift in branch outcomes as they occur
– 1 => branch was taken
– 0 => branch was not-taken
– k-bit BHR => 2k patterns
– Use these patterns to address into the Pattern History Table

Computer Science 146

David Brooks

Pattern History Table

• Has 2k entries
• Usually uses a 2-bit counter for the prediction
• Each entry summarizes branch results for the last s
times that BHR pattern was seen
– Not a shift register, usually an FSM
• BHR is used to address the PHT

Computer Science 146

David Brooks

11
Variations on 2-Level BP
• See Yeh + Patt for details
• Variations depend on
– How many branches share a BHR
– How many branches share a PHT
• 3 possibilities for each: global, per-address, per-set
• 9 total!
– GAg, GAs, GAp
– PAg, PAs, PAp
– SAg, SAs, SAp

Computer Science 146

David Brooks

2-level Branch History

• Global history -- 1 Branch History Register (BHR)
• Per-address/set history
– Per-Address/set Branch History Table holds many BHRs
PC

k-bits
k-bits
k-bits
Taken or
Not-taken?
K-bits
k-bits
k-bits
k-bits

Computer Science 146

David Brooks

12
Hardware Costs of 2-level
predictions
• (m,n) predictor Î m-bits of global history, n-bit
predictor
• 2m*n*Number of prediction entries
• Say you have m-bits of history (m=2)
• n-bits of predictor per entries (n=2)

(2,2) predictor with 1K prediction entries

22*2*1024 = 8K-bits

Computer Science 146

David Brooks

Variations on the basics --

GSHARE
• Gshare a variant on GAg
PC
12-bits • Don’t use BHR directly to address PHT
BHR
• Instead, XOR bits of BHR with bits of
PC (branch address) and use that to
index PHT
• Tries to separate out the
XOR behaviors/predictions associated with
different branches, without extra
hardware of PA and SA schemes

Computer Science 146

David Brooks

13
Hybrid Branch Predictors
• Tournament predictors: Adaptively combine local
and global predictors
• Different schemes work better for different branches
Could be Local Global Could be
2-bit BHT G-share
Predictor Predictor

Chooser
Predictor
Taken or
Not-taken?

Computer Science 146

David Brooks

Branch Predictor Performance

Computer Science 146

David Brooks

14
Branch Target Prediction
• So far we have only talked about predicting
direction
• We still need to predict the address
– Branch Target Buffer (BTB)
• Useful for conditional/unconditional branches
– Return Address Stack (RAS)
• Useful for procedure returns

Computer Science 146

David Brooks

Branch Target Buffer

• Simple pipeline resolves stages in ID
– We’d really like to know by the end of IF so we can proceed
without a bubble
• Idea:
– As part of IF use the instruction address (every instruction) to do a
lookup in the BTB
– For N recently executed branches, hold the predicted PC value
(may also hold additional prediction bits)
– If instruction is not a branch, don’t add to BTB
– If BTB fails revert to earlier method
• Either instruction is not a branch
• Or, there is no predictor entry for that branch
– Many more bits per entry than BHT

Computer Science 146

David Brooks

15
Branch Target Buffer

Computer Science 146

David Brooks

Branch Target Cache

• Similar to BTB, but we also want to know the
target instruction!
– Prediction returns not just the direction address, but
also the instruction stored there
– Allows zero-cycle branches (branch-folding)
• Send target-instruction to ID rather than branch
• Branch is not sent into pipe

Computer Science 146

David Brooks

16
Return Address Stack
• Included in many recent processors
– Alpha 21264 => 12 entry RAS
• Procedure returns account for ~85% of indirect jumps
• Like a hardware stack, LIFO
– Procedure Call => Push Return PC onto stack
– Procedure Return => Prediction off of top of stack, Pop it
• RAS tends to work quite well since call depths are
typically not large
• Problem: Speculative state! More next time
Computer Science 146
David Brooks

For next time

• Multiple Issue Machines
• Hardware Speculation
– Performance and Precise Interrupts

Computer Science 146

David Brooks

Computer Organization and Design Pipeliing-Chapter+4 Slides
No ratings yet
Computer Organization and Design Pipeliing-Chapter+4 Slides
131 pages
Advanced Computer Architectures: 17CS72 (As Per CBCS Scheme)
No ratings yet
Advanced Computer Architectures: 17CS72 (As Per CBCS Scheme)
31 pages
5 4-Pipelining
No ratings yet
5 4-Pipelining
10 pages
Branch Prediction: Prof. Mikko H. Lipasti University of Wisconsin-Madison
No ratings yet
Branch Prediction: Prof. Mikko H. Lipasti University of Wisconsin-Madison
22 pages
Computer Architecture: Branching
No ratings yet
Computer Architecture: Branching
37 pages
Branch Handling
No ratings yet
Branch Handling
23 pages
Branch Prediction: Joel Emer
No ratings yet
Branch Prediction: Joel Emer
36 pages
05 - Pipelining - Branch Prediction
No ratings yet
05 - Pipelining - Branch Prediction
20 pages
Coa Lecture Unit 3 Pipelining
No ratings yet
Coa Lecture Unit 3 Pipelining
95 pages
Advanced Branch Prediction Techniques
No ratings yet
Advanced Branch Prediction Techniques
41 pages
Branch Hazards in The Pipelined Processor: Winter 2002 CSE 141 - Topic
No ratings yet
Branch Hazards in The Pipelined Processor: Winter 2002 CSE 141 - Topic
24 pages
9.1.0 Branch Prediction Pentiums IBM PPC
No ratings yet
9.1.0 Branch Prediction Pentiums IBM PPC
163 pages
SimpleScalar for Researchers
No ratings yet
SimpleScalar for Researchers
3 pages
Branch Prediction
No ratings yet
Branch Prediction
38 pages
Branch Predictors
No ratings yet
Branch Predictors
41 pages
CS252 Graduate Computer Architecture Prediction (Con't) (Dependencies, Load Values, Data Values) February 22, 2010
No ratings yet
CS252 Graduate Computer Architecture Prediction (Con't) (Dependencies, Load Values, Data Values) February 22, 2010
54 pages
18 740 Fall15 Lecture05 Branch Prediction Afterlecture
No ratings yet
18 740 Fall15 Lecture05 Branch Prediction Afterlecture
93 pages
Computer Architecture: Pipelining: Dr. Ashok Kumar Turuk
No ratings yet
Computer Architecture: Pipelining: Dr. Ashok Kumar Turuk
136 pages
Branch Prediction Techniques: Prof. Pimal Khanpara Department of Computer Science & Engineering
No ratings yet
Branch Prediction Techniques: Prof. Pimal Khanpara Department of Computer Science & Engineering
20 pages
CA Lecture 4 Module 3
No ratings yet
CA Lecture 4 Module 3
27 pages
Advanced Branch Prediction Techniques
No ratings yet
Advanced Branch Prediction Techniques
23 pages
Prof. Ajit Pal Department of Computer Science and Engineering Indian Institute of Technology, Kharagpur Lecture - 16 Branch Prediction
No ratings yet
Prof. Ajit Pal Department of Computer Science and Engineering Indian Institute of Technology, Kharagpur Lecture - 16 Branch Prediction
26 pages
Conditional Branches
No ratings yet
Conditional Branches
35 pages
Seminar Monday ACA
No ratings yet
Seminar Monday ACA
20 pages
Branch Prediction - 1: Computer Architecture: A Constructive Approach
No ratings yet
Branch Prediction - 1: Computer Architecture: A Constructive Approach
29 pages
Cse-Vii-Advanced Computer Architectures (10cs74) - Solution
100% (1)
Cse-Vii-Advanced Computer Architectures (10cs74) - Solution
111 pages
CPU Architecture Essentials
No ratings yet
CPU Architecture Essentials
74 pages
Dynamic Branch Prediction
No ratings yet
Dynamic Branch Prediction
17 pages
07 Branch Prediction
No ratings yet
07 Branch Prediction
35 pages
Intel Nehalem Core Architecture
No ratings yet
Intel Nehalem Core Architecture
123 pages
Branch Prediction Techniques
No ratings yet
Branch Prediction Techniques
48 pages
Dynamic Branch Prediction
No ratings yet
Dynamic Branch Prediction
7 pages
What About Branches?: Branch Outcomes Are Not Known Until EXE What Are Our Options?
No ratings yet
What About Branches?: Branch Outcomes Are Not Known Until EXE What Are Our Options?
27 pages
Software-Based and Hardware-Based Branch Prediction Strategies and Performance Evaluation
No ratings yet
Software-Based and Hardware-Based Branch Prediction Strategies and Performance Evaluation
19 pages
Branch Prediction
No ratings yet
Branch Prediction
2 pages
Instruction Pipelining Explained
No ratings yet
Instruction Pipelining Explained
27 pages
Intel - Performance Analysis Guide For Intel® Core™ I7 Processor and Intel® Xeon™ 5500 Processors
No ratings yet
Intel - Performance Analysis Guide For Intel® Core™ I7 Processor and Intel® Xeon™ 5500 Processors
72 pages
Ca 2 Marks & Big Ques PDF
No ratings yet
Ca 2 Marks & Big Ques PDF
96 pages
Lec 15
No ratings yet
Lec 15
23 pages
Instruction Level Parallelism and Its Exploitation: Unit Ii by Raju K, Cse Dept
No ratings yet
Instruction Level Parallelism and Its Exploitation: Unit Ii by Raju K, Cse Dept
201 pages
Advanced Branch Prediction Techniques
No ratings yet
Advanced Branch Prediction Techniques
24 pages
Slides Chapter 6 Pipelining
No ratings yet
Slides Chapter 6 Pipelining
60 pages
8 DynamicBranchPrediction
No ratings yet
8 DynamicBranchPrediction
8 pages
Branch Prediction Techniques
No ratings yet
Branch Prediction Techniques
29 pages
Computer Architecture Assignment: The ARM Cortex-A53
No ratings yet
Computer Architecture Assignment: The ARM Cortex-A53
8 pages
Ue21ec341b 20240412163937
No ratings yet
Ue21ec341b 20240412163937
22 pages
CS17303 Computer Architecture Notes On Lesson Unit IV - Sumathi
No ratings yet
CS17303 Computer Architecture Notes On Lesson Unit IV - Sumathi
24 pages
L10 PipelineHazards 3
No ratings yet
L10 PipelineHazards 3
35 pages
البحث الثاني
No ratings yet
البحث الثاني
10 pages
Computer Architecture: Speculation & Multiple Issue
No ratings yet
Computer Architecture: Speculation & Multiple Issue
22 pages
L11 PipelineHazards 4
No ratings yet
L11 PipelineHazards 4
30 pages
L12 - Advanced Branch Preiction
No ratings yet
L12 - Advanced Branch Preiction
9 pages
Computer Architecture Quiz
No ratings yet
Computer Architecture Quiz
93 pages
Reducing Pipeline Branch Penalties
No ratings yet
Reducing Pipeline Branch Penalties
4 pages
Advanced Pipe Lining Techniques
No ratings yet
Advanced Pipe Lining Techniques
8 pages
Advanced Branch Prediction
No ratings yet
Advanced Branch Prediction
45 pages
17.L15 BranchPrediction
No ratings yet
17.L15 BranchPrediction
38 pages
CPU Cycles and Pipeline Performance
No ratings yet
CPU Cycles and Pipeline Performance
16 pages
Spectre (v1 v2 v4) V.S. Meltdown (v3)
No ratings yet
Spectre (v1 v2 v4) V.S. Meltdown (v3)
76 pages
Computer Organization Q&A Bank
100% (2)
Computer Organization Q&A Bank
10 pages
15CSE301 Computer Organization and Architecture: Course Introduction
No ratings yet
15CSE301 Computer Organization and Architecture: Course Introduction
17 pages
Aca Unit-4 Notes
No ratings yet
Aca Unit-4 Notes
23 pages
Milestone03 - Computer Architecture Report - Group3
No ratings yet
Milestone03 - Computer Architecture Report - Group3
45 pages
Ecommended Eading: William Stallings
No ratings yet
Ecommended Eading: William Stallings
23 pages
Lab3 Branch Prediction Hardware
No ratings yet
Lab3 Branch Prediction Hardware
16 pages
Performance Analysis of Dual Core, Core 2 Duo and Core I3 Intel Processor
No ratings yet
Performance Analysis of Dual Core, Core 2 Duo and Core I3 Intel Processor
7 pages
10 Branchprediction
No ratings yet
10 Branchprediction
49 pages
Lect09 Adv Branch Prediction
No ratings yet
Lect09 Adv Branch Prediction
55 pages
Pipe 3
No ratings yet
Pipe 3
32 pages
Branch Pred
No ratings yet
Branch Pred
27 pages
Branch Prediction Two Level
No ratings yet
Branch Prediction Two Level
2 pages
Amarthya Ridheesh Seth Pravar Proj 1
No ratings yet
Amarthya Ridheesh Seth Pravar Proj 1
4 pages
A General Guide To Applying Machine Learning To Computer Architecture - Marked
No ratings yet
A General Guide To Applying Machine Learning To Computer Architecture - Marked
21 pages
9 Types of Two Level Branch Predictor
No ratings yet
9 Types of Two Level Branch Predictor
4 pages
Questions That I Encountered
No ratings yet
Questions That I Encountered
9 pages
WRL TN 36
No ratings yet
WRL TN 36
29 pages
Anch Prediction
No ratings yet
Anch Prediction
25 pages
Implementing A Branch Predictor
No ratings yet
Implementing A Branch Predictor
7 pages
CA L15a BranchPrediction Intro and StaticPredictors
No ratings yet
CA L15a BranchPrediction Intro and StaticPredictors
19 pages
Intel Centrino Mobile Technology Learn
No ratings yet
Intel Centrino Mobile Technology Learn
5 pages
Pipeline Part 2 and Data Hazards
No ratings yet
Pipeline Part 2 and Data Hazards
11 pages
Gshare and Pshare Branch Predictors
No ratings yet
Gshare and Pshare Branch Predictors
4 pages
Lecture #3
No ratings yet
Lecture #3
12 pages
Branch Predicter Project
No ratings yet
Branch Predicter Project
20 pages

Cs146-Lecture7 2

Uploaded by

Cs146-Lecture7 2

Uploaded by

Computer Science 146

Instructor: Prof. David Brooks

Lecture 7: Dynamic Branch Prediction

Computer Science 146

Computer Science 146

Computer Science 146

Common Data Bus (CDB)

SUBI R1, R1, 8 Iss Ex Wb

BNEZ R1, Loop Iss Ex Wb

LD F0, 0(R1) Iss Iss Iss Iss M Wb

MUL F4, F0, F2 Iss Iss Iss Iss Iss Ex Ex Ex Ex Wb

SUBI R1, R1, 8 Iss Ex Wb

BNEZ R1, Loop Iss Ex Wb

LD F0, 0(R1) Iss M1 M2 M3 M4 M5 M6 M7 M8 Wb

MUL F4, F0, F2 Iss Iss Iss Iss Iss

SD 0(R1), F0 Iss Iss Iss Iss

Computer Science 146

Register Renaming: Pointer-Based

Computer Science 146

Computer Science 146

Register Renaming: Example

Computer Science 146

Computer Science 146

Strategies to reduce control hazards

Computer Science 146

Computer Science 146

Branch prediction methods

Computer Science 146

Dynamic Hardware Branch

Computer Science 146

Computer Science 146

Two-bit Saturating Counters

Taken Not Taken

1-bit Mis-Predict? Y Y Y Y Y Y ~50%

2-bit Mis-Predict? Y Y Y Y ~75%

• 2-bit “hysterisis” helps

Branch Prediction Buffer

Computer Science 146

• Clearly branch #3 depends on outcome of #1 and #2

Computer Science 146

Two-level Adaptive Branch

Computer Science 146

Pattern History Table

Computer Science 146

Computer Science 146

2-level Branch History

Computer Science 146

(2,2) predictor with 1K prediction entries

Computer Science 146

Variations on the basics --

Computer Science 146

Computer Science 146

Branch Predictor Performance

Computer Science 146

Computer Science 146

Branch Target Buffer

Computer Science 146

Computer Science 146

Branch Target Cache

Computer Science 146

For next time

Computer Science 146

You might also like