Instruction Pipeline
COMPUTER ORGANIZATION
Copyright © 2014-2021 Testbook Edu Solutions Pvt. Ltd.: All rights reserved
Instruction Pipeline
Pipelining
Mechanism for overlapping the execution of many input sets by dividing one computation stage into several (say k) computation sub-stages.
Cost of implementation increases slightly.
Speedup increases.
Working of Pipeline
S1 must happen before S2 and S3, and S2 must happen before S3 (sequential execution).
Time →   T/3      T/3      T/3      T/3      T/3
S1       Item 1   Item 2   Item 3
S2                Item 1   Item 2   Item 3
S3                         Item 1   Item 2   Item 3
Note: When Item 1 is in stage S2, stage S1 is free, so S1 can be used for Item 2 at that time; Item 1 thus executes in stage S2 in parallel with Item 2 in stage S1.
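As a quick check of the diagram (a sketch assuming each sub-stage takes exactly T/3 and latch delays are ignored):
Sequential execution of 3 items = 3 × T = 3T
Pipelined execution of 3 items = (3 + 3 − 1) × T/3 = 5T/3
Speedup = 3T ÷ (5T/3) = 1.8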
In the processor pipeline we need a latch between successive stages to hold the intermediate results
temporarily.
Pipelined Processors
a. Degree of Overlap:
Serial: Next operation starts only after the previous operation gets completed.
Overlapped: Some overlap between consecutive stages.
Pipelined: Complete overlap between successive stages.
b. Depth of Pipeline:
Performance of the pipeline depends on the number of stages and how they are utilized without conflict.
Shallow pipeline: fewer stages; each stage is more complex.
Deep pipeline: larger number of stages; each stage is simpler.
c. Scheduling alternatives:
Static Pipeline:
i. Same sequence of pipeline stages is executed for all data/instructions.
ii. If one instruction stalls, all subsequent ones also get delayed.
Dynamic Pipeline:
i. Can be reconfigured to perform variable functions at different times.
ii. Feed-forward and feedback connections between stages.
Speedup and Efficiency
τ: Clock period of the pipeline
ti: Time delay of circuit in stage Si
dL: Delay of a latch
Maximum stage delay, τm = max{ti}
τ = τm + dL
Pipeline frequency, f = 1/τ
Speedup for a k-stage pipeline with n inputs:
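A standard form of this result, assuming the non-pipelined processor takes k·τ per input and the pipeline takes (k + n − 1) clock cycles for n inputs:
S(k) = (n · k · τ) / ((k + n − 1) · τ) = n · k / (k + n − 1)
Efficiency = S(k) / k = n / (k + n − 1)
As n becomes very large, S(k) → k and efficiency → 1.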
Latency
The number of time units between the initiations of two inputs into a pipeline is called the latency between them.
When two or more inputs attempt to use the same pipeline stage at the same time, a collision occurs.
Latencies whose use causes collisions are called forbidden latencies.
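An illustrative example (the reservation table below is assumed, not taken from these notes): suppose stage S1 is used in cycles 1 and 5, S2 in cycles 2 and 4, and S3 in cycle 3.
        t1    t2    t3    t4    t5
S1      X                       X
S2            X           X
S3                  X
The distance between marks in the same row gives a forbidden latency: row S1 gives 4 and row S2 gives 2, so initiating a new input 2 or 4 cycles after the previous one causes a collision. Forbidden latencies = {2, 4}.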
Pipelining MIPS32 Data Path
Assumptions:
Each of the 5 steps, IF, ID, EX, MEM and WB, is treated as a pipeline stage.
Each stage must finish its execution within one clock cycle.
Since many instructions will be overlapped, we must ensure that there is no conflict.
These assumptions can be satisfied fairly easily.
Let each stage take 'T' time units.
Non-pipelined: time to execute n instructions = 5 × T × n
In pipelined execution (Δ: latch delay):
Time to execute n instructions = 5(T + Δ) + (n − 1)(T + Δ)
= (4 + n)(T + Δ)
≈ (4 + n)T, if T >> Δ
Speedup = 5Tn / ((4 + n)T) ≈ 5, if n is very large.
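A quick numeric sketch (the values n = 100 and T = 2 ns are chosen only for illustration, with Δ negligible):
Non-pipelined time = 5 × 2 ns × 100 = 1000 ns
Pipelined time = (4 + 100) × 2 ns = 208 ns
Speedup = 1000 / 208 ≈ 4.8, approaching the ideal value of 5.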
Conflict Stages
IF and MEM: Both these stages access memory. So, they should not be in the same cycle.
SOLUTION: Using separate instruction and data cache. (i-cache and d-cache)
ID and WB: Both these stages access the register bank. So, they should not be used in the same clock cycle.
SOLUTION: Allow both read and write access to registers in the same clock cycle.
Simultaneous read and write of the same register may result in clashes.
SOLUTION: Write in the 1st half of the cycle and read in the 2nd half of the cycle.
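A minimal sketch of why the split cycle matters (the instruction sequence is assumed for illustration): the instruction in WB and the instruction in ID overlap in the same cycle, yet the reader still sees the newly written value.
I1: ADD R1, R2, R3    ← writes R1 in WB during cycle 5 (first half)
I2: OR R5, R6, R7
I3: AND R8, R9, R10
I4: SUB R4, R1, R6    ← reads R1 in ID during cycle 5 (second half), so it gets the updated value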
Points to Remember
1. Since in a pipelined processor we have to fetch an instruction every clock cycle, we need to increment the program counter in the fetch stage itself; otherwise, the next instruction will not be fetched.
2. In a non-pipelined processor there is no need to fetch an instruction every clock cycle. So, we increment
the program counter in the MEM stage.
Basic Performance Issue in Pipeline
Pipeline registers (latches) are inserted between the pipeline stages, which increases the overall execution time of a single instruction.
Pipeline Hazards
An instruction pipeline should complete the execution of an instruction every clock cycle.
Hazards are situations that prevent this from happening (for some instructions).
Hazards
1. Structural Hazards (Resource conflicts)
2. Data Hazards (Data Dependencies)
3. Control Hazards (branches and other changes in the program counter)
Solution for Hazards
Using special hardware and control circuits.
Inserting stall cycles in pipeline
When one instruction is stalled, all others that follow that instruction will also get stalled.
No new instruction can be fetched during the duration of stall.
Hazards result in performance degradation.
Structural Hazards
Due to resource conflicts.
When hardware cannot support overlapped execution.
Example: a single memory (cache) used to store both instructions and data.
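A small sketch of this conflict (cycle numbering assumed): with a single memory port, the MEM access of a load in cycle 4 collides with the instruction fetch scheduled for the same cycle.
Cycle:        1     2     3     4     5
LW R1, a      IF    ID    EX    MEM   WB
I2                  IF    ID    EX    MEM
I3                        IF    ID    EX
I4                              IF    ID     ← IF of I4 and MEM of the load both need memory in cycle 4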
Eliminating Structural Hazards
To reduce the cost of implementation.
Pipelining all the functional units may be too costly.
If structural hazards are not frequent but them happen.
Make use of operating I & D cache.
Data Hazards
Data hazards occur due to data dependencies between instructions.
I1: ADD R2, R5, R8
I2: SUB R2, R2, R6
Basic solution: insert stall cycles → 3 clock cycles will be wasted.
To reduce the number of clock cycles wasted:
a) Data forwarding/bypassing: As soon as the data is computed, it is forwarded using some additional hardware consisting of multiplexers, without waiting for the data to be written back (see the sketch after this list).
b) Concurrent Register Access: By splitting a clock cycle into two halves.
First half: Register read
Second half: Register write.
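A sketch of forwarding for the I1/I2 pair shown above (cycle numbering assumed): the result of I1 is ready at the end of its EX stage and is fed straight into the EX stage of I2 in the next cycle, so no stall is needed.
Cycle:                  1     2     3     4     5     6
I1: ADD R2, R5, R8      IF    ID    EX    MEM   WB
I2: SUB R2, R2, R6            IF    ID    EX    MEM   WB
R2 is forwarded from the EX output of I1 (end of cycle 3) to the EX input of I2 (cycle 4).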
Bypassing
The result computed by the previous instruction is already stored in some register within the data path. The value is taken directly from that register and forwarded to the instruction that requires it.
Register Read/Write
Used to reduce the number of values that have to be forwarded.
We can avoid the conflict that occurs when WB and ID fall in the same cycle by using the Register Read/Write scheme:
In the first half of the cycle: register write (in WB).
In the second half of the cycle: register read (in ID).
Data Hazard while Accessing Memory
Memory references are always in order, and so data hazards between memory references never occur.
Cache miss can result in pipeline stalls.
Load instruction followed by the use of the loaded data:
How to solve this problem?
Cannot be eliminated using forwarding
Pipeline Interlock: Hardware detects the hazard and stalls the pipeline until the hazard is cleared.
One stall cycle is needed.
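A minimal MIPS32 sketch (the register numbers and the label a are illustrative): even with forwarding, the loaded value is available only at the end of MEM, one cycle too late for the EX stage of the very next instruction, so the interlock inserts one stall.
LW R1, a          ← value of R1 available only at the end of MEM
ADD R3, R1, R2    ← needs R1 at the start of its EX stage; stalled for 1 cycle, then the value is forwarded from MEM to EX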
Instruction Issue
For a typical ALU instruction, the instruction is decoded in the ID stage, before the EX stage.
When an instruction moves from the ID to the EX stage, it starts executing its operation; this is when we say the instruction is issued.
All possible data hazards can be checked in the ID stage itself.
If a data hazard exists, the instruction is stalled before it is issued.
Instruction Scheduling or Pipe Scheduling
The compiler tries to avoid generating code with interlocks:
MIPS 32 Code
LW R1, a
LW R2, b
SUB R8, R1, R2 ← Interlock
SW R8, x
LW R1, c
LW R2, d
ADD R9, R1, R2 ← Interlock
SW R9, y
Schedule By Compiler MIPS 32 Code:
LW R1, a
LW R2, b
LW R3, c
SUB R8, R1, R2 ← Both interlocks eliminated
LW R4, d
SW R8, x
ADD R9, R3, R4
SW R9, y
Pipeline scheduling can increase the number of registers required, but results in a performance improvement.
A load instruction requires that the next instruction should not use the value currently being loaded; this is called a delayed load.
If the compiler cannot move some instruction to fill up the delay slot, it can insert a NOP (No Operation) instruction, as in the sketch below.
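A small illustrative sequence (registers and the label a are assumed):
LW R1, a
NOP               ← load delay slot; nothing useful could be moved here
ADD R3, R1, R2    ← R1 is now available when ADD needs it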
Types of Data Hazards
a) Read After Write (RAW):
Consider two instructions i1 and i2, with i1 occurring before i2 in the program.
i2 tries to read a source before i1 writes to it
Situation where an instruction refers to a result that has not yet been calculated.
Example:
i1: R2 ← R5 + R3
i2: R4 ← R2 + R3
b) Write After Read: (WAR)
i2 tries to write a destination before it is read by i1.
Problem with concurrent execution.
Example:
i1: R4 ← R1 + R5
i2: R5 ← R1 + R2
c) Write After Write (WAW):
i2 tries to write an operand before it is written by i1.
Example:
i1: R2 ← R4 + R7
i2: R2 ← R1 + R3
Control Hazard
Arise because of a change in the flow of control, i.e., branch instructions.
Can cause a greater performance loss than data hazards.
If the branch is taken, the PC is normally not updated until the end of MEM.
The next instruction can be fetched only after that (3 stall cycles).
The instructions fetched after a taken branch are discarded and the fetch is redone from the branch target.
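A sketch of the penalty (cycle numbering assumed): the branch outcome and target are known only at the end of MEM, so the correct instruction can be fetched only in the following cycle.
Cycle:              1     2     3     4     5
Branch              IF    ID    EX    MEM   WB
Next (target)             --    --    --    IF    ← 3 stall cycles before the correct instruction is fetched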
To Reduce Branch Stall Penalty
In MIPS 32, the branches require testing a register for zero, or comparing the values of two registers.
By using simple comparison logic on these registers, the branch decision and the computation of the branch target (effective) address can be completed by the end of the ID stage.
Delayed Branch Technique
→ If a branch instruction has a penalty of n stall cycles, the n instruction slots that immediately follow the branch instruction are called branch delay slots.
→ The task of the compiler is to try to fill up these delay slots with useful instructions, to make more effective use of the pipeline (see the sketch below).
→ Instructions in branch delay slots are always executed irrespective of whether the branch is taken or not.
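A minimal MIPS32 sketch of filling a one-cycle delay slot (the instruction sequence is assumed for illustration); the ADD is independent of the branch condition, so it can safely be moved from before the branch into the slot:
Before scheduling:
ADD R1, R2, R3
BEQZ R4, target
NOP               ← delay slot wasted
After scheduling by the compiler:
BEQZ R4, target
ADD R1, R2, R3    ← moved into the delay slot; executed whether or not the branch is taken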