0% found this document useful (0 votes)

36 views31 pages

Chapter7 - Basic Processing Unit 1

Uploaded by

ritiksharma203402

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

36 views31 pages

Chapter7 - Basic Processing Unit 1

Uploaded by

ritiksharma203402

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 31

Basic Processing Unit

Overview
 Instruction Set Processor (ISP)
 Central Processing Unit (CPU)

 A typical computing task consists of a series

of steps specified by a sequence of machine

instructions that constitute a program.
 An instruction is executed by carrying out a

sequence of more rudimentary operations.

Some Fundamental
Concepts
Fundamental Concepts
 Processor fetches one instruction at a time and
perform the operation specified.
 Instructions are fetched from successive memory
locations until a branch or a jump instruction is
encountered.
 Processor keeps track of the address of the memory
location containing the next instruction to be fetched
using Program Counter (PC).
 Instruction Register (IR)
Executing an Instruction
 Fetch the contents of the memory location pointed
to by the PC. The contents of this location are
loaded into the IR (fetch phase).
IR ← [[PC]]
 Assuming that the memory is byte addressable,
increment the contents of the PC by 4 (fetch phase).
PC ← [PC] + 4
 Carry out the actions specified by the instruction in
the IR (execution phase).
Processor Organization Internal processor
bus
Control signals

Instruction
Address
decoder and
lines
MDR HAS MAR control logic
TWO INPUTS Memory
AND TWO bus
OUTPUTS MDR
Data
lines IR

Datapath
Y
Constant 4 R0

Select MUX

Add
A B
ALU Sub R n - 1 
control ALU
lines
Carry-in
XOR TEMP

Z
Textbook Page 413

Figure 7.1. Single-bus organization of the datapath inside a processor.

Executing an Instruction
 Transfer a word of data from one processor
register to another or to the ALU.
 Perform an arithmetic or a logic operation
and store the result in a processor register.
 Fetch the contents of a given memory
location and load them into a processor
register.
 Store a word of data from a processor
register into a given memory location.
Register Transfers Riin
Internal processor
bus

Riout

Yin

Constant 4

Select MUX

A B
ALU

Zin

Z out

Figure 7.2. Input and output gating for the registers in Figure 7.1.
Register Transfers
 All operations and data transfers are controlled by the processor clock.
Bus

D Q
1
Q
Riout

Ri in
Clock

Figure
Figure7.3. Inputand
7.3. Input andoutput
outputgating
gatingforfor one
one registerbit.
register bit.
Performing an Arithmetic or
Logic Operation
 The ALU is a combinational circuit that has no
internal storage.
 ALU gets the two operands from MUX and bus.
The result is temporarily stored in register Z.
 What is the sequence of operations to add the
contents of register R1 to those of R2 and store the
result in R3?
1. R1out, Yin
2. R2out, SelectY, Add, Zin
3. Zout, R3in
Fetching a Word from Memory
 Address into MAR; issue Read operation; data into MDR.

Figure: Connection and control signals for register MDR.

Fetching a Word from Memory
 The response time of each memory access varies
(cache miss, memory-mapped I/O,…).
 To accommodate this, the processor waits until it
receives an indication that the requested operation
has been completed (Memory-Function-Completed,
MFC).
 Move (R1), R2
 MAR ← [R1]
 Start a Read operation on the memory bus
 Wait for the MFC response from the memory
 Load MDR from the memory bus
 R2 ← [MDR]
Step 1 2 3

Timing Clock

MARin MAR ← [R1]

Assume MAR
is always available Address
on the address lines
of the memory bus. Start a Read operation on the memory bus
Read

MDRinE

Data

Wait for the MFC response from the memory

MFC

MDR out Load MDR from the memory bus

R2 ← [MDR]

Figure 7.5. Timing of a memory Read operation.

Execution of a Complete
Instruction
 Add (R3), R1
 Fetch the instruction

 Fetch the first operand (the contents of the

memory location pointed to by R3)

 Perform the addition

 Load the result into R1

Architecture Internal processor
bus
Riin

Ri out

Yin

Constant 4

Select MUX

A B
ALU

Z in

Z out

Figure: Input and output gating for the registers.

Execution of a Complete
Instruction Internal processor
bus

Add (R3), R1
Control signals

Instruction
Step Action Address
decoder and
lines
MAR control logic

1 PCout , MAR in , Read,Select4,Add, Zin Memory

bus

2 Zout , PC in , Yin , WMF C MDR

Data
IR
3 MDR out , IR in lines

4 R3out , MAR in , Read Y

Constant 4 R0
5 R1out , Yin , WMF C
6 MDR out , SelectY,Add, Zin Select MUX

7 Zout , R1in , End Add

A B
ALU Sub R n - 1 
control ALU
lines
Carry-in
XOR TEMP
Figure: Control sequenceforexecutionoftheinstructionAdd (R3),R1.
Z

Figure 7.1. Single-bus organization of the datapath inside a processor.

Execution of Branch Instructions
A branch instruction replaces the contents of
PC with the branch target address, which is
usually obtained by adding an offset X given
in the branch instruction.
 The offset X is usually the difference between

the branch target address and the address

immediately following the branch instruction.
 Conditional branch
Step Action

1 PCout , MAR in , Read,Select4,Add, Zin

2 Zout, PCin , Yin, WMF C
3 MDRout , IR in
4 Offset-field-of-IR
out, Add, Zin

5 Zout, PCin , End

Figure : Control sequence for an unconditional branch instruction.

Pipelining
Overview
 Pipelining is widely used in modern
processors.
 Pipelining improves system performance in

terms of throughput.
 Pipelined organization requires sophisticated

compilation techniques.
Basic Concepts
Making the Execution of
Programs Faster
 Use faster circuit technology to build the
processor and the main memory.
 Arrange the hardware so that more than one

operation can be performed at the same time.

 In the latter way, the number of operations

performed per second is increased even though

the elapsed time needed to perform any one
operation is not changed.
Traditional Pipeline Concept

Laundry Example
Ann, Brian, Cathy, Dave

each have one load of clothes

to wash, dry, and fold A B C D
Washer takes 30 minutes

Dryer takes 40 minutes

Folder takes 20 minutes

6 PM 7 8 9 10 11 Midnight

Time

30 40 20 30 40 20 30 40 20 30 40 20
 Sequential laundry takes 6
A hours for 4 loads
 If they learned pipelining, how

long would laundry take?

D
6 PM 7 8 9 10 11 Midnight

Time
T
a 30 40 40 40 40 20
s
k A
 Pipelined laundry takes 3.5
hours for 4 loads
O B
r
d C
e
r D
Traditional Pipeline Concept
 Pipelining doesn’t help latency
6 PM 7 8 9 of single task, it helps
throughput of entire workload
Time  Pipeline rate limited by slowest
T pipeline stage
a 30 40 40 40 40 20
 Multiple tasks operating
s simultaneously using different
A
k resources
 Potential speedup = Number

pipe stages
O B
 Unbalanced lengths of pipe
r
stages reduces speedup
d C  Time to “fill” pipeline and time
e to “drain” it reduces speedup
r  Stall for Dependences
D
Use the Idea of Pipelining in a
Computer
Fetch + Execution
T ime
I1 I2 I3
Time
Clock cycle 1 2 3 4
F E F E F E
1 1 2 2 3 3 Instruction

I1 F1 E1
(a) Sequential execution

I2 F2 E2
Interstage buffer
B1
I3 F3 E3

Instruction Execution
fetch unit (c) Pipelined execution
unit

(b) Hardware organization

Basic idea of instruction pipelining.
Fetch + Decode+ Execution + Write

T ime
Clock cycle 1 2 3 4 5 6 7
Instruction
I1 F1 D1 E 1 W 1

F D E W
I2 2 2 2 2

F D E W
I 3 3 3 3
3
F D E W
I4 4 4 4 4

(a) Instruction execution divided into four steps

Interstage b uf fers

D : Decode
F : Fetch instruction E: Ex ecute W : Write
instruction and fetch operation
operands results
B1 B2 B3

(b) Hardware organization

A 4-stage pipeline.
Role of Cache Memory
 Each pipeline stage is expected to complete in one clock
cycle.
 The clock period should be long enough to let the slowest
pipeline stage to complete.
 Faster stages can only wait for the slowest one to
complete.
 Since main memory is very slow compared to the
execution, if each instruction needs to be fetched from
main memory, pipeline is almost useless.
 Fortunately, we have cache.
Pipeline Performance
 The potential increase in performance resulting
from pipelining is proportional to the number of
pipeline stages.
 However, this increase would be achieved only if

all pipeline stages require the same time to

complete, and there is no interruption throughout
program execution.
 Unfortunately, this is not true.
T ime
Clock c ycle 1 2 3 4 5 6 7 8 9

Instruction

I1 F1 D1 E1 W1

I2 F2 D2 E2 W2

I3 F3 D3 E3 W3

I4 F4 D4 E4 W4

I5 F5 D5 E5

Figure: Ef fect of an ex ecution operation taking more than one clock c ycle.

21css201t Coa Unit IV
No ratings yet
21css201t Coa Unit IV
136 pages
UNIT 2 - Part-1
No ratings yet
UNIT 2 - Part-1
42 pages
Mod3 Processing Unit
No ratings yet
Mod3 Processing Unit
25 pages
COA UNIT - III Processor and Control Unit
No ratings yet
COA UNIT - III Processor and Control Unit
127 pages
21css201t Coa Unit 4 Notes
No ratings yet
21css201t Coa Unit 4 Notes
136 pages
Unit 3 Basic Processing Unit
No ratings yet
Unit 3 Basic Processing Unit
39 pages
Chapter 7 - Basic Processing Unit
0% (1)
Chapter 7 - Basic Processing Unit
47 pages
Chapter 7 Basic Processing Unit
No ratings yet
Chapter 7 Basic Processing Unit
58 pages
Basic Processing Unit
No ratings yet
Basic Processing Unit
45 pages
Module 5
No ratings yet
Module 5
46 pages
Unit3 Control Unit 1
No ratings yet
Unit3 Control Unit 1
47 pages
Module 5
No ratings yet
Module 5
35 pages
Coa Unit Iii Final
No ratings yet
Coa Unit Iii Final
141 pages
CA Unit 2.1
No ratings yet
CA Unit 2.1
30 pages
Unit III - Basic Processing Unit
No ratings yet
Unit III - Basic Processing Unit
123 pages
Chapter3 - Basic Processing Unit
No ratings yet
Chapter3 - Basic Processing Unit
47 pages
Unit 3 Basic Processing Unit
No ratings yet
Unit 3 Basic Processing Unit
86 pages
Module 4 Basicprocessingunit
No ratings yet
Module 4 Basicprocessingunit
105 pages
Execution of Instruction
No ratings yet
Execution of Instruction
60 pages
Unit 3 Basic Processing Unit
No ratings yet
Unit 3 Basic Processing Unit
44 pages
Chapter 4 Notes
No ratings yet
Chapter 4 Notes
32 pages
Module-2: Memory Systems Basic Processing Unit
No ratings yet
Module-2: Memory Systems Basic Processing Unit
183 pages
CO Unit 4 - Processing - Pipelining
No ratings yet
CO Unit 4 - Processing - Pipelining
53 pages
Module-5 DDCO
No ratings yet
Module-5 DDCO
35 pages
Pipelining
No ratings yet
Pipelining
24 pages
Module 5 DD CO Ver4
No ratings yet
Module 5 DD CO Ver4
44 pages
CPU Instruction Execution Guide
No ratings yet
CPU Instruction Execution Guide
103 pages
Unit 6 Basic Processing Unit
No ratings yet
Unit 6 Basic Processing Unit
57 pages
DDCO Jan25 Unit5
No ratings yet
DDCO Jan25 Unit5
30 pages
Basic Processing Unit
No ratings yet
Basic Processing Unit
23 pages
COA Module5 PPT
No ratings yet
COA Module5 PPT
42 pages
CPU Instruction Execution Basics
No ratings yet
CPU Instruction Execution Basics
47 pages
Module 2-Basic-Processing-Unit (CPU)
No ratings yet
Module 2-Basic-Processing-Unit (CPU)
55 pages
Module 5 - Basic Processing Unit
No ratings yet
Module 5 - Basic Processing Unit
33 pages
Coa Unit 3
100% (1)
Coa Unit 3
58 pages
Hamacher Ch7 Microarchitecture
No ratings yet
Hamacher Ch7 Microarchitecture
47 pages
Lec 7 CSE-509 Pipelining
No ratings yet
Lec 7 CSE-509 Pipelining
27 pages
Pretechnical Studies Grade 8 Notes
90% (10)
Pretechnical Studies Grade 8 Notes
7 pages
Ddco Co 03
No ratings yet
Ddco Co 03
21 pages
UNIT-3 (Processor Organization)
No ratings yet
UNIT-3 (Processor Organization)
44 pages
Unit 2-Basic Processing Unit
No ratings yet
Unit 2-Basic Processing Unit
95 pages
M5 Notes
No ratings yet
M5 Notes
14 pages
Chapter 7 Basic Processing Unit
No ratings yet
Chapter 7 Basic Processing Unit
58 pages
Vxworks BSP Developers Guide 6.0
No ratings yet
Vxworks BSP Developers Guide 6.0
189 pages
DDCO Notes-162-171
No ratings yet
DDCO Notes-162-171
10 pages
Unit 07 - IBM I - Licensing - 3448710
No ratings yet
Unit 07 - IBM I - Licensing - 3448710
13 pages
CPU Instruction Execution Guide
No ratings yet
CPU Instruction Execution Guide
50 pages
Basic Processing Unit
No ratings yet
Basic Processing Unit
35 pages
Unit 7 - Basic Processing
No ratings yet
Unit 7 - Basic Processing
85 pages
Digital Design & CPU Basics
No ratings yet
Digital Design & CPU Basics
10 pages
Computer Organization-Basic Processing Unit
100% (2)
Computer Organization-Basic Processing Unit
48 pages
CS621 - Handouts - Mids
No ratings yet
CS621 - Handouts - Mids
61 pages
Unit 2
No ratings yet
Unit 2
96 pages
Basic Processing Unit
No ratings yet
Basic Processing Unit
35 pages
Unit - Vi: Some Fundamental Concepts
No ratings yet
Unit - Vi: Some Fundamental Concepts
46 pages
Coa Lecture Unit 2
No ratings yet
Coa Lecture Unit 2
82 pages
CDCT2203 Information Technology and Environment
100% (2)
CDCT2203 Information Technology and Environment
211 pages
Amit - Optimizing Oracle Essbase Formulas & Calc Scripts
No ratings yet
Amit - Optimizing Oracle Essbase Formulas & Calc Scripts
100 pages
Chapter3 - Basic Processing Unit
No ratings yet
Chapter3 - Basic Processing Unit
38 pages
Processor Instruction Execution
No ratings yet
Processor Instruction Execution
16 pages
What Is The Most Boring Household Activity?
No ratings yet
What Is The Most Boring Household Activity?
27 pages
Python GTU Study Material Presentations Unit-5 20112020032922AM
No ratings yet
Python GTU Study Material Presentations Unit-5 20112020032922AM
24 pages
Microprocessor Unit 4
100% (1)
Microprocessor Unit 4
55 pages
Ict Grade 7
No ratings yet
Ict Grade 7
95 pages
Welcom To EE323-Microprocessor Interfacing Program: CE (Fall 2016)
No ratings yet
Welcom To EE323-Microprocessor Interfacing Program: CE (Fall 2016)
68 pages
W32DASM Setup for Windows Users
100% (8)
W32DASM Setup for Windows Users
2 pages
Multilogin
No ratings yet
Multilogin
4 pages
Build Your Own Executable Crypter
100% (1)
Build Your Own Executable Crypter
5 pages
Final Report
No ratings yet
Final Report
83 pages
Embedded Systems: 1 - Introduction
No ratings yet
Embedded Systems: 1 - Introduction
51 pages
IC695 PBM300 Profibus Master Module
No ratings yet
IC695 PBM300 Profibus Master Module
5 pages
CT1 Armsoc Answerkey Set A
No ratings yet
CT1 Armsoc Answerkey Set A
5 pages
Computer Terms for Beginners
No ratings yet
Computer Terms for Beginners
8 pages
DTM Epa New
No ratings yet
DTM Epa New
59 pages
Elecives Combined
No ratings yet
Elecives Combined
32 pages
Big Data Analysis Introduction
No ratings yet
Big Data Analysis Introduction
42 pages
"Simulasi Motor": Program
No ratings yet
"Simulasi Motor": Program
5 pages
Solutions To Set 4
No ratings yet
Solutions To Set 4
4 pages
Chapter 1 - Introduction To Computers and C/C++ Programming: Outline
No ratings yet
Chapter 1 - Introduction To Computers and C/C++ Programming: Outline
22 pages
Ax PDF
No ratings yet
Ax PDF
20 pages
Intro to Operating Systems
No ratings yet
Intro to Operating Systems
30 pages
SolarWinds Interview Preparation (Edition 3)
No ratings yet
SolarWinds Interview Preparation (Edition 3)
10 pages
Infor LN UI 11.3 Sizing Guide
No ratings yet
Infor LN UI 11.3 Sizing Guide
25 pages
Qdi Gold Client and Qlik Catalog License Metrics
No ratings yet
Qdi Gold Client and Qlik Catalog License Metrics
3 pages

Chapter7 - Basic Processing Unit 1

Uploaded by

Chapter7 - Basic Processing Unit 1

Uploaded by

Basic Processing Unit

 A typical computing task consists of a series

of steps specified by a sequence of machine

sequence of more rudimentary operations.

Figure 7.1. Single-bus organization of the datapath inside a processor.

Figure: Connection and control signals for register MDR.

MARin MAR ← [R1]

Wait for the MFC response from the memory

MDR out Load MDR from the memory bus

Figure 7.5. Timing of a memory Read operation.

 Fetch the first operand (the contents of the

memory location pointed to by R3)

 Load the result into R1

Figure: Input and output gating for the registers.

1 PCout , MAR in , Read,Select4,Add, Zin Memory

2 Zout , PC in , Yin , WMF C MDR

4 R3out , MAR in , Read Y

7 Zout , R1in , End Add

Figure 7.1. Single-bus organization of the datapath inside a processor.

the branch target address and the address

1 PCout , MAR in , Read,Select4,Add, Zin

5 Zout, PCin , End

Figure : Control sequence for an unconditional branch instruction.

operation can be performed at the same time.

performed per second is increased even though

each have one load of clothes

Dryer takes 40 minutes

Folder takes 20 minutes

long would laundry take?

(b) Hardware organization

(a) Instruction execution divided into four steps

(b) Hardware organization

all pipeline stages require the same time to

You might also like