Lecture Outline
• Design Principles for Modern Computers
• Parallelism
• Instruction-Level Parallelism
– Pipelining
– Dual Pipelines
– Superscalar Architectures
• Processor-Level Parallelism
– Array Computers
– Multiprocessors
– Multicomputers
Design Principles for Modern Computers

There is a set of design principles, sometimes called the RISC design principles, that architects of general-purpose CPUs do their best to follow:

• All Instructions Are Directly Executed by Hardware
– eliminates a level of interpretation

• Maximise the Rate at Which Instructions are Issued
– MIPS = millions of instructions per second
– MIPS speed is related to the number of instructions issued per second
– parallelism can play a role

• Instructions Should Be Easy to Decode
– a critical limit on the rate of issue of instructions
– make instructions regular, fixed length, with a small number of fields (see the decoding sketch after this list)
– the fewer different formats for instructions, the better

• Only Loads and Stores Should Reference Memory
– operands for most instructions should come from, and return to, registers
– access to memory can take a long time
– thus, only LOAD and STORE instructions should reference memory

• Provide Plenty of Registers
– since accessing memory is relatively slow, many registers (at least 32) need to be provided, so that once a word is fetched, it can be kept in a register until it is no longer needed
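To make the decoding principle concrete, the C sketch below decodes a hypothetical fixed-length 32-bit format. The field layout (a 6-bit opcode and three 5-bit register fields) is an illustrative assumption, not any real ISA's encoding; the point is that fixed positions reduce decoding to a few shifts and masks.

    #include <stdint.h>
    #include <stdio.h>

    /* Hypothetical fixed 32-bit instruction format: opcode in bits 31..26,
       three 5-bit register fields below it. Because every field sits at a
       known position, decoding needs no variable-length parsing. */
    typedef struct {
        uint8_t opcode, rs, rt, rd;
    } Instr;

    static Instr decode(uint32_t word) {
        Instr i;
        i.opcode = (word >> 26) & 0x3F;   /* 6-bit opcode        */
        i.rs     = (word >> 21) & 0x1F;   /* first source reg    */
        i.rt     = (word >> 16) & 0x1F;   /* second source reg   */
        i.rd     = (word >> 11) & 0x1F;   /* destination reg     */
        return i;
    }

    int main(void) {
        Instr i = decode(0x00221800u);    /* made-up word: op=0, rs=1, rt=2, rd=3 */
        printf("op=%u rs=%u rt=%u rd=%u\n", i.opcode, i.rs, i.rt, i.rd);
        return 0;
    }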
Parallelism

• Computer architects are constantly striving to improve the performance of the machines they design.
• Making the chips run faster by increasing their clock speed is one way.
• However, most computer architects look to parallelism (doing two or more things at once) as a way to get even more performance for a given clock speed.
• Parallelism comes in two general forms:
– instruction-level parallelism, and
– processor-level parallelism.

Instruction-Level Parallelism

• Parallelism is exploited within individual instructions to get more instructions/sec out of the machine.
• We will consider two approaches:
– Pipelining
– Superscalar Architectures
Pipelining

• Fetching of instructions from memory is a major bottleneck in instruction execution speed. However, computers have the ability to fetch instructions from memory in advance.
• These instructions are stored in a set of registers called the prefetch buffer.
• Thus, instruction execution is divided into two parts: fetching and actual execution.
• The concept of a pipeline carries this strategy much further. Instead of dividing instruction execution into only two parts, it is often divided into many parts, each one handled by a dedicated piece of hardware, all of which can run in parallel.

An Example of Pipelining

[Figure 2-4. (a) A five-stage pipeline (S1 instruction fetch unit, S2 instruction decode unit, S3 operand fetch unit, S4 instruction execution unit, S5 write back unit). (b) The state of each stage as a function of time; nine clock cycles are illustrated.]
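The timing in Figure 2-4(b) can be reproduced with a short C sketch. Only the stage and cycle counts come from the figure; the table layout is illustrative. Instruction n occupies stage s during clock cycle n + s - 1, so after a four-cycle fill the pipeline completes one instruction per cycle.

    #include <stdio.h>

    /* Prints the stage-occupancy table of Figure 2-4(b): instruction n is
       in stage s during cycle n + s - 1, so solving for n tells us which
       instruction sits in each stage at each cycle. */
    #define STAGES 5
    #define CYCLES 9

    int main(void) {
        for (int s = 1; s <= STAGES; s++) {
            printf("S%d:", s);
            for (int c = 1; c <= CYCLES; c++) {
                int n = c - s + 1;          /* instruction now in stage s */
                if (n >= 1) printf(" %d", n);
                else        printf("  ");   /* stage not yet filled */
            }
            printf("\n");
        }
        return 0;
    }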
Dual Pipelines

• If one pipeline is good, then surely two pipelines are better.
• Here a single instruction fetch unit fetches pairs of instructions together and puts each one into its own pipeline, complete with its own ALU for parallel operation.
• To be able to run in parallel, the two instructions must not conflict over resource usage (e.g., registers), and neither must depend on the result of the other (a pairing check is sketched below).

Example: Dual Pipelines

[Figure 2-5. Dual five-stage pipelines with a common instruction fetch unit: one S1 instruction fetch unit feeds two parallel pipelines, each with its own instruction decode, operand fetch, instruction execution, and write back units.]
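The pairing rule can be made concrete with a small check. The three-register instruction representation below is a hypothetical simplification (real pairing logic also considers functional units, memory operands, and so on); it tests only the register conflicts and the result dependency named above.

    #include <stdbool.h>
    #include <stdio.h>

    /* Hypothetical instruction: one destination register, two sources. */
    typedef struct {
        int dest;          /* register written */
        int src1, src2;    /* registers read   */
    } Instr;

    /* Two instructions may issue to the dual pipelines together only if
       neither reads or writes a register the other writes. */
    static bool can_pair(Instr a, Instr b) {
        bool raw = (b.src1 == a.dest) || (b.src2 == a.dest); /* b needs a's result */
        bool war = (a.src1 == b.dest) || (a.src2 == b.dest); /* b clobbers a's input */
        bool waw = (a.dest == b.dest);                       /* both write same reg */
        return !raw && !war && !waw;
    }

    int main(void) {
        Instr i1 = { .dest = 1, .src1 = 2, .src2 = 3 };  /* r1 = r2 + r3 */
        Instr i2 = { .dest = 4, .src1 = 1, .src2 = 5 };  /* r4 = r1 + r5 */
        printf("pair? %s\n", can_pair(i1, i2) ? "yes" : "no"); /* "no": i2 needs r1 */
        return 0;
    }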
Superscalar Architectures

• Going to four pipelines is conceivable, but doing so duplicates too much hardware.
• Instead, a different approach is used on high-end CPUs.
• The basic idea is to have just a single pipeline but to give it multiple functional units.
• This is a superscalar architecture: using more than one ALU, so that more than one instruction can be executed in parallel.
• Implicit in the idea of a superscalar processor is that the S3 stage can issue instructions considerably faster than the S4 stage is able to execute them.

[Figure 2-6. A superscalar processor with five functional units: the S1 instruction fetch, S2 instruction decode, and S3 operand fetch units feed an S4 stage containing two ALUs, a LOAD unit, a STORE unit, and a floating point unit, followed by the S5 write back unit.]
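A minimal sketch of the issue/execute mismatch follows. The unit names match Figure 2-6, but the latencies and the instruction stream are made-up illustrations: S3 issues one instruction per cycle, stalling only when the needed functional unit is still busy, so slow units (e.g. floating point) need not hold everything else up.

    #include <stdio.h>

    enum { ALU1, ALU2, LOAD, STORE, FPU, NUNITS };

    int main(void) {
        const char *name[NUNITS]  = { "ALU1", "ALU2", "LOAD", "STORE", "FPU" };
        const int latency[NUNITS] = { 1, 1, 2, 2, 4 };   /* invented cycle counts */
        int busy_until[NUNITS]    = { 0 };
        int stream[] = { ALU1, FPU, ALU2, FPU, ALU1 };   /* unit each instr needs */
        int n = sizeof stream / sizeof stream[0];
        int cycle = 1;
        for (int i = 0; i < n; i++, cycle++) {
            int u = stream[i];
            if (busy_until[u] > cycle)        /* needed unit busy: issue stalls */
                cycle = busy_until[u];
            busy_until[u] = cycle + latency[u];
            printf("cycle %d: issue instruction %d to %s\n", cycle, i + 1, name[u]);
        }
        return 0;
    }

Running it shows the second floating point instruction stalling until the FPU frees up, while instructions bound for other units keep issuing every cycle in between.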
Processor-Level Parallelism

• Instruction-level parallelism (pipelining and superscalar operation) rarely wins more than a factor of five or ten in processor speed.
• To get gains of 50, 100, or more, the only way is to design computers with multiple CPUs.
• We will consider three alternative architectures:
– Array Computers
– Multiprocessors
– Multicomputers

Array Computers

• An array processor consists of a large number of identical processors that perform the same sequence of instructions on different sets of data.
• A vector processor is efficient at executing a sequence of operations on pairs of data elements; all of the addition operations are performed in a single, heavily pipelined adder.
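The kind of loop both designs accelerate is elementwise arithmetic. The C sketch below is an illustrative stand-in: on an array processor the N additions would run on N processing elements simultaneously, while on a vector processor they would stream through one pipelined adder.

    #include <stdio.h>

    #define N 8

    int main(void) {
        float a[N] = { 1, 2, 3, 4, 5, 6, 7, 8 };
        float b[N] = { 8, 7, 6, 5, 4, 3, 2, 1 };
        float c[N];
        /* Same operation on every element pair: pure data parallelism. */
        for (int i = 0; i < N; i++)
            c[i] = a[i] + b[i];
        for (int i = 0; i < N; i++)
            printf("%.0f ", c[i]);          /* prints eight 9s */
        printf("\n");
        return 0;
    }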
Example: Array Computers

[Figure 2-7. An array processor of the ILLIAC IV type: a control unit broadcasts instructions to an 8 × 8 processor/memory grid, each element pairing a processor with its own memory.]

Multiprocessors

• The processing elements in an array processor are not independent CPUs, since there is only one control unit.
• The first parallel system with multiple full-blown CPUs is the multiprocessor.
• This is a system with more than one CPU sharing a common memory, co-ordinated in software.
• The simplest one is to have a single bus with multiple CPUs and one memory all plugged into it.
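A minimal sketch of the shared-memory model, using POSIX threads as stand-ins for CPUs (the counter and the thread count are made up): every worker touches the same variable in the common memory, and the co-ordination is done in software, here with a mutex. Compile with -lpthread.

    #include <pthread.h>
    #include <stdio.h>

    #define NTHREADS 4

    static long shared_counter = 0;     /* lives in the one common memory */
    static pthread_mutex_t lock = PTHREAD_MUTEX_INITIALIZER;

    static void *worker(void *arg) {
        (void)arg;
        for (int i = 0; i < 100000; i++) {
            pthread_mutex_lock(&lock);  /* software co-ordination */
            shared_counter++;
            pthread_mutex_unlock(&lock);
        }
        return NULL;
    }

    int main(void) {
        pthread_t t[NTHREADS];
        for (int i = 0; i < NTHREADS; i++)
            pthread_create(&t[i], NULL, worker, NULL);
        for (int i = 0; i < NTHREADS; i++)
            pthread_join(t[i], NULL);
        printf("counter = %ld\n", shared_counter);   /* 400000 */
        return 0;
    }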
Example: Multiprocessors

[Figure 2-8. (a) A single-bus multiprocessor. (b) A multicomputer with local memories. Panel (a): CPUs and a shared memory plugged into one bus; panel (b): each CPU additionally has a local memory of its own.]

Multicomputers

• Although multiprocessors with a small number of processors (< 64) are relatively easy to build, large ones are surprisingly difficult to construct.
• The difficulty is in connecting all the processors to the memory.
• To get around these problems, many designers have simply abandoned the idea of having a shared memory and just build systems consisting of large numbers of interconnected computers, each having its own private memory but no common memory.
• These systems are called multicomputers.
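A minimal sketch of the message-passing style a multicomputer forces, written against MPI (a common message-passing interface for such systems; the value and ranks are made up): with no common memory, node 0 must explicitly send its data to node 1. Build and launch with an MPI implementation, e.g. mpicc and mpirun -np 2.

    #include <mpi.h>
    #include <stdio.h>

    int main(int argc, char **argv) {
        int rank, value;
        MPI_Init(&argc, &argv);
        MPI_Comm_rank(MPI_COMM_WORLD, &rank);
        if (rank == 0) {
            value = 42;   /* exists only in node 0's private memory */
            MPI_Send(&value, 1, MPI_INT, 1, 0, MPI_COMM_WORLD);
        } else if (rank == 1) {
            MPI_Recv(&value, 1, MPI_INT, 0, 0, MPI_COMM_WORLD,
                     MPI_STATUS_IGNORE);
            printf("node 1 received %d\n", value);
        }
        MPI_Finalize();
        return 0;
    }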