0% found this document useful (0 votes)

22 views20 pages

19 Computer Architecture Vector Processor

The document discusses vector processors, which can operate on entire vectors in a single instruction, allowing for parallel processing and reduced memory access latency. It outlines the architecture, types of vector instructions, advantages, disadvantages, and applications of vector processors, highlighting their efficiency in handling large data sets. Despite their speed in mathematical operations, vector processors are less popular than scalar processors due to higher costs and complexity.

Uploaded by

Aritra Das

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

22 views20 pages

19 Computer Architecture Vector Processor

Uploaded by

Aritra Das

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 20

Computer Architecture

(PCC CS-402)
Vector Processor

May 12, 2025

Introduction
■ A processor can operate on an entire vector in one
instruction.
■ Work done automatically in parallel
(simultaneously).
■ The operand to the instructions are complete vectors
instead of one element.
■ Vector instructions access memory with known
pattern.
■ Reduces branches and branch problems in pipelines.

May 12, 2025 2

Introduction
■ Vector processor is an ensemble of hardware resources,
including vector registers, functional pipelines,
processing elements and register counters for
performing vector operations.
■ It is a coprocessor specially designed for vector
computation.
■ Vector instruction involves a large array of operands.
■ Are often used in multi-pipelined supercomputer.
■ Two different architectures are available:
● Register-to-register architecture (Ex.: Cray
supercomputer)
 Uses shorter instruction and vector register files.
● Memory-to-memory architecture (Ex.: Cyber 205)
 Uses memory based instructions which are longer in length
including memory address.
■ Consists with fixed number of vector registers.
May 12, 2025 3
Register based Vector instruction
■ Typical register-based vector operations listed below
where vector operator is represented by ϋ, a scalar
register as Si, vector register of length n as Vi, memory
array of length n as M(1 : n):
● V1 ϋ V2 → V3 (binary vector)
● S1 ϋ V1 → V2 (scaling)
● V1 ϋ V2 → S1 (binary production)
● M(1 : n) → V1 (vector load)
● V1 → M(1 : n) (vector store)
● V1 → V2 (unary vector)
● V1 → S1 (unary production)
■ Vector length should be equal in all operands used in
vector instruction.
May 12, 2025 4
Memory based Vector instruction
■ Typical memory-based vector operations listed below
where vector operator is represented by ϋ, a scalar
register as Si, memory array of length n as M(1:n),
scalar quantity stored in memory location k is
represented by M(k):
● M1(1 : n) ϋ M2(1 : n) → M(1 : n)
● S1 ϋ M1(1 : n) → M2(1 : n)
● M1(1 : n) → M2(1 : n)
● M1(1 : n) ϋ M2(1 : n) → M(k)
■ Vector length is not restricted by register length.

May 12, 2025 5

Vector instruction types
■ Define vector instruction types by mathematical
mappings between their working registers or memory
where vector operands are stored
● Vector-vector instructions
 One or two vector operands are fetched from the
respective vector registers.
 Enter through a functional pipeline unit, and produce
results in another vector register
 f1 : V i → Vj
 f2 : V j × Vk → Vi

May 12, 2025 6

Vector instruction types

● Vector-scalar instructions
 Each elements of Vk are multiplied by a scalar s to
produce vector Vi of equal length.
 f3 : s × V k → Vi

May 12, 2025 7

Vector instruction types

● Vector-memory instructions
 This corresponds to vector load or vector store element
by element, between the vector register (V) and the
memory (M) as defined below:
 f4 : M → V (vector load)
 f5 : V → M (vector store)

Vector Load instruction Vector Store instruction

May 12, 2025 8

Vector instruction types

● Vector reduction instructions

 f6 include finding the maximum, minimum, sum, and
mean value of all elements in a vector.
 f 6 : V i → Sj
 f7 is the dot product which performs from two vectors A
= (ai) and B = (bi).
 f 7 : V i × V j → Sk

May 12, 2025 9

Vector instruction types
● Masking instructions
 This type of instruction uses a mask vector to
compress or to expand a vector to a shorter or longer
index vector respectively, corresponding to the
following mappings
 f8 : V 0 × V m → V1

May 12, 2025 10

Vector instruction types
● Gather instructions
 This instruction use two vector registers to gather
vector elements randomly throughout the memory.
 f9 : M → V1 × V0 (Gather)

May 12, 2025 11

Vector instruction types
● Scatter instructions
 This instruction use two vector registers to scatter
vector elements randomly throughout the memory.
 f10 : V1 × V0 → M(Scatter)

May 12, 2025 12

Basic Vector Architecture
■ Pipeline architecture may have a number of steps.
■ There is no standard when it comes to pipelining
technique.
■ Cray-1 has 14 stages to perform vector operations.
■ Data is read into vector registers which are FIFO
queues.
■ Can hold 50-100 floating point values.
■ The instruction set:
● Loads a vector register from a location in memory.
● Performs operations on elements in vector registers.
● Stores data back into memory from the vector registers.
■ A vector processor is easy to program parallel SIMD
computer.
■ Memory references and computations are overlapped to
bring about a tenfold speed increase.
May 12, 2025 13
Basic Vector Architecture

Typical vector processor architecture.

May 12, 2025 14

Basic Vector Architecture

Closer view of vector processor register and functional unit.

May 12, 2025 15

Cray-1 Vector Architecture

Cray-1 vector computer architecture.

May 12, 2025 16

Advantages
■ Each result is independent of previous results -
allowing high clock rates.
■ A single vector instruction performs a great deal of
work - meaning less fetches and fewer branches (and
in turn fewer mis-predictions).
■ Vector instructions access memory a block at a time
which results in very low memory latency.
■ Less memory access = faster processing time.
■ Lower cost due to low number of operations
compared to scalar counterparts.

May 12, 2025 17

Disadvantages
■ Works well only with data that can be executed in
highly or completely parallel manner.
■ Needs large blocks of data to operate on to be efficient
because of the recent advances increasing speed of
accessing memory.
■ Severely lacking in performance compared to normal
processors on scalar data.
■ High price of individual chips due to limitations of
on-chip memory.
■ Increased code complexity needed to vectorize the
data.
■ High cost in design and low returns compared to
superscalar microprocessors.
May 12, 2025 18
Applications
■ Useful in applications that involve comparing or
processing large blocks of data.
■ Multimedia Processing (compress., graphics, audio
synthesis, image processing)
■ Speech and handwriting recognition.
■ Lossy Compression (JPEG, MPEG video and audio).
■ Lossless Compression (Zero removal, RLE,
Differencing, LZW).
■ Cryptography (RSA, DES/IDEA, SHA/MD5).

May 12, 2025 19

Conclusion
■ The Vector machine is faster at performing mathematical
operations on larger vectors.
■ The Vector processing computer’s vector register
architecture makes it better able to compute vast
amounts of data quickly.
■ While Vector Processing is not widely popular today, it
still represents a milestone in supercomputing
achievement.
■ Since scalar processors designed can also be used for
general applications their cost per unit is reduced
drastically. Such is not the case for vector
processors/supercomputers.
■ Vector processors will continue to have a future in Large
Scale computing and certain applications but can never
reach the popularity of Scalar microprocessors.
May 12, 2025 20

Standard Ii: Talent Search Examination - 2021-22
100% (2)
Standard Ii: Talent Search Examination - 2021-22
17 pages
Data Analysis and Property Modeling With SKUA-GOCAD Training Manual - Paradigm 15
No ratings yet
Data Analysis and Property Modeling With SKUA-GOCAD Training Manual - Paradigm 15
186 pages
GMP Training for Medical Devices
67% (3)
GMP Training for Medical Devices
110 pages
Altair PBS Eclipse Integraton 2012
No ratings yet
Altair PBS Eclipse Integraton 2012
13 pages
7-VECTOR PROCESSING-04-Jan-2020Material - I - 04-Jan-2020 - VECTOR - PROCESSING PDF
No ratings yet
7-VECTOR PROCESSING-04-Jan-2020Material - I - 04-Jan-2020 - VECTOR - PROCESSING PDF
31 pages
FALLSEM2021-22 CSE4001 ETH VL2021220104078 Reference Material I 26-Aug-2021 Module2-SIMD-VectorProcessors
No ratings yet
FALLSEM2021-22 CSE4001 ETH VL2021220104078 Reference Material I 26-Aug-2021 Module2-SIMD-VectorProcessors
16 pages
Lecture 7
No ratings yet
Lecture 7
29 pages
SIMD
No ratings yet
SIMD
44 pages
Onur Digitaldesign 2020 Lecture19 Simd Beforelecture
No ratings yet
Onur Digitaldesign 2020 Lecture19 Simd Beforelecture
64 pages
Organisasi & Arsitektur Komputer
No ratings yet
Organisasi & Arsitektur Komputer
3 pages
Supercomputers and Vector Machines
No ratings yet
Supercomputers and Vector Machines
40 pages
Computer Architecture Simd Vector Gpu
No ratings yet
Computer Architecture Simd Vector Gpu
16 pages
Ca Part 3
No ratings yet
Ca Part 3
20 pages
Unit Iii Data-Level Parallelism in Vector, Simd, and Gpu Architectures
No ratings yet
Unit Iii Data-Level Parallelism in Vector, Simd, and Gpu Architectures
26 pages
Unit 3-4
No ratings yet
Unit 3-4
76 pages
26-27 SIMD Architecture
No ratings yet
26-27 SIMD Architecture
33 pages
Lec. 12: Vector Computers: EECS 252 Graduate Computer Architecture
No ratings yet
Lec. 12: Vector Computers: EECS 252 Graduate Computer Architecture
31 pages
Module 4 Chapter 2
No ratings yet
Module 4 Chapter 2
42 pages
Unit5 Aca
No ratings yet
Unit5 Aca
11 pages
l22 Vector
No ratings yet
l22 Vector
32 pages
CS7103 - MultiCore Architecture Ppts Unit-II
No ratings yet
CS7103 - MultiCore Architecture Ppts Unit-II
43 pages
Data-Level Parallelism in Vector, SIMD, and GPU Architectures
No ratings yet
Data-Level Parallelism in Vector, SIMD, and GPU Architectures
58 pages
7TH - Unit 4-21ec74h6 - Ca
No ratings yet
7TH - Unit 4-21ec74h6 - Ca
67 pages
Module 1.6
No ratings yet
Module 1.6
53 pages
Vector
No ratings yet
Vector
38 pages
Flynn's Taxonomy: Data-Level Parallelism in Vector, SIMD, and GPU Architectures
No ratings yet
Flynn's Taxonomy: Data-Level Parallelism in Vector, SIMD, and GPU Architectures
28 pages
Data-Level Parallelism Vector and GPU
No ratings yet
Data-Level Parallelism Vector and GPU
6 pages
Unit 4 - 5th Sem-Ec355tbf
No ratings yet
Unit 4 - 5th Sem-Ec355tbf
67 pages
Simple Vector Processor Modeled With VHDL
No ratings yet
Simple Vector Processor Modeled With VHDL
6 pages
VLIW ARCHITECTURE and Pipeline
No ratings yet
VLIW ARCHITECTURE and Pipeline
5 pages
Chapter 04
No ratings yet
Chapter 04
47 pages
CS6461 - Computer Architecture Fall 2016 - Vector Operations
No ratings yet
CS6461 - Computer Architecture Fall 2016 - Vector Operations
47 pages
Unit 3 Notes
No ratings yet
Unit 3 Notes
35 pages
Vector Processor
No ratings yet
Vector Processor
83 pages
COE4590 14 Vector
No ratings yet
COE4590 14 Vector
14 pages
CA Classes-201-205
No ratings yet
CA Classes-201-205
5 pages
CH 04. Data-Level Parallelism in Vector, SIMD, and GPU Architectures
No ratings yet
CH 04. Data-Level Parallelism in Vector, SIMD, and GPU Architectures
50 pages
Bangabandhu Sheikh Mujibur Rahman Maritime University Bangladesh
No ratings yet
Bangabandhu Sheikh Mujibur Rahman Maritime University Bangladesh
7 pages
Vector
No ratings yet
Vector
42 pages
Vector and SIMD Computer Systems
No ratings yet
Vector and SIMD Computer Systems
59 pages
Array & Vector Processor
No ratings yet
Array & Vector Processor
17 pages
For Example: C (1:50) A (1:50) + B (1:50)
No ratings yet
For Example: C (1:50) A (1:50) + B (1:50)
7 pages
Guc 315 61 38694 2023-11-23T11 50 52
No ratings yet
Guc 315 61 38694 2023-11-23T11 50 52
33 pages
CSE 820 Graduate Computer Architecture Vectors and Multiprocessor Introduction
No ratings yet
CSE 820 Graduate Computer Architecture Vectors and Multiprocessor Introduction
39 pages
UNIT-V-Pipeline and Array Processing and Multi Processors
No ratings yet
UNIT-V-Pipeline and Array Processing and Multi Processors
51 pages
Vector Computers
No ratings yet
Vector Computers
43 pages
Why Vector Processing: Deep Pipeline More Parallelism
No ratings yet
Why Vector Processing: Deep Pipeline More Parallelism
7 pages
COA Unit V B
100% (1)
COA Unit V B
5 pages
Syllabus Topic: - Vector Processing - Vector Processor
No ratings yet
Syllabus Topic: - Vector Processing - Vector Processor
14 pages
20 Question of CA
No ratings yet
20 Question of CA
26 pages
Vector Processor
No ratings yet
Vector Processor
13 pages
Module 5 Coa
No ratings yet
Module 5 Coa
11 pages
Onur 447 Spring15 Lecture14 Simd Afterlecture
No ratings yet
Onur 447 Spring15 Lecture14 Simd Afterlecture
60 pages
XX-BSC Compact Vector Processing
No ratings yet
XX-BSC Compact Vector Processing
49 pages
CA Classes-196-200
No ratings yet
CA Classes-196-200
5 pages
Unit 2
No ratings yet
Unit 2
43 pages
CS-482 - Lecture#4 - Vector and Array Processors
No ratings yet
CS-482 - Lecture#4 - Vector and Array Processors
40 pages
1 Vector Processing: Solutions
No ratings yet
1 Vector Processing: Solutions
16 pages
Architecture Chapter4 E5 2012
No ratings yet
Architecture Chapter4 E5 2012
92 pages
Stanley Assignment
No ratings yet
Stanley Assignment
6 pages
CA Classes-211-215
No ratings yet
CA Classes-211-215
5 pages
Multivector&SIMD Computers Ch8
No ratings yet
Multivector&SIMD Computers Ch8
12 pages
MCA - HW - Lecture 7and8 - Prelim
No ratings yet
MCA - HW - Lecture 7and8 - Prelim
146 pages
Microbiology
No ratings yet
Microbiology
34 pages
Information Transfer
No ratings yet
Information Transfer
18 pages
Page Replacement
No ratings yet
Page Replacement
3 pages
SS Computer Architecture Cache Memory Organization
No ratings yet
SS Computer Architecture Cache Memory Organization
24 pages
Computer Architecture ILP - Techniques For Increasing
No ratings yet
Computer Architecture ILP - Techniques For Increasing
11 pages
555 Timer
No ratings yet
555 Timer
12 pages
Weighbridge Integration With Sap
No ratings yet
Weighbridge Integration With Sap
10 pages
EI8751-Industrial Data Networks
No ratings yet
EI8751-Industrial Data Networks
10 pages
Windows System Error Codes
No ratings yet
Windows System Error Codes
304 pages
Datasheet ST S5H100
No ratings yet
Datasheet ST S5H100
5 pages
SPLA Licensing Best Practices
No ratings yet
SPLA Licensing Best Practices
1 page
mANT30 PDF
No ratings yet
mANT30 PDF
1 page
6670 01 Que 2003 SPECIMEN
No ratings yet
6670 01 Que 2003 SPECIMEN
4 pages
HFS File Sharing Guide
No ratings yet
HFS File Sharing Guide
53 pages
Native Otp Authentication With Netscaler
No ratings yet
Native Otp Authentication With Netscaler
14 pages
Pony Preservation Project - Voice - A Guide
No ratings yet
Pony Preservation Project - Voice - A Guide
23 pages
Object Oriented Programming - ABAP Oops-Abap - 1
No ratings yet
Object Oriented Programming - ABAP Oops-Abap - 1
8 pages
C 5750 Users Guide
No ratings yet
C 5750 Users Guide
105 pages
Lecture Notes Cybersecurity Ethical Hacking Networking
No ratings yet
Lecture Notes Cybersecurity Ethical Hacking Networking
2 pages
Dennis
No ratings yet
Dennis
27 pages
Entry-Task-Validation-Exit (ETVX)
No ratings yet
Entry-Task-Validation-Exit (ETVX)
13 pages
IntelliSteer Operating Guide PDF
No ratings yet
IntelliSteer Operating Guide PDF
240 pages
Software Requirements Specification
No ratings yet
Software Requirements Specification
7 pages
Least Mastered Competency: Consolidated
No ratings yet
Least Mastered Competency: Consolidated
2 pages
4 Underlying Principles of Parallel
No ratings yet
4 Underlying Principles of Parallel
25 pages
Network Configuration: 69-3 Nguyen Thi Nho, P9, Q.Tbinh, Tp. HCM
No ratings yet
Network Configuration: 69-3 Nguyen Thi Nho, P9, Q.Tbinh, Tp. HCM
20 pages
Daniel B. Botkin - Forest Dynamics - An Ecological Model (1993) PDF
No ratings yet
Daniel B. Botkin - Forest Dynamics - An Ecological Model (1993) PDF
326 pages
Modelling Simuation & Operation Research
No ratings yet
Modelling Simuation & Operation Research
28 pages
45 Excel Formulas
No ratings yet
45 Excel Formulas
138 pages
P8 5.5.0-P85.5.4 Patch Compatibility Matrix 6
No ratings yet
P8 5.5.0-P85.5.4 Patch Compatibility Matrix 6
16 pages
College and Advanced Algebra (Content)
100% (1)
College and Advanced Algebra (Content)
269 pages
S01M03 TP00003SG03F6E0V Ed1 5G System Requirements
No ratings yet
S01M03 TP00003SG03F6E0V Ed1 5G System Requirements
26 pages

19 Computer Architecture Vector Processor

Uploaded by

19 Computer Architecture Vector Processor

Uploaded by

Computer Architecture

May 12, 2025

May 12, 2025 2

May 12, 2025 5

May 12, 2025 6

May 12, 2025 7

Vector Load instruction Vector Store instruction

May 12, 2025 8

● Vector reduction instructions

May 12, 2025 9

May 12, 2025 10

May 12, 2025 11

May 12, 2025 12

Typical vector processor architecture.

May 12, 2025 14

Closer view of vector processor register and functional unit.

May 12, 2025 15

Cray-1 vector computer architecture.

May 12, 2025 16

May 12, 2025 17

May 12, 2025 19

You might also like