0% found this document useful (0 votes)

22 views5 pages

CA Classes-211-215

The document discusses vector processors and their components. It covers topics like vector register architectures being more advantageous than memory-memory architectures. It also discusses vector instructions types and factors that affect how well a program can run in vector mode, like its structure and the compiler's capabilities.

Uploaded by

SrinivasaRao

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

22 views5 pages

CA Classes-211-215

Uploaded by

SrinivasaRao

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 5

Computer Architecture Unit 9

variation in compiler vectorisation level has been noted by various studies of

the functioning of applications on vector processors. The hand-optimised
versions normally depict important gains in level of vectorisation for codes
which the compiler was not able to vectorise properly by itself, as all codes
at present were above 50% vectorisation. Interestingly, the quicker code
created by the Cray programmers had lower vectorisation levels. The
vectorisation level is not enough by itself to decide performance.
Alternative vectorisation methods might implement lesser instructions, or
maintain more values in vector registers, or permit higher chaining and
overlap in the midst of vector operations, and thus enhance performance
even in case the vectorisation level stays the same or decreases.
For instance, BDNA has approximately the same vectorisation level in the
two versions, however the hand-optimised code is more than 50% faster.
There is also huge variation in the way various compilers perform in
vectorising programs. Summing up the state of vectorising compilers, look
at the data in figure 9.5, that depicts the degree of vectorisation for various
processors, which utilise a test suite containing 100 handwritten FORTRAN
kernels.

Figure 9.5: Result of applying Vectorising Compilers to the 100 FORTRAN

Test Kernels

The kernels were planned to verify vectorisation ability and are able to be
vectorised by hand.

Manipal University of Jaipur B1648 Page No. 211

Computer Architecture Unit 9

Self Assessment Questions

10. List two factors which enable a program to run successfully in vector
mode.
11. There does not exist any variation in the capability of compilers to
decide if a loop can be vectorised. (True/False)

Activity 2:
Visit your local computer vendor and get an expert opinion about vector
processors and their working.

9.6 Summary
There are several representative application areas where vector processing
is of the utmost importance. Depending upon the way the operands are
fetched, vector processors can be segregated into two groups.
 Operands are straight away streamed from the memory to the functional
units and outcomes are written back to memory at the time the vector
operation advances in this architecture.
 Operands are read into vector registers wherein they are fed to the
functional units and outcomes of operations are written to vector
registers in this architecture.
 Vector register architectures have several advantages over vector
memory-memory architectures.
 There are several major components of the vector unit of a register-
register vector machine
 The various types of vector instructions for a register-register vector
processor are:
 Vector-scalar Instructions
 Vector-vector Instructions
 Vector-memory Instructions
 Gather and Scatter Instructions
 Masking Instructions
 Vector Reduction Instructions
 CRAY-1 is one of the oldest processors that implemented vector
processing.
Two issues that arise in real programs: (i) the vector length in a program is
not exactly 64. (ii) Non adjacent elements in vectors that reside in memory.

Manipal University of Jaipur B1648 Page No. 212

Computer Architecture Unit 9

 The structure of the program & capability of the compiler are two factors
that affect the success with which a program can be run in vector mode.

9.7 Glossary
 ASC: Advanced Scientific Computer
 Data hazards: the conflicts in register accesses
 ETA-10: A later shared-memory multiprocessor version of the CDC
Cyber 205.
 Functional hazards: the conflicts in functional units.
 Gather: an operation that fetches the non-zero elements of a sparse
vector from memory.
 Masking instructions: These instructions use a mask vector to expand
or compress a vector
 Scatter: It stores a vector in a sparse vector into memory.
 SECDED: single-error correction, double-error detection.
 Small scale integration: it can pack 10 to 20 transistors in a single
chip.
 Strip mining: the vector is partitioned into strips of 64 elements.
 Vector reduction instructions: These instructions accept one or two
vectors as input and produce a scalar as output.

9.8 Terminal Questions

1. Explain the importance of Vector Processors.
2. What are the different types of Vector Processing?
3. How is vector register architecture more advantageous over memory-
memory vector architecture?
4. Write short notes on:
a) CDC Cyber 200 model 205 computer overview
b) CRAY-1
c) Vector Length
d) Vector Stride
5. List the various functional units of Vector Processor and explain each
one in brief.
6. Explain the various types of vector instructions in detail.
7. How effective is the compiler in vector processors?

Manipal University of Jaipur B1648 Page No. 213

Computer Architecture Unit 9

9.9 Answers
Self Assessment Questions
1. Vector processors
2. Data parallelism
3. ETA-10
4. True
5. Crossbars
6. Vector-memory instructions
7. False
8. Strip mining
9. Sequential words
10. Structure of the program & capability of the compiler
11. False

Terminal Questions
1. There are various application areas of vector processors which are of
considerable importance. Refer Section 9.2.
2. Depending upon the way the operands are fetched, vector processors
can be segregated into two groups: Memory-memory vector architecture
and Vector-register architecture. Refer Section 9.3.
3. Due to the capability to overlap memory accesses as well as the
probable use of vector processors again, vector-register vector
processors are normally more efficient as compared to memory-memory
vector processors. Refer Section 9.3.
4. a. The CDC Cyber 205 is based on the concepts initiated for the CDC
Star 100; the first commercial model was produced in 1981. Refer
Section 9.4.
b. CRAY-1 is one of the oldest processors that implemented vector
processing. Refer Section 9.5.
c. The vector size may be less than the vector register size, and the
vector size may be larger than the vector register size. Refer
Section 9.6.
d. As vectors are one-dimensional series, saving a vector in memory is
direct: vector elements are stored as sequential words in memory.
Refer Section 9.6.

Manipal University of Jaipur B1648 Page No. 214

Computer Architecture Unit 9

5. The major components of the vector unit of a register-register vector

machine are Vector Registers, Vector Functional Units, Scalar Registers
etc. Refer Section 9.5.
6. The various types of vector instructions for a register-register vector
processor are: (Refer Section 9.5.)
a. Vector-scalar Instructions
b. Vector-vector Instructions
c. Vector-memory Instructions
d. Gather and Scatter Instructions
e. Masking Instructions
f. Vector Reduction Instructions
7. Like an indication of vectorisation level which can be acquired in
scientific programs, we should observe the vectorisation levels noted for
the Perfect Club benchmarks. Refer Section 9.7.

References:
 Hwang, K. (1993). Advanced Computer Architecture. McGraw-Hill.
 Godse, D. A. & Godse, A. P. (2010). Computer Organisation. Technical
Publications.
 Hennessy, John L., Patterson, David A. & Goldberg David (2011).
Computer Architecture: A Quantitative Approach, Morgan Kaufmann;
5th edition.
 Sima, Dezsö, Fountain, Terry J. &Kacsuk, Péter (1997). Advanced
computer architectures - a design space approach. Addison-Wesley-
Longman.
E-references:
 https://csel.cs.colorado.edu/~csci4576/VectorArch/VectorArch.html
 http://www.cs.clemson.edu/~mark/464/appG.pdf
 nasa_fig.gif

Manipal University of Jaipur B1648 Page No. 215

Worksheet 2.1 Input Devices and Their Uses: Cambridge IGCSE ICT Teacher's Resource
100% (1)
Worksheet 2.1 Input Devices and Their Uses: Cambridge IGCSE ICT Teacher's Resource
2 pages
Simple Vector Processor Modeled With VHDL
No ratings yet
Simple Vector Processor Modeled With VHDL
6 pages
HIKVISION Price List 1
No ratings yet
HIKVISION Price List 1
10 pages
Grass Valley Encore Control System
No ratings yet
Grass Valley Encore Control System
354 pages
Emc E20-593
No ratings yet
Emc E20-593
138 pages
Atmega 16
100% (1)
Atmega 16
323 pages
CS6461 - Computer Architecture Fall 2016 - Vector Operations
No ratings yet
CS6461 - Computer Architecture Fall 2016 - Vector Operations
47 pages
Lec. 12: Vector Computers: EECS 252 Graduate Computer Architecture
No ratings yet
Lec. 12: Vector Computers: EECS 252 Graduate Computer Architecture
31 pages
Architecture Chapter4 E5 2012
No ratings yet
Architecture Chapter4 E5 2012
92 pages
WC5335 PDF
No ratings yet
WC5335 PDF
140 pages
Spots V14 Ig M
No ratings yet
Spots V14 Ig M
420 pages
CSE 820 Graduate Computer Architecture Vectors and Multiprocessor Introduction
No ratings yet
CSE 820 Graduate Computer Architecture Vectors and Multiprocessor Introduction
39 pages
Flynn's Taxonomy: Data-Level Parallelism in Vector, SIMD, and GPU Architectures
No ratings yet
Flynn's Taxonomy: Data-Level Parallelism in Vector, SIMD, and GPU Architectures
28 pages
Vector
No ratings yet
Vector
42 pages
Data-Level Parallelism in Vector, SIMD, and GPU Architectures
No ratings yet
Data-Level Parallelism in Vector, SIMD, and GPU Architectures
58 pages
Olivetti - MS-DOS 3.30 - User Guide
No ratings yet
Olivetti - MS-DOS 3.30 - User Guide
168 pages
Vector Processing and Cray-1 Overview
No ratings yet
Vector Processing and Cray-1 Overview
16 pages
Pharma Investigation Tools Guide
No ratings yet
Pharma Investigation Tools Guide
31 pages
8051 Pin Diagram
No ratings yet
8051 Pin Diagram
20 pages
Printer and Fax Product List
No ratings yet
Printer and Fax Product List
16 pages
Top 100 MCQS of Computer Science
No ratings yet
Top 100 MCQS of Computer Science
98 pages
Chrom@ Disassembly: Required Tools Disassembly Instructions Reassembly Instructions
No ratings yet
Chrom@ Disassembly: Required Tools Disassembly Instructions Reassembly Instructions
20 pages
Computer Architecture Simd Vector Gpu
No ratings yet
Computer Architecture Simd Vector Gpu
16 pages
Vector and SIMD Computer Systems
No ratings yet
Vector and SIMD Computer Systems
59 pages
7-VECTOR PROCESSING-04-Jan-2020Material - I - 04-Jan-2020 - VECTOR - PROCESSING PDF
No ratings yet
7-VECTOR PROCESSING-04-Jan-2020Material - I - 04-Jan-2020 - VECTOR - PROCESSING PDF
31 pages
Instruction Groups: The 8051 Has 255 Instructions - Every 8-Bit Opcode From 00 To FF Is Used Except For A5.
No ratings yet
Instruction Groups: The 8051 Has 255 Instructions - Every 8-Bit Opcode From 00 To FF Is Used Except For A5.
30 pages
Supercomputers and Vector Machines
No ratings yet
Supercomputers and Vector Machines
40 pages
Vector
No ratings yet
Vector
38 pages
S 8 Mod 1
No ratings yet
S 8 Mod 1
33 pages
CH 04. Data-Level Parallelism in Vector, SIMD, and GPU Architectures
No ratings yet
CH 04. Data-Level Parallelism in Vector, SIMD, and GPU Architectures
50 pages
Isr4321 V k9 Datasheet
No ratings yet
Isr4321 V k9 Datasheet
6 pages
26-27 SIMD Architecture
No ratings yet
26-27 SIMD Architecture
33 pages
8255 PIO Programming Guide
No ratings yet
8255 PIO Programming Guide
6 pages
Advanced Computer Architecture: Presented By, Krishna
No ratings yet
Advanced Computer Architecture: Presented By, Krishna
35 pages
Data-Level Parallelism Vector and GPU
No ratings yet
Data-Level Parallelism Vector and GPU
6 pages
Unit Iii - Aca
No ratings yet
Unit Iii - Aca
13 pages
Advance Computer Architecture2
No ratings yet
Advance Computer Architecture2
36 pages
Advanced Computer Exam Guide
No ratings yet
Advanced Computer Exam Guide
8 pages
Unit Iii Data-Level Parallelism in Vector, Simd, and Gpu Architectures
No ratings yet
Unit Iii Data-Level Parallelism in Vector, Simd, and Gpu Architectures
26 pages
Computer Architecture AllClasses-Outline
No ratings yet
Computer Architecture AllClasses-Outline
294 pages
Onur 447 Spring15 Lecture14 Simd Afterlecture
No ratings yet
Onur 447 Spring15 Lecture14 Simd Afterlecture
60 pages
GeForce6100SM-M V1.1
No ratings yet
GeForce6100SM-M V1.1
90 pages
Bangabandhu Sheikh Mujibur Rahman Maritime University Bangladesh
No ratings yet
Bangabandhu Sheikh Mujibur Rahman Maritime University Bangladesh
7 pages
CA Classes-236-240
No ratings yet
CA Classes-236-240
5 pages
Ndless v3.1 Installation Troubleshooting Guide
No ratings yet
Ndless v3.1 Installation Troubleshooting Guide
5 pages
Computer Architecture Basics
No ratings yet
Computer Architecture Basics
99 pages
Differences Between The PICS EU GMP Guidelines and WHO Guidelines - Final
No ratings yet
Differences Between The PICS EU GMP Guidelines and WHO Guidelines - Final
20 pages
l22 Vector
No ratings yet
l22 Vector
32 pages
CS7103 - MultiCore Architecture Ppts Unit-II
No ratings yet
CS7103 - MultiCore Architecture Ppts Unit-II
43 pages
CA Classes-201-205
No ratings yet
CA Classes-201-205
5 pages
FALLSEM2021-22 CSE4001 ETH VL2021220104078 Reference Material I 26-Aug-2021 Module2-SIMD-VectorProcessors
No ratings yet
FALLSEM2021-22 CSE4001 ETH VL2021220104078 Reference Material I 26-Aug-2021 Module2-SIMD-VectorProcessors
16 pages
Ca Part 3
No ratings yet
Ca Part 3
20 pages
Chapter 04
No ratings yet
Chapter 04
47 pages
EE6304 Lecture13 Processors
No ratings yet
EE6304 Lecture13 Processors
69 pages
CA Classes-196-200
No ratings yet
CA Classes-196-200
5 pages
Lilypad Arduino
No ratings yet
Lilypad Arduino
7 pages
DJI+Assistant+2+ (Consumer+Drones+Series) +Release+Notes (2 1 14)
No ratings yet
DJI+Assistant+2+ (Consumer+Drones+Series) +Release+Notes (2 1 14)
14 pages
S3 Euserguide
No ratings yet
S3 Euserguide
32 pages
Computer Architecture AllClasses-Outline-100-198
No ratings yet
Computer Architecture AllClasses-Outline-100-198
99 pages
C Programming AllClasses-Outline-1-98
No ratings yet
C Programming AllClasses-Outline-1-98
98 pages
White Paper CPV Lets Foster Quality
No ratings yet
White Paper CPV Lets Foster Quality
7 pages
Onur Digitaldesign 2020 Lecture19 Simd Beforelecture
No ratings yet
Onur Digitaldesign 2020 Lecture19 Simd Beforelecture
64 pages
MS-16811 Rev2.0
No ratings yet
MS-16811 Rev2.0
45 pages
Computer ARCHITECTURE Lecture 8 10 1738846483
No ratings yet
Computer ARCHITECTURE Lecture 8 10 1738846483
202 pages
Aca
No ratings yet
Aca
3 pages
Array & Vector Processor
No ratings yet
Array & Vector Processor
17 pages
AMD64 Architecture Programmer's Manual - Volume 3 - General-Purpose and System Instructions (24594, r3.25, Dec-2017)
No ratings yet
AMD64 Architecture Programmer's Manual - Volume 3 - General-Purpose and System Instructions (24594, r3.25, Dec-2017)
684 pages
Stanley Assignment
No ratings yet
Stanley Assignment
6 pages
Magnetic Normal Modes of Bi-Component Permalloy Structures : Pam Malagò
No ratings yet
Magnetic Normal Modes of Bi-Component Permalloy Structures : Pam Malagò
5 pages
C04 PDF
No ratings yet
C04 PDF
9 pages
HP Thunderbolt Dock Specifications
No ratings yet
HP Thunderbolt Dock Specifications
20 pages
Lec 18-VectorSIMDGPUArchitectures
No ratings yet
Lec 18-VectorSIMDGPUArchitectures
29 pages
C Programming AllClasses-Outline-198-233
No ratings yet
C Programming AllClasses-Outline-198-233
36 pages
Energy Star Power and Performance Data Sheet
No ratings yet
Energy Star Power and Performance Data Sheet
4 pages
Advanced Computer Architecture Assigment
No ratings yet
Advanced Computer Architecture Assigment
60 pages
SIMD
No ratings yet
SIMD
44 pages
Computer Programming & Application (CSC430) : Introduction To Computers
No ratings yet
Computer Programming & Application (CSC430) : Introduction To Computers
48 pages
Qbdgroup Com en Blog What Is The Gamp 5 V Model in Computeri
No ratings yet
Qbdgroup Com en Blog What Is The Gamp 5 V Model in Computeri
16 pages
VLIW ARCHITECTURE and Pipeline
No ratings yet
VLIW ARCHITECTURE and Pipeline
5 pages
Programming in C - 121-140
No ratings yet
Programming in C - 121-140
20 pages
Programming in C - 161-180
No ratings yet
Programming in C - 161-180
20 pages
Programming in C - 41-60
No ratings yet
Programming in C - 41-60
20 pages
Programming in C - 21-40
No ratings yet
Programming in C - 21-40
20 pages
Guc 315 61 38694 2023-11-23T11 50 52
No ratings yet
Guc 315 61 38694 2023-11-23T11 50 52
33 pages
WWW Pharmaceutical Technology Com Sponsored Pharmaceutical Q
No ratings yet
WWW Pharmaceutical Technology Com Sponsored Pharmaceutical Q
6 pages
Tips For Writing User Friendly GMP Document
No ratings yet
Tips For Writing User Friendly GMP Document
12 pages
COE4590 14 Vector
No ratings yet
COE4590 14 Vector
14 pages
Error
No ratings yet
Error
6 pages
UNIT-V-Pipeline and Array Processing and Multi Processors
No ratings yet
UNIT-V-Pipeline and Array Processing and Multi Processors
51 pages
19 Computer Architecture Vector Processor
No ratings yet
19 Computer Architecture Vector Processor
20 pages
Module 4 Chapter 2
No ratings yet
Module 4 Chapter 2
42 pages
7TH - Unit 4-21ec74h6 - Ca
No ratings yet
7TH - Unit 4-21ec74h6 - Ca
67 pages
Computer Architecture Insights
No ratings yet
Computer Architecture Insights
5 pages
CA Classes-216-220
No ratings yet
CA Classes-216-220
5 pages
CA Classes-16-20
No ratings yet
CA Classes-16-20
5 pages
CA Classes-251-255
No ratings yet
CA Classes-251-255
5 pages
CA Classes-126-130
No ratings yet
CA Classes-126-130
5 pages
CA Classes-86-90
No ratings yet
CA Classes-86-90
5 pages
CA Classes-26-30
No ratings yet
CA Classes-26-30
5 pages
Unit-1 ACA
No ratings yet
Unit-1 ACA
26 pages
AMBA Based System For High Speed IP Validation
No ratings yet
AMBA Based System For High Speed IP Validation
45 pages
Sample PROGRAMS
No ratings yet
Sample PROGRAMS
4 pages
CA Classes-261-265
No ratings yet
CA Classes-261-265
5 pages
CA Classes-221-225
No ratings yet
CA Classes-221-225
5 pages
CA Classes-106-110
No ratings yet
CA Classes-106-110
5 pages
CA Classes-116-120
No ratings yet
CA Classes-116-120
5 pages
CA Classes-36-40
No ratings yet
CA Classes-36-40
5 pages
Unit 3-4
No ratings yet
Unit 3-4
76 pages
Unit 3 Notes
No ratings yet
Unit 3 Notes
35 pages
Lecture 7
No ratings yet
Lecture 7
29 pages
Unit 4 - 5th Sem-Ec355tbf
No ratings yet
Unit 4 - 5th Sem-Ec355tbf
67 pages
CS-482 - Lecture#4 - Vector and Array Processors
No ratings yet
CS-482 - Lecture#4 - Vector and Array Processors
40 pages
Unit5 Aca
No ratings yet
Unit5 Aca
11 pages
NX27V RISC V Vector Processor - English
No ratings yet
NX27V RISC V Vector Processor - English
29 pages

CA Classes-211-215

Uploaded by

CA Classes-211-215

Uploaded by

Computer Architecture Unit 9

variation in compiler vectorisation level has been noted by various studies of

Figure 9.5: Result of applying Vectorising Compilers to the 100 FORTRAN

Manipal University of Jaipur B1648 Page No. 211

Self Assessment Questions

Manipal University of Jaipur B1648 Page No. 212

9.8 Terminal Questions

Manipal University of Jaipur B1648 Page No. 213

Manipal University of Jaipur B1648 Page No. 214

5. The major components of the vector unit of a register-register vector

Manipal University of Jaipur B1648 Page No. 215

You might also like