Module 3 Quiz

This document contains 4 multiple choice quiz questions about mapping thread and block indices to data indices for vector addition problems in CUDA. The questions cover cases where each thread calculates: 1) one output element, 2) two adjacent output elements, and 3) two output elements where blocks process sections of two elements at a time. The last question asks how many threads would be in a grid if the vector length is 8000, each thread calculates one element, block size is 1024, and the minimum number of blocks is used. The answer is 8192 threads.

Uploaded by

sy1990010111

Quiz Questions for Module 3

1. If we need to use each thread to calculate one output element of a vector addition, what would
be the expression for mapping the thread/block indices to data index:
(A) i=threadIdx.x + threadIdx.y;
(B) i=blockIdx.x + threadIdx.x;
(C) i=blockIdx.x*blockDim.x + threadIdx.x;
(D) i=blockIdx.x * threadIdx.x;

Answer: (C)

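Explanation: This is the case we covered in Lecture 2.3. All previous blocks cover
blockIdx.x*blockDim.x elements, and threadIdx.x is the offset of the thread within its block.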
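
As a minimal host-side sketch of option (C), the mapping can be emulated in plain C++ by passing in the values that a kernel would read from blockIdx.x, blockDim.x, and threadIdx.x. The function names one_per_thread_index and covers_each_index_once are hypothetical and do not come from the lecture; the second one simply loops over an emulated 1D grid to check that no index is skipped or repeated.

```cpp
#include <cassert>
#include <vector>

// Data index when each thread computes one output element (option C).
// In a CUDA kernel the arguments would be blockIdx.x, blockDim.x, threadIdx.x.
inline int one_per_thread_index(int block_idx, int block_dim, int thread_idx) {
    assert(thread_idx >= 0 && thread_idx < block_dim);
    return block_idx * block_dim + thread_idx;
}

// Emulates a 1D grid on the host and checks that every index in
// [0, num_blocks * block_dim) is produced exactly once.
inline bool covers_each_index_once(int num_blocks, int block_dim) {
    std::vector<int> hits(num_blocks * block_dim, 0);
    for (int b = 0; b < num_blocks; ++b)
        for (int t = 0; t < block_dim; ++t)
            ++hits[one_per_thread_index(b, block_dim, t)];
    for (int h : hits)
        if (h != 1) return false;
    return true;
}
```

For example, with blockDim.x = 256, thread 5 of block 2 maps to 2*256 + 5 = 517.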

2. We want to use each thread to calculate two (adjacent) output elements of a vector addition.
Assume that variable i should be the index for the first element to be processed by a thread.
What would be the expression for mapping the thread/block indices to data index of the first
element?
(A) i=blockIdx.x*blockDim.x + threadIdx.x + 2;
(B) i=blockIdx.x*threadIdx.x*2;
(C) i=(blockIdx.x*blockDim.x + threadIdx.x)*2;
(D) i=blockIdx.x*blockDim.x*2 + threadIdx.x;

Answer: (C)

Explanation: Every thread covers two adjacent output elements, so the starting data index is
simply twice the global thread index. Another way to look at it is that all previous blocks cover
(blockIdx.x*blockDim.x)*2 elements. Within the block, each thread covers 2 elements, so the
beginning position for a thread is offset by threadIdx.x*2.
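
The adjacent-pair mapping of option (C) can be sketched on the host in the same way. This is an illustration under assumptions: the names adjacent_pair_first_index and adjacent_pair_indices are hypothetical, and the parameters stand in for blockIdx.x, blockDim.x, and threadIdx.x.

```cpp
#include <cassert>
#include <utility>

// First data index when each thread handles two adjacent elements (option C).
// The parameters stand in for blockIdx.x, blockDim.x, threadIdx.x.
inline int adjacent_pair_first_index(int block_idx, int block_dim, int thread_idx) {
    assert(thread_idx >= 0 && thread_idx < block_dim);
    return (block_idx * block_dim + thread_idx) * 2;
}

// Both data indices handled by the thread: i and its right-hand neighbor i + 1.
inline std::pair<int, int> adjacent_pair_indices(int block_idx, int block_dim,
                                                 int thread_idx) {
    int i = adjacent_pair_first_index(block_idx, block_dim, thread_idx);
    return {i, i + 1};
}
```

With blockDim.x = 4, the last thread of block 0 handles elements 6 and 7, and the first thread of block 1 starts at element 8, so the blocks tile the vector without gaps.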

3. We want to use each thread to calculate two output elements of a vector addition. Each thread
block processes 2*blockDim.x consecutive elements that form two sections. All threads in each
block will first process a section, each processing one element. They will then all move to the
next section, again each processing one element. Assume that variable i should be the index for
the first element to be processed by a thread. What would be the expression for mapping the
thread/block indices to data index of the first element?
(A) i=blockIdx.x*blockDim.x + threadIdx.x + 2;
(B) i=blockIdx.x*threadIdx.x*2;
(C) i=(blockIdx.x*blockDim.x + threadIdx.x)*2;
(D) i=blockIdx.x*blockDim.x*2 + threadIdx.x;

Answer: (D)

Explanation: All previous blocks cover (blockIdx.x*blockDim.x)*2 elements. The beginning elements of
the threads are consecutive in this case, so just add threadIdx.x to it.

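
For the two-section layout of option (D), a host-side sketch might look as follows. The names sectioned_first_index and sectioned_indices are hypothetical. The second element of each thread is taken to be blockDim.x positions after the first, which follows from the question's statement that all threads move together to the next section.

```cpp
#include <cassert>
#include <utility>

// First data index when each block covers 2*blockDim.x consecutive elements
// split into two sections (option D). The parameters stand in for
// blockIdx.x, blockDim.x, threadIdx.x.
inline int sectioned_first_index(int block_idx, int block_dim, int thread_idx) {
    assert(thread_idx >= 0 && thread_idx < block_dim);
    return block_idx * block_dim * 2 + thread_idx;
}

// Both data indices handled by the thread: one in the first section and the
// matching position in the second section, blockDim.x elements later.
inline std::pair<int, int> sectioned_indices(int block_idx, int block_dim,
                                             int thread_idx) {
    int i = sectioned_first_index(block_idx, block_dim, thread_idx);
    return {i, i + block_dim};
}
```

With blockDim.x = 4, thread 3 of block 0 handles elements 3 and 7, and thread 0 of block 1 starts at element 8.
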
4. For a vector addition, assume that the vector length is 8000, each thread calculates one output
element, and the thread block size is 1024 threads. The programmer configures the kernel
launch to have a minimal number of thread blocks to cover all output elements. How many
threads will be in the grid?
(A) 8000
(B) 8196
(C) 8192
(D) 8200

Answer: (C)

Explanation: ceil(8000/1024)*1024 = 8*1024 = 8192. Another way to look at it is that the minimal
multiple of 1024 that covers 8000 is 1024*8 = 8192.
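
The arithmetic above can be checked with a small host-side sketch that uses the usual integer ceiling-division idiom. The function names min_blocks and threads_in_grid are hypothetical.

```cpp
#include <cassert>

// Minimal number of blocks needed to cover n elements, one element per thread.
// (n + block_size - 1) / block_size is integer ceiling division for positive ints.
inline int min_blocks(int n, int block_size) {
    assert(n > 0 && block_size > 0);
    return (n + block_size - 1) / block_size;
}

// Total number of threads launched in the grid for that block count.
inline int threads_in_grid(int n, int block_size) {
    return min_blocks(n, block_size) * block_size;
}
```

For n = 8000 and a block size of 1024 this gives 8 blocks and 8192 threads. The 192 threads beyond the end of the vector have no element to compute, which is why a vector addition kernel typically guards its body with a check such as if (i < n).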
