SRM VALLIAMMAI ENGINEERING
COLLEGE
SRM Nagar, Kattankulathur – 603 203
DEPARTMENT OF
COMPUTER SCIENCE AND ENGINEERING
&
CYBER SECURITY
QUESTION BANK
IV SEMESTER
1908006 - COMPUTER ARCHITECTURE
Regulation – 2019
Academic Year 2021–22
Prepared by
SHANTHI.S, Assistant Professor/CSE
S.BENILA, Assistant Professor/CSE
Ms.SUMA S, Assistant Professor/CSE
Mr.S.VENKATESH, Assistant Professor/CYS
SUBJECT : 1908006 - COMPUTER ARCHITECTURE
SEM/YEAR : IV/II
UNIT I -OVERVIEW & INSTRUCTIONS
Functional Units – Basic Operational Concepts – Performance – Instructions:
Language of the Computer – Operations, Operands – Instruction representation –
Logical operations – decision making – MIPS Addressing– Bus structure – Bus
operation.
PART-A
Q.No Questions BT Level Competence
1 Interpret the instruction set Architecture. BTL 2 Understand
2 Identify general characteristics of Relative addressing mode with BTL 4 Analyze
an example.
3 Define Computer Architecture. BTL 1 Remember
4 Tabulate the components of computer system. BTL 1 Remember
5 Give the addressing modes in MIPS. BTL 2 Understand
6 Discuss the use of bus in computer architecture. BTL 2 Understand
7 Differentiate register direct and indirect addressing. BTL 4 Analyze
8 Give the difference between auto increment and auto decrement BTL 2 Understand
addressing mode.
9 What are the functions of control unit? BTL 1 Remember
10 Calculate throughput and response time. BTL 3 Apply
11 Compose the CPU performance equation. BTL 6 Create
12 Measure the performance of the computers: BTL 5 Evaluate
If computer A runs a program in 10 seconds, and computer B runs
the same program in 15 seconds, how much faster is A over B?
13 Formulate the equation of CPU execution time for a program. BTL 6 Create
14 State the value of $s0 after executing the following MIPS BTL 1 Remember
instruction if the content of $s0 is 0000 1001.
15 Show the formula for CPU clock cycles required for a program. BTL 3 Apply
16 Describe about the various logical operators. BTL 1 Remember
17 What are the various types of decision making instructions? BTL 1 Remember
18 Compare address bus and memory bus structure. BTL 4 Analyze
19 Classify the instructions based on the operations they perform and BTL 3 Apply
give one example to each category.
20 Consider the following performance measurements for a program BTL 5 Evaluate
Measurement        Computer A   Computer B
Instruction count  10 billion   8 billion
Clock rate         4 GHz        4 GHz
CPI                1.0          1.1
Which computer has the higher MIPS rating?
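Worked check for Q.12 and Q.20 above: a minimal Python sketch, assuming the usual textbook definitions (relative performance is the ratio of execution times, and MIPS rating = clock rate / (CPI × 10^6)). The function names are illustrative and not part of the question bank.

```python
def speedup(time_slow_s, time_fast_s):
    """How many times faster the quicker machine is (ratio of execution times)."""
    return time_slow_s / time_fast_s


def mips_rating(clock_rate_hz, cpi):
    """Million instructions per second = clock rate / (CPI * 10^6)."""
    return clock_rate_hz / (cpi * 1e6)


# Q.12: computer A takes 10 s, computer B takes 15 s.
a_over_b = speedup(15, 10)

# Q.20: both computers run at 4 GHz; CPI is 1.0 for A and 1.1 for B.
mips_a = mips_rating(4e9, 1.0)
mips_b = mips_rating(4e9, 1.1)

print(f"A is {a_over_b} times faster than B")
print(f"MIPS rating: A = {mips_a:.0f}, B = {mips_b:.0f}")
```

The MIPS rating ignores the instruction count, so on its own it does not say which computer finishes the program sooner.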
PART B
1 Evaluate the various techniques to represent instructions in a (13) BTL 5 Evaluate
computer system.
2 i)List the various components of computer system and explain with (8) BTL 1 Remember
neat diagram
ii)List the applications of computers (5)
3 i).What is an addressing mode in a computer? (4) BTL 1 Remember
ii).Describe the MIPS addressing modes with suitable examples for
each category. (9)
4 i).Identify the various operations in computer system. (6) BTL 1 Remember
ii).Examine the operands of computer hardware. (7)
5 i).Discuss the logical operations and control operations of (7) BTL 2 Understand
computer.
ii).Explain the concept of Arithmetic operation with examples. (6)
6 Consider three different processors P1, P2, and P3 executing the BTL 4 Analyze
same instruction set. P1 has a 3 GHz clock rate and a CPI of 1.5. P2
has a 2.5 GHz clock rate and a CPI of 1.0. P3 has a 4.0 GHz clock
rate and has a CPI of 2.2.
i).Which processor has the highest performance expressed in (3)
instructions per second?
ii).If the processors each execute a program in 10 seconds, find the (5)
number of cycles and the number of instructions?
iii).We are trying to reduce the execution time by 30% but this leads
to an increase of 20% in the CPI. What clock rate should we have
to get this time reduction? (5)
7 Assume a program requires the execution of 50 × 10^6 FP BTL 3 Apply
instructions, 110 × 10^6 INT instructions, 80 × 10^6 L/S instructions,
and 16 × 10^6 branch instructions. The CPI for each type of
instruction is 1, 1, 4, and 2, respectively. Assume that the processor
has a 2 GHz clock rate.
i).By how much must we improve the CPI of FP instructions if we
want the program to run two times faster? (4)
ii).By how much must we improve the CPI of L/S instructions? (4)
iii).By how much is the execution time of the program improved if
the CPI of INT and FP instructions is reduced by 40% and the CPI
of L/S and Branch is reduced by 30%? (5)
8 Describe the branching operations in detail with suitable example. (13) BTL 2 Understand
9 i).Formulate the performance of CPU. (9) BTL 6 Create
ii).Compose the factors that affect performance. (4)
10 i).Illustrate the different types of instruction set architecture in (7) BTL 3 Apply
detail
ii).Examine the basic instruction types with examples (6)
11 What is Bus? Describe in detail the bus structure. (13) BTL 1 Remember
12 i).Infer the relationship between the performance and execution (8) BTL 4 Analyze
time
ii).Suppose we develop a new simpler processor that has 85% of the
capacitive load of the more complex older processor. Further
assume that it has adjustable voltage so that it can reduce voltage
15% compared to processor B, which results in a 15% shrink in
frequency. Point out the impact on dynamic power. (5)
13 Analyze the various instruction formats and illustrate with an (13) BTL 4 Analyze
example.
14 Consider two different implementations of the same instruction set (13) BTL 2 Understand
architecture. The instructions can be divided into four classes
according to their CPI (class A, B, C and D). P1 has a clock rate of 2.5
GHz and CPIs of 1, 2, 3, and 3 respectively, and P2 has a clock rate of 3
GHz and CPIs of 2, 2, 2, and 2 respectively. Given a program with a
dynamic instruction count of 1.0 × 10^6 instructions divided into
classes as follows: 10% class A, 20% class B, 50% class C, and 20%
class D, which implementation is faster? What is the global CPI for
each implementation? Find the clock cycles required in both cases.
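Worked check for Q.14 above: a minimal Python sketch, assuming the global CPI is the mix-weighted average of the class CPIs and that execution time = clock cycles / clock rate. The dictionary layout and names are illustrative only.

```python
def global_cpi(class_cpis, class_fractions):
    """Weighted average CPI over the instruction classes."""
    return sum(cpi * frac for cpi, frac in zip(class_cpis, class_fractions))


INSTRUCTION_COUNT = 1.0e6
MIX = [0.10, 0.20, 0.50, 0.20]  # fractions of class A, B, C, D

processors = {
    "P1": {"clock_hz": 2.5e9, "cpis": [1, 2, 3, 3]},
    "P2": {"clock_hz": 3.0e9, "cpis": [2, 2, 2, 2]},
}

results = {}
for name, proc in processors.items():
    cpi = global_cpi(proc["cpis"], MIX)
    cycles = cpi * INSTRUCTION_COUNT
    time_s = cycles / proc["clock_hz"]
    results[name] = (cpi, cycles, time_s)
    print(f"{name}: CPI = {cpi:.2f}, cycles = {cycles:.3g}, time = {time_s * 1e3:.3f} ms")

faster = min(results, key=lambda n: results[n][2])
print("Faster implementation:", faster)
```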
PART C
1 Evaluate the translation of a MIPS assembly instruction into a (15) BTL 5 Evaluate
machine instruction for the MIPS instruction add $t0, $s1, $s2.
2 (i) Assume the following: (15) BTL 6 Create
$t1 = (1100 0000)2 and $t2 = (1101 0000)2.
What will be the value of $t0 in NOR $t1,$t2,$t3
and in OR $t1,$t2,$t3?
(ii) $t0=0xAAAAAAAA, $t1=0x12345678
Find $t2 for the following sequence of instructions:
(a) sll $t2,$t0,4
(b) or $t2,$t2,$t1
3 Assume that the variables f and g are assigned to registers $s0 and (15) BTL 6 Create
$s1 respectively. Assume that the base address of the array A is in
register $s2. Assume f is zero initially.
f = g - A[4];
A[5] = f + 100;
Translate the above C statements into MIPS code. How many MIPS
assembly instructions are needed to perform the C statements and
how many different registers are needed to carry out the C
statements?
4 Evaluate which code sequence will execute faster according to execution (15) BTL 5 Evaluate
time for the following conditions:
The computer has three instruction classes with the CPI measurements
given below, and the instruction counts for each instruction class for the
same program from two different compilers are given. Assume that the
computer's clock rate is 1 GHz.
            CPI for the instruction class
            A   B   C
CPI         1   2   3
Code from   Instruction counts for the instruction class
            A   B   C
Compiler 1  2   1   2
Compiler 2  2   1   1
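Worked check for Part C Q.1 above: a minimal Python sketch of R-format encoding, assuming the standard MIPS field layout (opcode 6 bits, rs 5, rt 5, rd 5, shamt 5, funct 6), the conventional register numbers $t0 = 8, $s1 = 17, $s2 = 18, and funct = 0x20 for add. The helper name encode_r_type is hypothetical.

```python
# Conventional MIPS register numbers (only the ones needed here).
REG = {"$zero": 0, "$t0": 8, "$t1": 9, "$t2": 10, "$s0": 16, "$s1": 17, "$s2": 18}


def encode_r_type(rd, rs, rt, shamt=0, funct=0x20, opcode=0):
    """Pack the six R-format fields into one 32-bit machine word."""
    return (
        (opcode << 26)
        | (REG[rs] << 21)
        | (REG[rt] << 16)
        | (REG[rd] << 11)
        | (shamt << 6)
        | funct
    )


# add $t0, $s1, $s2  ->  rd = $t0, rs = $s1, rt = $s2
word = encode_r_type("$t0", "$s1", "$s2")
print(f"{word:032b}")
print(f"0x{word:08X}")
```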
UNIT II -ARITHMETIC FOR COMPUTERS
Addition and Subtraction – Multiplication – Division – Floating Point
Representation – Floating Point Operations – Subword Parallelism.
PART-A
Q.No Questions BT Level Competence
1 Calculate the following: BTL 3 Apply
Add (5)10 to (6)10 in binary and subtract (-6)10 from (7)10 in binary.
2 Analyze overflow conditions for addition and subtraction. BTL 4 Analyze
3 Construct the Multiplication hardware diagram. BTL 3 Apply
4 x=0000 1011 1110 1111 and y= 1111 0010 1001 1101 BTL 1 Remember
Examine x-y
5 What is fast multiplication? BTL 1 Remember
6 Subtract (11011)2–(10011)2 using 1’s complement and 2’s BTL 2 Understand
complement method.
7 Illustrate scientific notation and normalization with example. BTL 3 Apply
8 Analyze and Multiply 100011 * 100010 BTL 4 Analyze
9 Give the representation of double precision floating point number. BTL 2 Understand
10 For the following C statement, Develop MIPS assembly code. BTL 6 Create
f = g + (h − 5).
11 Name the floating point instructions in MIPS. BTL 1 Remember
12 Formulate the steps of floating point addition. BTL 6 Create
13 Evaluate the sequence of floating point multiplication. BTL 5 Evaluate
14 Define guard bit. What are the ways to truncate the guard bits? BTL 1 Remember
15 Express the IEEE 754 floating point format. BTL 2 Understand
Represent (63.25)10 in single precision
16 State sub-word parallelism. BTL 1 Remember
17 Interpret single precision floating point number representation with BTL 2 Understand
example.
18 Analyze the normalization notation and perform the same for the BTL 4 Analyze
following number: (10.015)10 × 10^1
19 List the steps in division algorithm. BTL 1 Remember
20 For the following MIPS assembly instructions, what is a BTL 5 Evaluate
corresponding C statement?
add f, g, h
add f, i, f
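Worked check for Q.15 above: a minimal Python sketch that lets the standard struct module produce the IEEE 754 single-precision bit pattern of 63.25 and then splits it into sign, exponent and fraction fields. The helper name is illustrative.

```python
import struct


def float_to_single_fields(x):
    """Return (bits, sign, biased exponent, fraction) of x as an IEEE 754 single."""
    (bits,) = struct.unpack(">I", struct.pack(">f", x))
    sign = bits >> 31
    exponent = (bits >> 23) & 0xFF  # 8-bit exponent, bias 127
    fraction = bits & 0x7FFFFF      # 23-bit fraction (hidden leading 1 not stored)
    return bits, sign, exponent, fraction


bits, sign, exponent, fraction = float_to_single_fields(63.25)
print(f"bits     = {bits:032b} (0x{bits:08X})")
print(f"sign     = {sign}")
print(f"exponent = {exponent:08b} (biased {exponent}, actual {exponent - 127})")
print(f"fraction = {fraction:023b}")
```

By hand, 63.25 = (111111.01)2 = 1.1111101 × 2^5, so the biased exponent is 5 + 127 = 132.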
PART-B
1 i).Discuss the multiplication algorithm in detail with diagram. (6) BTL 2 Understand
ii).Express the steps to multiply (2)10 × (3)10 using the sequential
version multiplication algorithm. (7)
2 Illustrate the fast multiplication algorithm with hardware block (13) BTL 3 Apply
and refined version of multiplication hardware.
3 Describe about basic concepts of ALU design. (13) BTL 1 Remember
4 Develop an algorithm to implement A*B. Calculate the product of (13) BTL 6 Create
octal unsigned 6-bit integers. Assume A and B with values:
A = (101 000)2, B = (010 011)2
5 i).State the integer division algorithm with diagram. (6) BTL 1 Remember
ii).Divide 50 by 23 and show the content of registers. Assume both
the inputs are unsigned 6-bit integers. (7)
6 i).Express in detail about Carry look ahead Adder. (6) BTL 2 Understand
ii).What is 5323 + 2275 when these values represent unsigned
12-bit octal numbers? Write the result in octal. (7)
7 Analyze the overflow in addition and in subtraction. (13) BTL 4 Analyze
What is 0147 + 3447 when these values are signed 12-bit octal
numbers stored in sign-magnitude format? The result should be in
octal A=1111 B= 0011
8 i).Examine, how floating point addition is carried out in a (8) BTL 1 Remember
computer system?
ii).Give an example for a binary floating point addition. (5)
9 i).How are floating point numbers represented in IEEE 754? (7) BTL 1 Remember
ii).Tabulate the IEEE 754 binary representation of the number
5.00736125 × 10^5
a. Single precision. (3)
b. Double precision. (3)
10 i).Design an arithmetic element to perform the basic floating point (7) BTL 2 Understand
operations.
ii).Discuss sub word parallelism. (6)
11 i).Explain floating point addition algorithm with diagram. (6) BTL 5 Evaluate
ii).Assess the result of the numbers (0.5)10 and (0.4375)10 using
binary Floating point Addition algorithm. (7)
12 Calculate using single precision IEEE 754 representation. BTL 4 Analyze
i).32.75 (6)
ii).18.125 (7)
13 Arrange the given number 0.0625 in BTL 4 Analyze
i).Single precision. (6)
ii).Double precision formats. (7)
14 Solve using Floating point multiplication algorithm BTL 3 Apply
i).A = (1.110)10 × 10^10, B = (9.200)10 × 10^-5 (7)
ii).(0.5)10 × (-0.4375)10 (6)
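Worked check for Q.5 ii) above: a minimal Python sketch of restoring division, assuming the textbook first-version hardware (the divisor starts in the upper half of a 2n-bit register, the remainder register starts with the dividend, and the loop runs n + 1 times). The function name and trace layout are illustrative.

```python
def restoring_divide(dividend, divisor, n_bits=6):
    """Unsigned restoring division; returns (quotient, remainder, trace)."""
    if divisor == 0:
        raise ZeroDivisionError("divisor must be non-zero")
    remainder = dividend
    div_reg = divisor << n_bits      # divisor placed in the upper half
    quotient = 0
    mask = (1 << n_bits) - 1
    trace = []
    for step in range(1, n_bits + 2):            # n + 1 iterations
        remainder -= div_reg                     # trial subtraction
        if remainder >= 0:
            quotient = ((quotient << 1) | 1) & mask
        else:
            remainder += div_reg                 # restore the old remainder
            quotient = (quotient << 1) & mask
        div_reg >>= 1                            # shift divisor right
        trace.append((step, quotient, div_reg, remainder))
    return quotient, remainder, trace


q, r, trace = restoring_divide(50, 23)
for step, quo, div, rem in trace:
    print(f"step {step}: quotient={quo:06b} divisor={div:012b} remainder={rem:012b}")
print("quotient =", q, "remainder =", r)
```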
PART C
1 Multiply the following signed numbers A=(-34)10 =(1011110)2 and (15) BTL 6 Create
B=(22)10 =(0010110)2 where B is multiplicand and A is multiplier.
2 Evaluate the sum of 2.6125 × 10^1 and 4.150390625 × 10^1 by hand, (15) BTL 5 Evaluate
assuming A and B are stored in the 16-bit half precision. Assume
1 guard, 1 round bit and 1 sticky bit and round to the nearest even.
Show all the steps.
3 Summarize 4 bit numbers to save space, which implement the (15) BTL 5 Evaluate
multiplication algorithm for (0010)2, (0011)2 with hardware design.
4 Design 4 bit version of the algorithm to save pages, for (15) BTL 6 Create
dividing (0000 0111)2 by (0010)2 with hardware design.
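Worked check for Part C Q.3 above: a minimal Python sketch of the sequential shift-and-add multiplication algorithm for 4-bit operands, assuming the first-version hardware in which the multiplicand register shifts left and the multiplier register shifts right each step. Names are illustrative.

```python
def sequential_multiply(multiplicand, multiplier, n_bits=4):
    """Unsigned shift-and-add multiply; returns (product, trace)."""
    product = 0
    trace = []
    for step in range(1, n_bits + 1):
        if multiplier & 1:            # test multiplier bit 0
            product += multiplicand   # add multiplicand to product
        multiplicand <<= 1            # shift multiplicand left
        multiplier >>= 1              # shift multiplier right
        trace.append((step, multiplier, multiplicand, product))
    return product, trace


product, trace = sequential_multiply(0b0010, 0b0011)
for step, mplier, mcand, prod in trace:
    print(f"step {step}: multiplier={mplier:04b} multiplicand={mcand:08b} product={prod:08b}")
print("product =", product)
```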
UNIT III - PROCESSOR AND CONTROL UNIT
A Basic MIPS implementation – Building a Datapath – Control Implementation
Scheme – Pipelining – Pipelined data path and control – Handling Data Hazards &
Control Hazards – Exceptions.
PART-A
Q.No Questions BT Level Competence
1 Express the control signals required to perform arithmetic BTL 2 Understand
operations.
2 Define hazard. Give an example for data hazard. BTL 2 Understand
3 Recall pipeline bubble. BTL 1 Remember
4 List the state elements needed to store and access an instruction. BTL 1 Remember
5 Draw the diagram of portion of data path used for fetching BTL 2 Understand
instruction.
6 Distinguish Sign Extend and Vector interrupts. BTL 2 Understand
7 Name the R-type instructions. BTL 1 Remember
8 Evaluate branch taken and branch not taken in instruction BTL 5 Evaluate
execution.
9 State the two steps that are common to implement any type of BTL 1 Remember
instruction.
10 Design the instruction format for the jump instruction. BTL 6 Create
11 Classify the different types of hazards with examples. BTL 4 Analyze
12 Illustrate data forwarding method to avoid data hazards. BTL 3 Apply
13 Assess the methods to reduce the pipeline stall. BTL 5 Evaluate
14 Tabulate the use of branch prediction buffer. BTL 1 Remember
15 Show the 5 stages pipeline. BTL 3 Apply
16 Point out the concept of exceptions and interrupts. BTL 4 Analyze
17 What is pipelining? BTL 1 Remember
18 Illustrate the various phases in executing an instruction. BTL 3 Apply
19 Classify the types of instruction classes and their instruction BTL 4 Analyze
formats.
20 Generalize what an exception is. Give one example for MIPS BTL 6 Create
exception.
PART-B
1 Discuss the basic MIPS implementation of instruction set. (13) BTL 2 Understand
2 State and draw a simple MIPS datapath with control unit and (13) BTL 1 Remember
explain the execution of ALU instruction.
3 i).List the types of hazards. (3) BTL 1 Remember
ii).Describe the methods for dealing with the control hazards. (10)
4 Design and develop an instruction pipeline working under various (13) BTL 6 Create
situations of pipeline stall.
5 i).What is data hazard? How do you overcome it? (8) BTL 1 Remember
ii).What are its side effects? (5)
6 i).Summarize control implementation scheme. (9) BTL 2 Understand
ii).Distinguish the data and control path methods in pipelining. (4)
7 i).Differentiate sequential execution and pipelining. (7) BTL 4 Analyze
ii).Select the model for building a data path. (6)
8 Recommend the techniques for BTL 5 Evaluate
i).Dynamic branch prediction. (7)
ii).Static branch prediction. (6)
9 Examine the approaches you would use to handle exceptions in (13) BTL 3 Apply
MIPS.
10 i).Analyze the hazards caused by unconditional branching (7) BTL 4 Analyze
statements.
ii).Describe operand forwarding in a pipeline processor with a (6)
diagram.
11 Express the modified data path to accommodate pipelined (13) BTL 2 Understand
executions with a diagram.
12 i).Explain single cycle and pipelined performance with examples. (7) BTL 4 Analyze
ii).Point out the advantages of pipeline over single cycle. (6)
13 i).Tabulate the ALU control with suitable truth table. (8) BTL 1 Remember
ii).Differentiate R-type instruction and memory instruction. (5)
14 With a suitable set of sequence of instructions show what happens (13) BTL 3 Apply
when the branch is taken, assuming the pipeline is optimized for
branches that are not taken and that we moved the branch execution
to the ID stage.
PART C
1 Assume the following sequence of instructions is executed on a BTL 6 Create
5-stage pipelined data path:
add r5,r2,r1
lw r3,4(r5)
lw r2,0(r2)
or r3,r5,r3
sw r3,0(r5)
If there is no forwarding or hazard detection, insert NOPs to ensure
correct execution.
i).If the processor has forwarding, but we forgot to implement the
hazard detection unit, what happens when this code executes? (5)
ii).If there is forwarding, for the first five cycles, compose which
signals are asserted in each cycle. (5)
iii).If there is no forwarding, what new input and output
signals do we need for the hazard detection unit? (5)
2 Explain in detail about the laundry process through which the (15) BTL 5 Evaluate
pipelining techniques can be established.
3 Consider the following loop: BTL 5 Evaluate
Loop: lw r1,0(r1)
and r1,r1,r2
lw r1,0(r1)
lw r1,0(r1)
beq r1,r0,Loop
Assume that perfect branch prediction is used (no stalls) that there
are no delay slots, and that the pipeline has full forwarding support.
Also assume that many iterations of this loop are executed before
the loop exits.
i).Assess a pipeline execution diagram for the third iteration of this (8)
loop.
ii).Show all instructions that are in the pipeline during these cycles
( for all iterations). (7)
4 Plan the pipelining in MIPS architecture and generate the (15) BTL 6 Create
exceptions handled in MIPS.
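Worked check for the NOP-insertion part of Part C Q.1 above: a minimal Python sketch, assuming a classic 5-stage pipeline with no forwarding in which the register file is written in the first half of a cycle and read in the second half, so a consumer must issue at least 3 slots after its producer. The tuple encoding (text, destination, sources) and the function name are illustrative.

```python
def insert_nops(program, min_gap=3):
    """Return the instruction stream with NOPs added to cover RAW hazards."""
    scheduled = []
    last_write = {}  # register -> slot index of its most recent writer
    for text, dest, sources in program:
        earliest = len(scheduled)
        for reg in sources:
            if reg in last_write:
                earliest = max(earliest, last_write[reg] + min_gap)
        scheduled.extend(["nop"] * (earliest - len(scheduled)))
        if dest is not None:
            last_write[dest] = len(scheduled)
        scheduled.append(text)
    return scheduled


program = [
    ("add r5,r2,r1", "r5", ["r2", "r1"]),
    ("lw r3,4(r5)", "r3", ["r5"]),
    ("lw r2,0(r2)", "r2", ["r2"]),
    ("or r3,r5,r3", "r3", ["r5", "r3"]),
    ("sw r3,0(r5)", None, ["r3", "r5"]),
]

scheduled = insert_nops(program)
print("\n".join(scheduled))
```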
UNIT IV-MEMORY AND I/O SYSTEMS
Memory Hierarchy – memory technologies – cache memory – measuring and improving cache
performance – virtual memory, TLBs – Accessing I/O Devices, programmed I/O, Interrupts –
Direct Memory Access– Arbitration.
PART-A
Q.No Questions BT Level Competence
1 Distinguish the types of locality of references. BTL 2 Understand
2 Draw the structure of memory hierarchy. BTL 1 Remember
3 Give the definition of memory-mapped I/O. BTL 2 Understand
4 Compare and contrast SRAM and DRAM. BTL 4 Analyze
5 What is the need to implement memory as a hierarchy? BTL 1 Remember
6 Define Rotational Latency. BTL 1 Remember
7 State the use of direct-mapped cache. BTL 1 Remember
8 Evaluate the following instance wherein the cache size is 64 BTL 5 Evaluate
blocks and block size is 16 bytes. What block number does byte
address 1200 map?
9 Formulate, how many total bits are required for a direct-mapped BTL 6 Create
cache with 16 KB of data and 4-word blocks, assuming a 32-bit
address?
10 Analyze the writing strategies in cache memory. BTL 4 Analyze
11 Integrate the functional steps required in an instruction cache BTL 6 Create
miss.
12 State hit rate and miss rate. BTL 1 Remember
13 Summarize the various block placement schemes in cache BTL 2 Understand
memory.
14 Quote the purpose of Dirty/Modified bit in Cache memory. BTL 1 Remember
15 Point out how DMA can improve I/O speed. BTL 4 Analyze
16 Show the role of TLB in virtual memory. BTL 3 Apply
17 Illustrate the advantages of virtual memory. BTL 3 Apply
18 Assess the relationship between physical address and logical BTL 5 Evaluate
address.
19 Differentiate Programmed I/O and Interrupt I/O. BTL 2 Understand
20 Demonstrate the sequence of events involved in handling an BTL 3 Apply
interrupt request from a single device.
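Worked check for Q.8 and Q.9 above: a minimal Python sketch, assuming a direct-mapped cache with 4-byte words, power-of-two sizes, and one valid bit per block in addition to tag and data. The function names are illustrative.

```python
def direct_mapped_block(byte_address, block_size_bytes, num_blocks):
    """Cache block index = (block address) modulo (number of cache blocks)."""
    block_address = byte_address // block_size_bytes
    return block_address % num_blocks


def direct_mapped_total_bits(data_bytes, words_per_block, address_bits=32, word_bytes=4):
    """Total storage bits = blocks * (data bits + tag bits + 1 valid bit)."""
    block_bytes = words_per_block * word_bytes
    num_blocks = data_bytes // block_bytes
    index_bits = num_blocks.bit_length() - 1    # log2, sizes assumed powers of two
    offset_bits = block_bytes.bit_length() - 1  # byte offset within a block
    tag_bits = address_bits - index_bits - offset_bits
    return num_blocks * (block_bytes * 8 + tag_bits + 1)


# Q.8: 64 blocks, 16-byte blocks, byte address 1200
block = direct_mapped_block(1200, 16, 64)

# Q.9: 16 KB of data, 4-word blocks, 32-bit address
total_bits = direct_mapped_total_bits(16 * 1024, 4)

print("Byte address 1200 maps to block", block)
print("Total bits:", total_bits, f"({total_bits // 1024} Kibibits)")
```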
PART-B
1 i).List the various memory technologies and examine its relevance (8) BTL 1 Remember
in architecture design.
ii). Identify the characteristics of memory system. (5)
2 Elaborate in detail the memory hierarchy with neat diagram. (13) BTL 1 Remember
3 i).Give the advantages of cache. (4) BTL 2 Understand
ii).Identify the basic operations of cache in detail with diagram. (9)
4 Express the following various mapping schemes used in cache BTL 2 Understand
design.
i).Direct. (4)
ii).Associative. (4)
iii).Set associative. (5)
5 i).Analyze the given problem: BTL 4 Analyze
A byte addressable computer has a small data cache capable of
holding eight 32-bit words. Each cache block contains one 32-bit
word. When a given program is executed, the processor reads data
from the following sequence of hex addresses: 200, 204, 208, 20C,
2F4, 2F0, 200, 204, 218, 21C, 24C, 2F4. The pattern is repeated four
times. Assuming that the cache is initially empty, show the contents
of the cache at the end of each pass, and compute the hit rate for a
direct mapped cache. (8)
ii).What are the methods used to measure and improve the
performance of the cache? (5)
6 i).Define virtual memory and its importance. (5) BTL 1 Remember
ii).Examine TLB with necessary diagram. (8)
7 i).Demonstrate the DMA controller. (4) BTL 3 Apply
ii).Illustrate how DMA controller is used for direct data transfer
between memory and peripherals. (9)
8 i).Evaluate the advantages of interrupts. (5) BTL 5 Evaluate
ii).Summarize the concept of interrupts with neat diagrams. (8)
9 Design standard input and output interfaces required to connect the (13) BTL 6 Create
I/O device to the bus.
10 Classify the bus arbitration techniques of DMA in detail. (13) BTL 4 Analyze
11 Point out the following in detail BTL 4 Analyze
i).Programmed I/O. (7)
ii).Instructions executed by IOP. (6)
12 Describe in detail about the methods used to reduce cache misses. (13) BTL 1 Remember
13 Discuss virtual memory address translation in detail with necessary (13) BTL 2 Understand
diagram.
14 Calculate the performance of the processor: (13) BTL 3 Apply
Assume the miss rate of an instruction cache is 2% and the miss rate
of the data cache is 4%. If a processor has a CPI of 2 without any
memory stalls and the miss penalty is 100 cycles for all misses,
estimate how much faster a processor would run with a perfect cache
that never missed. Assume the frequency of all loads and stores is
36%.
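Worked check for Q.14 above: a minimal Python sketch, assuming every instruction makes one instruction-cache access and that memory-stall cycles per instruction are miss rate × miss penalty (scaled by the load/store frequency for data accesses). The function name is illustrative.

```python
def cpi_with_stalls(base_cpi, i_miss_rate, d_miss_rate, mem_fraction, miss_penalty):
    """Effective CPI = base CPI + instruction-miss stalls + data-miss stalls."""
    instr_stalls = i_miss_rate * miss_penalty
    data_stalls = mem_fraction * d_miss_rate * miss_penalty
    return base_cpi + instr_stalls + data_stalls


BASE_CPI = 2
real_cpi = cpi_with_stalls(BASE_CPI, 0.02, 0.04, 0.36, 100)
speedup_perfect = real_cpi / BASE_CPI  # same instruction count and clock rate

print(f"CPI with memory stalls = {real_cpi:.2f}")
print(f"Perfect cache is {speedup_perfect:.2f} times faster")
```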
PART C
1 Mean Time Between Failures (MTBF), Mean Time To BTL 6 Create
Replacement (MTTR) and Mean Time To Failure (MTTF) are
useful metrics for evaluating the reliability and availability of a
storage resource. Explore these concepts by answering the
questions about devices with the following metrics:
MTTF: 3 years MTTR: 1 day
i).Develop and calculate the MTBF for each of the devices. (3)
ii).Develop and calculate the availability for each of the devices. (4)
iii).What happens to availability as the MTTR approaches 0? (4)
iv).What happens to availability as the MTTR gets very high? (4)
2 Design and explain parallel priority interrupt hardware for a (15) BTL 6 Create
system with eight interrupt sources.
3 For a direct mapped cache design with a 32 bit address, the BTL 5 Evaluate
following bits of the address are used to access the cache.
Tag: 31-10 Index: 9-5 Offset: 4-0
i).Judge what is the cache block size? (5)
ii).Decide how many entries does the cache have? (5)
iii).Assess what is the ratio between total bits required for such a
cache implementation over the data storage bits? (5)
4 Summarize by considering a web application. Assuming both (15) BTL 5 Evaluate
client and servers are involved in the process of web browsing
application, where can caches be placed to speed up the process?
Design a memory hierarchy for the system. Show the typical size
and latency at various levels of the hierarchy. What is the
relationship between the cache size and its access latency? What
are the units of data transfers between hierarchies? What is the
relationship between data location, data size and transfer latency?
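Worked check for Part C Q.1 above: a minimal Python sketch, assuming MTBF = MTTF + MTTR, availability = MTTF / (MTTF + MTTR), and 365 days per year (the year length is an assumption, not stated in the question).

```python
DAYS_PER_YEAR = 365  # assumption: leap years ignored


def mtbf(mttf, mttr):
    """Mean time between failures = time to failure + time to replacement."""
    return mttf + mttr


def availability(mttf, mttr):
    """Fraction of time the device is in service."""
    return mttf / (mttf + mttr)


mttf_days = 3 * DAYS_PER_YEAR
mttr_days = 1

device_mtbf = mtbf(mttf_days, mttr_days)
device_availability = availability(mttf_days, mttr_days)

print("MTBF =", device_mtbf, "days")
print(f"Availability = {device_availability:.5f}")
print("MTTR -> 0:     ", availability(mttf_days, 0))
print("MTTR very high:", availability(mttf_days, 1e9))
```

As MTTR approaches 0 the availability approaches 1, and as MTTR grows very large the availability approaches 0.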
UNIT V-PARALLELISM
Instruction-level-parallelism – Parallel processing challenges – Flynn's classification – SISD,
MIMD, SIMD, SPMD, and Vector Architectures – Multi-core processors and other Shared
Memory Multiprocessors.
PART-A
Q.No Questions BT Level Competence
1 Describe the main idea of ILP. BTL 2 Understand
2 Illustrate the overall speedup if a webserver is to be enhanced with BTL 3 Apply
a new CPU which is 10 times faster on computation than an old
CPU .The original CPU spent 40% of its time processing and 60%
of its time waiting for I/O.
3 List the three important properties of vector instructions. BTL 1 Remember
4 Analyze the main characteristics of SMT processor. BTL 4 Analyze
5 Quote the importance of loop unrolling technique. BTL 1 Remember
6 Define VLIW processor. BTL 1 Remember
7 Express anti-dependence. How is it removed? BTL 2 Understand
8 State the efficiency of superscalar processor. BTL 1 Remember
9 Differentiate between strong scaling and weak scaling. BTL 2 Understand
10 Show the performance of cluster organization. BTL 3 Apply
11 Compare SMT and hardware multithreading. BTL 5 Evaluate
12 Define the Flynn classification. BTL 1 Remember
13 Integrate the ideas of in-order execution and out-of-order execution. BTL 6 Create
14 Evaluate the ideas that justify a multicore processor. BTL 5 Evaluate
15 Quote fine grained multithreading. BTL 1 Remember
16 Express the need for instruction level parallelism. BTL 2 Understand
17 Formulate the various approaches to hardware multithreading. BTL 6 Create
18 Categorize the various multithreading options. BTL 4 Analyze
19 Differentiate fine grained multithreading and coarse grained BTL 4 Analyze
multithreading.
20 Classify shared memory multiprocessor based on the memory BTL 3 Apply
access latency.
PART-B
1 i).Define parallelism and its types. (4) BTL 1 Remember
ii).List the main characteristics of Instruction level parallelism. (9)
2 i).Give the concept of parallel processing. (4) BTL 2 Understand
ii).Summarize the challenges faced by parallel processing. (9)
3 Express in detail about hardware multithreading. (13) BTL 2 Understand
4 Solve: suppose you want to achieve a speed-up of 90 times faster (13) BTL 3 Apply
with 100 processors. What percentage of the original computation
can be sequential?
5 List the software and hardware techniques to achieve Instruction (13) BTL 1 Remember
Level Parallelism.
6 i).Point out how will you use shared memory concept in multi- (7) BTL 4 Analyze
processor?
ii).Compare and contrast Fine grained and Coarse grained
multithreading. (8)
7 i)Evaluate the features of Multicore processors. (6) BTL 5 Evaluate
ii) How message passing is implemented in Multiprocessors (7)
8 i).Classify the types of multithreading. (9) BTL 4 Analyze
ii).Analyze the advantages in multithreading. (4)
9 Formulate the ideas of Flynn’s classification. (13) BTL 6 Create
10 Elaborate in detail about the following BTL 1 Remember
i).SISD. (8)
ii).MIMD. (5)
11 Explain simultaneous Multithreading with example. (13) BTL 4 Analyze
12 i).Describe about Graphics Processing unit. (5) BTL 1 Remember
ii).Discuss about cluster and warehouse architecture. (8)
13 Illustrate the following in detail BTL 3 Apply
i).Data Dependence (5)
ii).Name Dependence (4)
iii).Control dependence (4)
14 Discuss the following in detail BTL 2 Understand
i).Vector processor. (7)
ii).Superscalar processor. (6)
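Worked check for Q.4 above: a minimal Python sketch that inverts Amdahl's law, assuming speed-up = 1 / (s + (1 - s) / n), where s is the sequential fraction and n the number of processors. Solving for s gives s = (n / S - 1) / (n - 1). The function names are illustrative.

```python
def amdahl_speedup(sequential_fraction, processors):
    """Speed-up predicted by Amdahl's law."""
    return 1.0 / (sequential_fraction + (1.0 - sequential_fraction) / processors)


def max_sequential_fraction(target_speedup, processors):
    """Largest sequential fraction that still reaches the target speed-up."""
    return (processors / target_speedup - 1.0) / (processors - 1.0)


s = max_sequential_fraction(90, 100)
print(f"Sequential fraction = {s:.6f} ({s * 100:.3f}% of the original computation)")
print(f"Check: speed-up with that fraction = {amdahl_speedup(s, 100):.2f}")
```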
PART C
1 Explain how this loop would be scheduled on a static two-issue (15) BTL 6 Create
pipeline for MIPS.
Loop: lw $t0, 0($s1) # $t0 = array element
addu $t0, $t0, $s2 # add scalar in $s2
sw $t0, 0($s1) # store result
addi $s1, $s1, -4 # decrement pointer
bne $s1, $zero, Loop # branch if $s1 != 0
Decide and reorder the instructions to avoid as many pipeline stalls
as possible. Assume branches are predicted, so that control hazards
are handled by the hardware.
2 A pipelined processor uses delayed branch technique. (15) BTL 5 Evaluate
Recommend any one of the following possibilities for the design of
the processor. In the first possibility, the processor has a 4-stage
pipeline and one delay slot. In the second possibility, it has a 6-stage
pipeline and two delay slots. Compare the performance of
these two alternatives, taking only the branch penalty into account.
Assume that 20% of the instructions are branch instructions and
that an optimizing compiler has an 80% success rate in filling in
the single delay slot. For the second alternative, the compiler is able
to fill the second slot 25% of the time.
3 Consider the following portions of two different programs running BTL 6 Create
at the same time on four processors in a symmetric multicore
processor (SMP). Assume that before this code is run, both x and y
are 0.
Core 1: x=2;
Core 2: y=2;
Core 3: w= x + y +1;
Core 4: z= x + y;
i. What are all the possible resulting values of w, x, y, z? For each
possible outcome, explain how we might arrive at those values. (8)
ii. Develop a way to make the execution more deterministic so that
only one set of values is possible. (7)
4 Suppose we want to perform 2 sums: one is a sum of 10 scalar (15) BTL 6 Create
variables and one is a matrix sum of a pair of two dimensional
arrays, with dimensions 10 by 10. For now let's assume only the
matrix sum is parallelizable. What speed-up do you get with
10 versus 40 processors? Next, calculate the speed-ups assuming
the matrices grow to 20 by 20.
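Worked check for Part C Q.4 above: a minimal Python sketch, assuming every addition takes one time unit, the 10 scalar additions stay sequential, and the matrix additions are divided evenly among the processors with no overhead. The function name is illustrative.

```python
def speedup(scalar_adds, matrix_dim, processors):
    """Speed-up when only the matrix additions are parallelized."""
    matrix_adds = matrix_dim * matrix_dim
    time_single = scalar_adds + matrix_adds
    time_parallel = scalar_adds + matrix_adds / processors
    return time_single / time_parallel


results = {
    (dim, procs): speedup(10, dim, procs)
    for dim in (10, 20)
    for procs in (10, 40)
}

for (dim, procs), value in results.items():
    print(f"{dim} x {dim} matrix, {procs} processors: speed-up = {value:.1f}")
```

The larger matrices give a better speed-up for the same processor count because the sequential part becomes a smaller share of the total work.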