Multithreading,
Superscalar,
Intel's Hyper-Threading (HT)
Contents
Using ILP support to exploit thread-level parallelism
Performance and efficiency in advanced multiple-issue processors
Threads
A thread is a basic unit of CPU utilization.
From the processor's point of view, a thread behaves like a separate process, with its own instructions and data.
A thread may represent a process that is part of a parallel program
consisting of multiple processes, or it may represent an
independent program.
It comprises a thread ID, a program counter, a register set, and a stack.
It shares its code section, data section, and other operating-system
resources, such as open files and signals, with other threads
belonging to the same process.
A traditional process has a single thread of control. If a process has
multiple threads of control, it can perform more than one task at a time.
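To make this concrete, here is a minimal sketch using POSIX threads (the counter and function names are illustrative, not from the slides): two threads in one process update the same variable in the shared data section, while each runs on its own stack with its own program counter and registers. Compile with cc -pthread.

    /* Two threads sharing the data section of one process. */
    #include <pthread.h>
    #include <stdio.h>

    static int shared_counter = 0;                 /* shared data section */
    static pthread_mutex_t lock = PTHREAD_MUTEX_INITIALIZER;

    static void *worker(void *arg)
    {
        for (int i = 0; i < 1000; i++) {           /* i lives on this thread's stack */
            pthread_mutex_lock(&lock);             /* protect the shared data */
            shared_counter++;
            pthread_mutex_unlock(&lock);
        }
        return NULL;
    }

    int main(void)
    {
        pthread_t t1, t2;
        pthread_create(&t1, NULL, worker, NULL);
        pthread_create(&t2, NULL, worker, NULL);
        pthread_join(t1, NULL);
        pthread_join(t2, NULL);
        printf("counter = %d\n", shared_counter);  /* prints 2000 */
        return 0;
    }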
Many software packages that run
on modern desktop PCs are
multithreaded.
For example:
A word processor may have:
a thread for displaying graphics,
another thread for responding to
keystrokes from the user, and
a third thread for performing spelling
and grammar checking in the
background.
Threads also play a vital role in remote procedure call (RPC)
systems.
RPC allows interprocess communication by providing a
communication mechanism similar to ordinary function or procedure
calls.
Many operating system kernels are multithreaded; several threads
operate in the kernel, and each thread performs a specific task, such as
managing devices or handling interrupts.
Multithreading
Benefits:
1. Responsiveness: Multithreading an interactive application may allow a
program to continue running even if part of it is blocked or is performing
a lengthy operation, thereby increasing responsiveness to the user.
For example: A multithreaded web browser could still allow user
interaction in one thread while an image was being loaded in another
thread.
2. Resource sharing: By default, threads share the memory and the
resources of the process to which they belong. The benefit of sharing
code and data is that it allows an application to have several different
threads of activity within the same address space.
3. Economy: Allocating memory and resources for process creation is
costly. Because threads share the resources of the process to which they
belong, it is more economical to create and context-switch threads.
4. Utilization of multiprocessor architectures: In a multiprocessor
architecture, threads may run in parallel on different processors.
A single-threaded process can run on only one CPU, no matter how
many are available.
Multithreading on a multi-CPU machine therefore increases concurrency.
Multithreading Models
Support for threads may be provided either at the user level or at
the kernel level.
User threads are supported above the kernel and are managed
without kernel support, whereas kernel threads are supported and
managed directly by the operating system.
Many-to-One Model:
The many-to-one model maps many user-
level threads to one kernel thread.
Thread management is done by the
thread library in user space, so it is
efficient.
Because only one thread can access the kernel at a time, multiple
threads are unable to run in parallel on multiprocessors.
One-to-One Model:
The one-to-one model maps each user
thread to a kernel thread.
It provides more concurrency than the many-
to-one model. It allows multiple threads to run in
parallel on multiprocessors.
The only drawback to this model is that
creating a user thread requires creating the
corresponding kernel thread.
The overhead of creating kernel threads can
burden the performance of an application.
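For example, Linux's NPTL pthreads library implements the one-to-one model: every pthread_create() call creates a corresponding kernel thread. A small Linux-specific sketch (illustrative) makes this visible by printing the kernel thread ID from the main thread and from a created thread; the two IDs differ.

    #define _GNU_SOURCE
    #include <pthread.h>
    #include <stdio.h>
    #include <sys/syscall.h>
    #include <unistd.h>

    static void *worker(void *arg)
    {
        /* Kernel thread ID backing this user thread */
        printf("worker kernel TID = %ld\n", (long)syscall(SYS_gettid));
        return NULL;
    }

    int main(void)
    {
        pthread_t t;
        printf("main   kernel TID = %ld\n", (long)syscall(SYS_gettid));
        pthread_create(&t, NULL, worker, NULL);   /* also creates a kernel thread */
        pthread_join(t, NULL);
        return 0;
    }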
Many-to-Many Model:
The many-to-many model multiplexes many
user-level threads to a smaller or equal
number of kernel threads.
The number of kernel threads may be specific
to either a particular application or a particular
machine.
Developers can create as many user threads
as necessary, and the corresponding kernel
threads can run in parallel on a
multiprocessor.
Multithreading: ILP Support to Exploit
Thread-Level Parallelism
Although ILP increases the performance of a system, it can be quite
limited or hard to exploit in some applications.
Furthermore, there may be parallelism occurring naturally at a higher
level in the application.
For example:
An online transaction-processing system has parallelism among the
multiple queries and updates. These queries and updates can be
processed mostly in parallel, since they are largely independent of one
another.
This higher-level parallelism is called thread-level parallelism (TLP)
because it is logically structured as separate threads of execution.
ILP consists of parallel operations within a loop or straight-line code,
whereas TLP is expressed through multiple threads of execution that
run in parallel.
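A minimal sketch of the distinction (array names and sizes are illustrative): within one thread, the independent element-wise additions in the loop are a source of ILP that pipelined, multiple-issue hardware can overlap; splitting the same iteration range across two POSIX threads expresses the work as TLP instead.

    #include <pthread.h>
    #include <stdio.h>

    #define N 1000
    static double a[N], b[N], c[N];

    /* ILP: the iterations are independent, so a superscalar core can
       overlap several additions within a single thread of execution. */
    static void add_range(int lo, int hi)
    {
        for (int i = lo; i < hi; i++)
            c[i] = a[i] + b[i];
    }

    /* TLP: the same work split across two threads that run in parallel. */
    static void *half(void *arg)
    {
        int which = *(int *)arg;
        add_range(which * (N / 2), (which + 1) * (N / 2));
        return NULL;
    }

    int main(void)
    {
        for (int i = 0; i < N; i++) { a[i] = i; b[i] = 2 * i; }

        pthread_t t0, t1;
        int id0 = 0, id1 = 1;
        pthread_create(&t0, NULL, half, &id0);
        pthread_create(&t1, NULL, half, &id1);
        pthread_join(t0, NULL);
        pthread_join(t1, NULL);

        printf("c[10] = %.1f\n", c[10]);   /* 30.0 */
        return 0;
    }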
Thread-level parallelism is an important alternative to instruction-
level parallelism.
In many applications thread-level parallelism occurs naturally (many
server applications).
If software is written from scratch, expressing the parallelism is much
easier.
But for established applications written without parallelism in mind,
there can be significant challenges, and it can be extremely costly to
rewrite them to exploit thread-level parallelism.
There are two main approaches to multithreading:
fine-grained multithreading and
coarse-grained multithreading.
Fine-grained multithreading:
It switches between threads on each instruction, causing the
execution of multiple threads to be interleaved.
This interleaving is often done in a round-robin fashion.
To make fine-grained multithreading practical, the CPU must be
able to switch threads on every clock cycle.
Coarse-grained multithreading:
It was invented as an alternative to fine-grained multithreading.
Coarse-grained multithreading switches threads only on costly stalls,
such as level-2 cache misses.
This relieves the need to make thread switching essentially free,
since switches are rare.
The main difference between fine-grained and coarse-grained
multithreading is that in fine-grained multithreading the threads issue
instructions in a round-robin manner, while in coarse-grained
multithreading a thread issues instructions until a stall occurs.
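The contrast can be illustrated with a toy issue-order simulation (entirely illustrative; it models only the order in which instructions issue, not their latencies). Each thread is a string of instructions, and 'S' marks an instruction that causes a long stall such as a cache miss: the fine-grained policy switches threads every cycle, while the coarse-grained policy switches only after a stall.

    #include <stdio.h>

    #define LEN 5
    /* 'S' marks an instruction that causes a long stall (e.g., a cache miss). */
    static const char *thr[2] = { "aaSaa", "bbbSb" };

    static void fine_grained(void)             /* switch threads every cycle */
    {
        int pos[2] = { 0, 0 }, cur = 0;
        printf("fine-grained   : ");
        while (pos[0] < LEN || pos[1] < LEN) {
            if (pos[cur] < LEN)
                putchar(thr[cur][pos[cur]++]);
            cur = 1 - cur;                     /* round-robin */
        }
        putchar('\n');                         /* prints ababSbaSab */
    }

    static void coarse_grained(void)           /* switch threads only on a stall */
    {
        int pos[2] = { 0, 0 }, cur = 0;
        printf("coarse-grained : ");
        while (pos[0] < LEN || pos[1] < LEN) {
            if (pos[cur] < LEN) {
                char c = thr[cur][pos[cur]++];
                putchar(c);
                if (c == 'S') cur = 1 - cur;   /* costly stall: switch thread */
            } else {
                cur = 1 - cur;                 /* current thread finished */
            }
        }
        putchar('\n');                         /* prints aaSbbbSaab */
    }

    int main(void)
    {
        fine_grained();
        coarse_grained();
        return 0;
    }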
SCALAR PROCESSOR
A scalar processor is classified as an SISD (single instruction,
single data) processor. A scalar processor processes only one datum
at a time.
In a scalar organization, a single pipelined functional
unit exists for:
• integer operations; and
• another for floating-point operations.
Functional unit:
• Part of the CPU responsible for calculations
SUPERSCALAR PROCESSOR
A superscalar processor is a CPU that implements
a form of parallelism called instruction-level
parallelism within a single processor.
A superscalar CPU can execute more than one
instruction per clock cycle. At a given clock rate
(measured in clock cycles per second, i.e., megahertz
or gigahertz), a superscalar processor will therefore be
faster than a scalar processor.
It has the ability to execute instructions in different
pipelines:
• independently and concurrently.
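For intuition, a sketch (the variables are illustrative): the two statements below have no data dependence on each other, so a two-way superscalar processor can issue the corresponding instructions in the same clock cycle, one per pipeline.

    #include <stdio.h>

    int main(void)
    {
        int x = 1, y = 2, p = 3, q = 4;
        /* No dependence between these two statements: a two-way
           superscalar machine may issue both in one cycle. */
        int a = x + y;
        int b = p * q;
        printf("%d %d\n", a, b);
        return 0;
    }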
PIPELINE PROBLEMS:
The pipeline concept itself introduces some problems.
A resource (structural) hazard exists when the hardware resource
required by an instruction is unavailable because a previous
instruction is still using it.
Data hazards:
occur when the pipeline changes the order of
read/write accesses to operands so that the order
differs from the order seen by sequentially
executing instructions on the unpipelined machine.
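A small sketch of a data hazard (variable names are illustrative): the second statement reads a value that the first statement produces, a read-after-write (RAW) dependence, so the pipeline must forward the result or stall to keep the same answer as sequential execution.

    #include <stdio.h>

    int main(void)
    {
        int b = 2, c = 3, e = 10;
        int a = b + c;             /* writes a                                   */
        int d = a + e;             /* reads a: RAW hazard on the previous result */
        printf("%d %d\n", a, d);   /* 5 15 */
        return 0;
    }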
(Figure: execution scenario.)
Simultaneous multithreading (SMT)
• A mix of the superscalar and multithreading techniques.
• All hardware contexts are active, leading to competition for resources.
• Multiple instructions are issued from multiple threads in the same cycle.
• Both TLP and ILP come into play.
• Issue slots in each cycle are filled with instructions from different threads.
• Key design issues: resource organization and resource sharing.
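Intel's Hyper-Threading (HT) is Intel's implementation of SMT. The slot-filling idea can be sketched with a toy loop (illustrative only, not a real scheduler): each cycle, up to ISSUE_WIDTH issue slots are filled with instructions drawn from whichever threads still have work, so TLP (several threads) and ILP (several slots per cycle) are exploited together.

    #include <stdio.h>

    #define ISSUE_WIDTH 4
    #define NTHREADS    2

    int main(void)
    {
        /* Per-thread instruction streams; the letter identifies the thread. */
        const char *stream[NTHREADS] = { "AAAAAA", "BBBB" };
        int len[NTHREADS] = { 6, 4 };
        int pos[NTHREADS] = { 0, 0 };
        int remaining = 10;

        for (int cycle = 0; remaining > 0; cycle++) {
            printf("cycle %d: ", cycle);
            int slots = ISSUE_WIDTH;
            /* Fill the issue slots round-robin from threads that still
               have instructions; no single thread has to fill them all. */
            for (int t = 0; slots > 0 && remaining > 0; t = (t + 1) % NTHREADS) {
                if (pos[t] < len[t]) {
                    putchar(stream[t][pos[t]++]);
                    slots--;
                    remaining--;
                }
            }
            putchar('\n');   /* prints ABAB, ABAB, AA over three cycles */
        }
        return 0;
    }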