Mohammed VI Polytechnic University
TP2 - OpenMP (Introduction)
Imad Kissami
February 16, 2025
Exercise 1:
In this very simple exercise, you need to:
1. Write an OpenMP program displaying the number of threads used for the execution and
the rank of each of the threads.
2. Compile the code manually to create a monoprocessor executable and a parallel executable (possible compilation commands are given below).
3. Test the programs obtained, running the parallel program with different numbers of threads, without submitting batch jobs.
Output example for the parallel program with 4 threads:
Hello from the rank 2 thread
Hello from the rank 1 thread
Hello from the rank 3 thread
Hello from the rank 0 thread
Parallel execution of hello_world with 4 threads
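A minimal sketch of such a program is shown below (the file name hello_world.c is only an example). The _OPENMP guard lets the same source file build both the monoprocessor and the parallel executable:

#include <stdio.h>
#ifdef _OPENMP
#include <omp.h>
#endif

int main(void)
{
    int nb_threads = 1;

    #pragma omp parallel
    {
        int rank = 0;
#ifdef _OPENMP
        rank = omp_get_thread_num();          /* rank of the current thread */
        #pragma omp single
        nb_threads = omp_get_num_threads();   /* total number of threads */
#endif
        printf("Hello from the rank %d thread\n", rank);
    }

    printf("Parallel execution of hello_world with %d threads\n", nb_threads);
    return 0;
}

Possible compilation and execution commands with GCC (the executable names are arbitrary):

gcc hello_world.c -o hello_seq            # monoprocessor executable
gcc -fopenmp hello_world.c -o hello_par   # parallel executable
OMP_NUM_THREADS=4 ./hello_par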
Exercise 2: Parallelizing the PI calculation
static long num_steps = 100000;
double step;
int main()
{
int i; double x, pi, sum = 0.0;
step = 1.0 / (double) num_steps;
for (i = 0; i < num_steps; i++) {
x = (i + 0.5) * step;
sum = sum + 4.0 / (1.0 + x * x);
}
pi = step * sum;
}
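The loop approximates pi by the midpoint rule applied to pi = ∫_0^1 4/(1+x^2) dx: each iteration evaluates the integrand at x_i = (i + 0.5) * step, and the accumulated sum is finally scaled by the rectangle width step.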
1. Create a parallel version of the pi program using a parallel construct (a possible SPMD sketch is given after this list).
2. Do not use #pragma omp parallel for.
3. Pay close attention to shared versus private variables.
4. Use double omp_get_wtime() to measure the elapsed (wall-clock) time.
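One possible structure is the classic SPMD sketch below: each thread accumulates a partial sum indexed by its rank, and the partial sums are combined after the parallel region. The MAX_THREADS constant and the per-thread sum array are assumptions of this sketch, not part of the original program.

#include <stdio.h>
#include <omp.h>

#define MAX_THREADS 16        /* assumed upper bound on the number of threads */

static long num_steps = 100000;
double step;

int main(void)
{
    double sum[MAX_THREADS] = {0.0};   /* one partial sum per thread (shared array) */
    double pi = 0.0;
    int nthreads = 1;

    step = 1.0 / (double) num_steps;
    double t0 = omp_get_wtime();

    #pragma omp parallel
    {
        int id = omp_get_thread_num();
        int nt = omp_get_num_threads();
        if (id == 0) nthreads = nt;

        /* x is declared inside the region, hence private to each thread */
        double x;
        for (long i = id; i < num_steps; i += nt) {   /* cyclic distribution of iterations */
            x = (i + 0.5) * step;
            sum[id] += 4.0 / (1.0 + x * x);
        }
    }

    for (int i = 0; i < nthreads; i++)
        pi += step * sum[i];

    double t1 = omp_get_wtime();
    printf("pi = %.12f, elapsed time = %f s\n", pi, t1 - t0);
    return 0;
}

Adjacent entries of sum[] share cache lines, so this version can suffer from false sharing; accumulating into a thread-local variable and writing it to sum[id] once at the end is a common refinement.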
Exercise 3: Pi with loops
• Go back to the serial pi program and parallelize it with a loop construct.
• Your goal is to minimize the number of changes made to the serial program (add only 1 line), as in the sketch below.
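A sketch of the expected result, assuming the single added line is a combined parallel for directive with a reduction (the final printf is only there to check the value and is not part of the required change):

#include <stdio.h>
#include <omp.h>

static long num_steps = 100000;
double step;

int main(void)
{
    int i;
    double x, pi, sum = 0.0;

    step = 1.0 / (double) num_steps;

    /* The one added line: iterations are split among threads, x is made
       private, and the partial sums are combined by the reduction clause. */
    #pragma omp parallel for private(x) reduction(+:sum)
    for (i = 0; i < num_steps; i++) {
        x = (i + 0.5) * step;
        sum = sum + 4.0 / (1.0 + x * x);
    }
    pi = step * sum;

    printf("pi = %.12f\n", pi);
    return 0;
}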
Exercise 4: Parallelizing Matrix Multiplication with OpenMP
// Allocate memory dynamically
double *a = (double *) malloc(m * n * sizeof(double));
double *b = (double *) malloc(n * m * sizeof(double));
double *c = (double *) malloc(m * m * sizeof(double));
// Initialize matrices
for (int i = 0; i < m; i++) {
for (int j = 0; j < n; j++) {
a[i * n + j] = (i + 1) + (j + 1); // Access via 1D indexing
}
}
for (int i = 0; i < n; i++) {
for (int j = 0; j < m; j++) {
b[i * m + j] = (i + 1) - (j + 1);
}
}
for (int i = 0; i < m; i++) {
for (int j = 0; j < m; j++) {
c[i * m + j] = 0;
}
}
// Matrix multiplication
for (int i = 0; i < m; i++) {
for (int j = 0; j < m; j++) {
for (int k = 0; k < n; k++) {
c[i * m + j] += a[i * n + k] * b[k * m + j];
}
}
}
The code computes the matrix product
C = A × B
where A is an m × n matrix, B is an n × m matrix, and C is the resulting m × m matrix.
• In this exercise, you must:
1. Insert the appropriate OpenMP directives and analyze the code performance.
2. Use the collapse clause to parallelize this matrix multiplication code (a possible placement is sketched after this list).
3. Run the code using 1, 2, 4, 8, 16 threads and plot the speedup and efficiency.
4. Test the loop iteration scheduling policies (static, dynamic, guided) and vary the chunk sizes.
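One possible placement of the directive for step 2 is sketched below; schedule(static) is just the first of the policies to try in step 4:

// Collapse the two outer loops into a single iteration space of m*m (i, j) pairs;
// the k loop stays sequential inside each pair, so no reduction on c is needed.
#pragma omp parallel for collapse(2) schedule(static)
for (int i = 0; i < m; i++) {
    for (int j = 0; j < m; j++) {
        for (int k = 0; k < n; k++) {
            c[i * m + j] += a[i * n + k] * b[k * m + j];
        }
    }
}

For step 3, speedup and efficiency can be computed as S(p) = T(1) / T(p) and E(p) = S(p) / p, where T(p) is the wall-clock time measured with p threads (omp_get_wtime() can be used as in Exercise 2). For step 4, replace schedule(static) with schedule(static, chunk), schedule(dynamic, chunk) or schedule(guided, chunk) and vary chunk.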
Exercise 5: Parallelizing the Jacobi Method with OpenMP
The program solves a general linear system using the Jacobi iterative method.
#include <stdio.h>
#include <stdlib.h>
#include <string.h>
#include <float.h>
#include <math.h>
#include <sys/time.h>
#include <omp.h> // Replaces time.h
// Default matrix size
# ifndef VAL_N
# define VAL_N 120
#endif
# ifndef VAL_D
# define VAL_D 80
#endif
// Random initialization of an array
void random_number(double* array , int size) {
for (int i = 0; i < size; i++) {
array[i] = (double)rand() / (double)(RAND_MAX - 1);
}
}
int main () {
int n = VAL_N , diag = VAL_D;
int i, j, iteration = 0;
double norme;
// Correct 2D matrix allocation
double *a = (double *) malloc(n * n * sizeof(double));
double *x = (double *) malloc(n * sizeof(double));
double *x_courant = (double *) malloc(n * sizeof(double));
double *b = (double *) malloc(n * sizeof(double));
if (!a || !x || !x_courant || !b) {
fprintf(stderr, "Memory allocation failed!\n");
exit(EXIT_FAILURE );
}
// Time measurement variables
struct timeval t_elapsed_0, t_elapsed_1;
double t_elapsed;
double t_cpu_0, t_cpu_1, t_cpu;
// Matrix and RHS initialization
srand(421); // For reproducibility
random_number(a, n * n);
random_number(b, n);
// Strengthening the diagonal
for (i = 0; i < n; i++) {
a[i * n + i] += diag; // Corrected indexing
}
// Initial solution
for (i = 0; i < n; i++) {
x[i] = 1.0;
}
// Start timing
t_cpu_0 = omp_get_wtime();
gettimeofday(&t_elapsed_0, NULL);
// Jacobi Iteration
while (1) {
iteration++;
for (i = 0; i < n; i++) {
x_courant[i] = 0;
for (j = 0; j < i; j++) {
x_courant[i] += a[j * n + i] * x[j]; // Corrected indexing
}
for (j = i + 1; j < n; j++) {
x_courant[i] += a[j * n + i] * x[j]; // Corrected indexing
}
x_courant[i] = (b[i] - x_courant[i]) / a[i * n + i]; // Corrected indexing
}
// Convergence test
double absmax = 0;
for (i = 0; i < n; i++) {
double curr = fabs(x[i] - x_courant[i]);
if (curr > absmax)
absmax = curr;
}
norme = absmax / n;
if ((norme <= DBL_EPSILON) || (iteration >= n)) break;
// Copy x_courant to x
memcpy(x, x_courant, n * sizeof(double));
}
// End timing
gettimeofday(&t_elapsed_1, NULL);
t_elapsed = (t_elapsed_1.tv_sec - t_elapsed_0.tv_sec) +
(t_elapsed_1.tv_usec - t_elapsed_0.tv_usec) / 1e6;
t_cpu_1 = omp_get_wtime();
t_cpu = t_cpu_1 - t_cpu_0;
// Print result
fprintf(stdout, "\n\n"
"   System size          : %5d\n"
"   Iterations           : %4d\n"
"   Norme                : %10.3E\n"
"   Elapsed time         : %10.3E sec.\n"
"   CPU time             : %10.3E sec.\n",
n, iteration, norme, t_elapsed, t_cpu
);
// Free allocated memory
free(a);
free(x);
free(x_courant);
free(b);
return EXIT_SUCCESS;
}
A × x = b
1. In this exercise, you must solve the system in parallel (a possible parallelization of the update loop is sketched below).
2. Run the code using 1, 2, 4, 8, 16 threads and plot the speedup and efficiency.
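A minimal sketch of one way to parallelize the body of the while loop is given below (it assumes OpenMP 3.1 or later for the max reduction; the rest of the program is unchanged). The outer while loop is inherently sequential, since each iteration depends on the previous x, so only the loops over i are parallelized:

// Update step: each row i of x_courant is independent, so the loop parallelizes directly.
#pragma omp parallel for private(j)
for (i = 0; i < n; i++) {
    x_courant[i] = 0;
    for (j = 0; j < i; j++)
        x_courant[i] += a[j * n + i] * x[j];
    for (j = i + 1; j < n; j++)
        x_courant[i] += a[j * n + i] * x[j];
    x_courant[i] = (b[i] - x_courant[i]) / a[i * n + i];
}

// Convergence test: max reduction over the per-component differences.
double absmax = 0;
#pragma omp parallel for reduction(max:absmax)
for (i = 0; i < n; i++) {
    double curr = fabs(x[i] - x_courant[i]);
    if (curr > absmax)
        absmax = curr;
}
norme = absmax / n;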