High Performance Computing (HPC) Class Assignments

This repository contains the assignments and projects completed during my High Performance Computing (HPC) course at University of Thessaly. The coursework focuses on utilizing advanced computing techniques to solve complex, computationally intensive problems efficiently.

Overview

The assignments cover a variety of topics central to HPC, including:

Parallel Programming: Using multi-threading and distributed computing to maximize efficiency.
Optimization: Analyzing and improving the performance of code.
Performance Analysis: Profiling and benchmarking algorithms.

These assignments make use of popular tools and libraries, such as OpenMP (Open Multi-Processing) and CUDA (Compute Unified Device Architecture), to implement solutions in parallel environments.

Technologies Used

The projects and assignments were built using the following tools and libraries:

C/C++ for core programming
OpenMP for shared memory multiprocessing
CUDA for GPU-based parallel computing
GCC/ICX compilers for C/C++ programs
NVCC compiler for CUDA programs
Profiling tools like gprof, nvprof, etc.

Assignments

Assignment 1: Code optimizations on Sobel filter
- Implement code optimization techniques to enhance the performance of the Sobel filter, focusing on methods like: loop interchange, loop unrolling, function inlining, etc.
- Use compiler optimizations to further improve the code performance (e.g register allocation, restrict pointer declarations).
- Code profiling and analysis.
Assignment 2: Parallelizing KMeans clustering using OpenMP
- Identifying the parallelizable sections of the algorithm and implementing them using OpenMP.
- Applying optimizations to enhance the parallelized algorithm, such as minimizing critical or atomic sections of the code and utilizing reduction techniques.
- Using AVX/SSE instructions (manual vectorization) to boost performance in areas where parallelization is not effective.
Assignment 3: Introduction to CUDA: Convolutions
- Implemented a 2D convolution filter by decomposing it into row-wise and column-wise operations, applying these separately to an image.
- Experimented with different grid/block geometries for GPU execution, analyzing performance and comparing results with CPU execution.
- Investigated the impact of using double-precision instead of single-precision floating-point numbers on accuracy and performance.
- Addressed the problem of thread divergence by padding image arrays, eliminating boundary checks, and evaluating its impact on CPU and GPU performance.
Assignment 4: Histogram Equalization - Acceleration with CUDA
- The primary objective of this project was to reduce execution time of histogram calculation and image equalization as much as possible.
- Used CUDA to accelerate a histogram equalization algorithm for greyscale images.
- Explored a variety of optimizations techniques, including shared memory utilization, aggregation, privatization, and memory access optimizations (pinned, unified, texture memory).
- Investigated the use of CUDA streams to overlap dara transfers with kernel exectuion.
Assignment 5: N-Body Simulation Using CUDA
- Parallelized the sequential n-body simulation using OpenMP.
- Ported the n-body simulation to CUDA for further parallelization.
- Implemented different optimization strategies like: data distribution, tiling, loop unrolling, approximate optimizations etc.
- Profiled and compared all implementation (serial, OpenMP, CUDA).

Installation

To run these assignments on your local machine, follow these steps:

Clone the repository:

git clone https://github.com/your-username/your-repo-name.git

Navigate to the assignment folder:
```
cd assignment-1
```

For all assignments (OpenMP, CUDA etc), specific compilation instructions are provided within the respective assignment folders.

Usage

Each assignment folder includes detailed instructions for compiling and running the code. Refer to the README.md inside each folder for assignment-specific details. Also, under each assignment folder there is a detailed report about each assignment.

Name		Name	Last commit message	Last commit date
Latest commit History 247 Commits
Lab1		Lab1
Lab2		Lab2
Lab3		Lab3
Lab4		Lab4
Lab5		Lab5
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

High Performance Computing (HPC) Class Assignments

Table of Contents

Overview

Technologies Used

Assignments

Installation

Usage

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

sliaskonis/High-Performance-Computing

Folders and files

Latest commit

History

Repository files navigation

High Performance Computing (HPC) Class Assignments

Table of Contents

Overview

Technologies Used

Assignments

Installation

Usage

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages