Riven

Parallel programing with Cuda

Clone with submodule

git clone --recursive [email protected]:mrzhuzhe/riven.git

Submodules list:

Eigen

Notice

all application is compiler with cuda Arch 80 (RTX 3090) you can change it on CMakeLists

Applications

Linear equation Solver

cd solver

# build
cmake -S src -B build
cmake --build build

# test_case
# 1. simple guassion elimination solver (as same as middle school)
test_simp

# 2. LU factor, PLU factor, PLU linear equation solver 
test_lu

# 3. power method for eigen value, household process for simillar matrix, qr factor for eigen values
test_eigen

# 4. jacobian iteration, guassion_seidel iteration, multi grid method, conjugate gradient
test_iteration 32 1

# 5. conjugate gradient, biconjugate gradient, preconditioner(jacobian and Incomplete_Cholesky_factorization) conjugate gradient, GMRES, biconjugate gradient stablized
test_cg 32 1

Gemm

both for CPU and CUDA

// x86 gemm
cd /gemm/
// nowaday the best result /MMult22_avx.c it's about 60GFlops (corresponding openBLAS is about 75GFLOPS )
// [TODO]use core-avx2 is much better than mavx
// [TODO] 8x6_avx is only about 40GFLOPS  MMult22_avx3_8x6.c
// [TODO] inline volatile seems not work

// cuda gemm
cd /cuda_gemm/
//  best result is /MMult_cuda_6_1.cu

CUDA

cd /cuda_test/

// subfolders with corrosponding apps

/mm  // shared and texture memory

/nn  // a neural network like caffe

/warp   // cuda concept about grid block warp and cooperate group

/pipline    // cuda stream events 

/pattern    // application like convulition


# build
cmake -S src -B build
cmake --build build

Legacy

/RayTracing learn ray tracing in one week
/openmp_test
/llvm llvm totorial
/cuda_fluid

Name		Name	Last commit message	Last commit date
Latest commit History 532 Commits
MultiThreads		MultiThreads
RayTracing/src		RayTracing/src
avx/src		avx/src
clang		clang
cmake_test/step1/src		cmake_test/step1/src
conv/src		conv/src
cpp_test/src		cpp_test/src
cuda_conv/src		cuda_conv/src
cuda_fluid/src		cuda_fluid/src
cuda_gemm/src		cuda_gemm/src
cuda_practice/src		cuda_practice/src
cuda_test		cuda_test
cutlass		cutlass
docs		docs
eigen @ 0951ad2		eigen @ 0951ad2
eigen_test		eigen_test
fp16		fp16
gemm/src		gemm/src
intel		intel
llvm		llvm
lowgemm		lowgemm
neon/test		neon/test
openmp_test/src		openmp_test/src
operator/src		operator/src
qemu		qemu
solver		solver
sse/src		sse/src
template/src		template/src
valgrind/src		valgrind/src
vulkan		vulkan
.gitignore		.gitignore
.gitmodules		.gitmodules
ReadMe.md		ReadMe.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Riven

Clone with submodule

Notice

Applications

Linear equation Solver

Gemm

CUDA

Legacy

About

Uh oh!

Releases

Packages

Languages

mrzhuzhe/riven

Folders and files

Latest commit

History

Repository files navigation

Riven

Clone with submodule

Notice

Applications

Linear equation Solver

Gemm

CUDA

Legacy

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages