Thanks to visit codestin.com
Credit goes to github.com

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
1_CUDA_Softmax		1_CUDA_Softmax
2_Triton_Softmax		2_Triton_Softmax
3_Triton_GeMM		3_Triton_GeMM
4_Nsight_Compute		4_Nsight_Compute
6_Triton_Flash_Attn		6_Triton_Flash_Attn
7_Triton_Fused_Kernels		7_Triton_Fused_Kernels
8_More_Triton_Kernels		8_More_Triton_Kernels
Slides		Slides
README.md		README.md

Repository files navigation

Triton Kernel Study Group

Introduction

Triton crash course prepared for beginners.

Outline

Introduction to GPU architecture
Write a simple CUDA kernel: softmax
Introduction to Triton and Triton softmax kernel
Tensor Core and Triton matrix multiplication
Debugging kernels using NVIDIA NCU
Flash-Attention algorithm
Triton Flash-Attention kernels (fwd & bwd)
Triton kernel examples #1
Triton kernel examples #2

About

No description, website, or topics provided.

Report repository

Releases

No releases published

Packages

No packages published

Languages

Python 95.0%
Cuda 4.6%
Other 0.4%