Some common CUDA kernel implementations (Not the fastest).
-
Updated
Aug 14, 2025 - Cuda
Some common CUDA kernel implementations (Not the fastest).
Add a description, image, and links to the layernorm topic page so that developers can more easily learn about it.
To associate your repository with the layernorm topic, visit your repo's landing page and select "manage topics."