pytorch-extension

Star

Here are 19 public repositories matching this topic...

ildoonet / pytorch-gradual-warmup-lr

Star

Gradually-Warmup Learning Rate Scheduler for PyTorch

deep-learning pytorch large-scale-learning multigpu multinode learning-rate-decay pytorch-extension

Updated Oct 10, 2024
Python

stevewongv / SPANet

Star

Spatial Attentive Single-Image Deraining with a High Quality Real Rain Dataset (CVPR'19)

computer-vision cuda pytorch cupy low-level-vision pytorch-extension deraining cvpr2019

Updated Aug 30, 2021
Jupyter Notebook

stevewongv / DSC-PyTorch

Star

A PyTorch implementation of "Direction-Aware Spatial Context Features for Shadow Detection" CVPR'18 | T-PAMI'19

pytorch dsc cvpr cvpr2018 pytorch-extension pami

Updated Sep 3, 2024
Python

cmpark0126 / pytorch-polynomial-lr-decay

Star

Polynomial Learning Rate Decay Scheduler for PyTorch

deep-learning pytorch learning-rate-decay pytorch-extension learning-rate-scheduling

Updated Dec 25, 2021
Python

ZichaoLong / aTEAM

Star

A pyTorch Extension for Applied Mathematics

applied-mathematics pytorch-extension

Updated Mar 17, 2020
Python

aredden / torch-bnb-fp4

Star

Faster Pytorch bitsandbytes 4bit fp4 nn.Linear ops

python pytorch cuda-kernels quantization pytorch-extension

Updated Mar 16, 2024
Python

lavawolfiee / mini-flash-attention

Star

Minimal FlashAttention in CUDA C++/CuTe: readable WMMA/CuTe kernels, no NxN workspace, up to 4.5x faster than naive PyTorch

cuda attention cutlass cute gpu-kernels pytorch-extension tensor-cores llm flash-attention flashattention wmma

Updated Jun 2, 2026
Cuda

openml / openml-pytorch

Sponsor

Star

Pytorch extension for openml-python

python pytorch openml hacktoberfest pytorch-extension

Updated May 19, 2026
Jupyter Notebook

frgfm / torch-cuda-template

Sponsor

Star

Template for CUDA / C++ extension writing with PyTorch

cpp cuda pytorch pytorch-extension

Updated Sep 9, 2020
Python

pminhtam / xnor_conv_pytorch_extension

Star

XNOR-Net with binary conv2d kernels with XNOR GEMM op, support both CPU and GPU.

cpp cuda pytorch xnor-net gemm binary-convolutions xnor-convolutions binary-neural-networks binary-op pytorch-extension

Updated Oct 25, 2022
C

artitw / apex

Star

A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch

amp pytorch apex pytorch-extension

Updated Sep 11, 2021
Python

djalex88 / gaussian_rbf

Star

PyTorch extension

machine-learning pytorch pytorch-extension

Updated Jun 1, 2021
Cuda

atm-mistake / block-sparse-attn

Star

Pre-compiled custom CUDA extension for Block Sparse Attention (Python 3.11 / PyTorch 2.6.0+cu124).

python machine-learning cuda pytorch pytorch-extension llm-optimization cuda-12 block-sparse-attention precompiled-wheel

Updated May 4, 2026

AmirMardan / pytorch_extending_cpp_binding

Star

Binding C++ to PyTorch and extending PyTorch

autograd pytorch-examples cpp-bindings pytorch-extension pytorch-learning torch-cpp

Updated Nov 2, 2022
Python

NoNans / nonans

Star

The numerical continuity layer for GPU computing

training gpu cuda pytorch numerical-stability ml-infrastructure pytorch-extension llm

Updated May 6, 2026
Python

tbox98 / freegrad

Star

PyTorch extension for alternative backward rules and gradient transforms (STE, gradient jamming, non-standard activations).

training research deep-learning python3 pytorch autograd neural-networks gradient backpropagation binary-neural-networks activation-functions gradient-clipping pytorch-extension gradient-modification

Updated Nov 30, 2025
Python

eshibusawa / Simple-Examples

Star

simple examples of tools and libraries

python cuda pybind11 cupy cub pytorch-extension tensorcore

Updated Jan 27, 2026
Python

torajharsh / aether-scale

Star

High-performance matrix engine for Unit-Domain Flow (UDF). Eliminates Mantissa Friction with 0.00 MSE integrity.

Updated Feb 17, 2026
Python

LioEinaudi / mini-vllm-cuda

Star

CUDA kernels for LLM decode-stage inference, built as a PyTorch extension with correctness tests and latency benchmarks.

gpu-computing pytorch-extension inference-optimization kv-cache rmsnorm llm-inference

Updated May 21, 2026
Python

Improve this page

Add a description, image, and links to the pytorch-extension topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the pytorch-extension topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

pytorch-extension

Here are 19 public repositories matching this topic...

ildoonet / pytorch-gradual-warmup-lr

stevewongv / SPANet

stevewongv / DSC-PyTorch

cmpark0126 / pytorch-polynomial-lr-decay

ZichaoLong / aTEAM

aredden / torch-bnb-fp4

lavawolfiee / mini-flash-attention

openml / openml-pytorch

frgfm / torch-cuda-template

pminhtam / xnor_conv_pytorch_extension

artitw / apex

djalex88 / gaussian_rbf

atm-mistake / block-sparse-attn

AmirMardan / pytorch_extending_cpp_binding

NoNans / nonans

tbox98 / freegrad

eshibusawa / Simple-Examples

torajharsh / aether-scale

LioEinaudi / mini-vllm-cuda

Improve this page

Add this topic to your repo