bzhng-development
Popular repositories
- sglang Public Forked from sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
Python
- flashinfer Public Forked from flashinfer-ai/flashinfer
FlashInfer: Kernel Library for LLM Serving
Python
- LeetCUDA Public Forked from xlite-dev/LeetCUDA
📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉
Cuda
- DeepGEMM Public Forked from deepseek-ai/DeepGEMM
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
Cuda
- flash-attention Public Forked from Dao-AILab/flash-attention
Fast and memory-efficient exact attention
Python
Repositories
- sglang Public Forked from sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
- SpecForge Public Forked from sgl-project/SpecForge
Train speculative decoding models effortlessly and port them smoothly to SGLang serving.
- flash-attention Public Forked from Dao-AILab/flash-attention
Fast and memory-efficient exact attention
- TensorRT-Model-Optimizer Public Forked from NVIDIA/Model-Optimizer
A unified library of state-of-the-art model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM or TensorRT to optimize inference speed.
- DeepGEMM Public Forked from deepseek-ai/DeepGEMM
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
People
This organization has no public members. You must be a member to see who’s a part of this organization.