Thanks to visit codestin.com
Credit goes to github.com

nobythecreator

Follow

Noby nobythecreator

Follow

3 followers · 4 following

Popular repositories Loading

smoothquant smoothquant Public

Forked from mit-han-lab/smoothquant

[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

Python
Mixture-Compressor-MoE Mixture-Compressor-MoE Public

Forked from Aaronhuang-778/Mixture-Compressor-MoE

[ICLR 2025] Mixture Compressor for Mixture-of-Experts LLMs Gains More

Python
nano-vllm nano-vllm Public

Forked from GeeeekExplorer/nano-vllm

Nano vLLM

Python
MxMoE MxMoE Public

Forked from cat538/MxMoE

[ICML 2025] MxMoE: Mixed-precision Quantization for MoE with Accuracy and Performance Co-Design

Python
paroquant paroquant Public

Forked from z-lab/paroquant

[ICLR 2026] ParoQuant: Pairwise Rotation Quantization for Efficient Reasoning LLM Inference

Python
FP-Quant FP-Quant Public

Forked from IST-DASLab/FP-Quant

Python