Popular repositories Loading
-
smoothquant
smoothquant PublicForked from mit-han-lab/smoothquant
[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
Python
-
Mixture-Compressor-MoE
Mixture-Compressor-MoE PublicForked from Aaronhuang-778/Mixture-Compressor-MoE
[ICLR 2025] Mixture Compressor for Mixture-of-Experts LLMs Gains More
Python
-
-
MxMoE
MxMoE PublicForked from cat538/MxMoE
[ICML 2025] MxMoE: Mixed-precision Quantization for MoE with Accuracy and Performance Co-Design
Python
-
paroquant
paroquant PublicForked from z-lab/paroquant
[ICLR 2026] ParoQuant: Pairwise Rotation Quantization for Efficient Reasoning LLM Inference
Python
-
If the problem persists, check the GitHub status page or contact support.