Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View nobythecreator's full-sized avatar

Block or report nobythecreator

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Popular repositories Loading

  1. smoothquant smoothquant Public

    Forked from mit-han-lab/smoothquant

    [ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

    Python

  2. Mixture-Compressor-MoE Mixture-Compressor-MoE Public

    Forked from Aaronhuang-778/Mixture-Compressor-MoE

    [ICLR 2025] Mixture Compressor for Mixture-of-Experts LLMs Gains More

    Python

  3. nano-vllm nano-vllm Public

    Forked from GeeeekExplorer/nano-vllm

    Nano vLLM

    Python

  4. MxMoE MxMoE Public

    Forked from cat538/MxMoE

    [ICML 2025] MxMoE: Mixed-precision Quantization for MoE with Accuracy and Performance Co-Design

    Python

  5. paroquant paroquant Public

    Forked from z-lab/paroquant

    [ICLR 2026] ParoQuant: Pairwise Rotation Quantization for Efficient Reasoning LLM Inference

    Python

  6. FP-Quant FP-Quant Public

    Forked from IST-DASLab/FP-Quant

    Python