Popular repositories Loading
-
qlora
qlora PublicForked from artidoro/qlora
QLoRA: Efficient Finetuning of Quantized LLMs
Jupyter Notebook
-
QIGen
QIGen PublicForked from IST-DASLab/QIGen
Repository for CPU Kernel Generation for LLM Inference
Python
-
RPTQ4LLM
RPTQ4LLM PublicForked from hahnyuan/RPTQ4LLM
Reorder-based post-training quantization for large language model
Python
-
smoothquant
smoothquant PublicForked from mit-han-lab/smoothquant
[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
Python
-
gptq
gptq PublicForked from IST-DASLab/gptq
Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".
Python
-
bit
bit PublicForked from facebookresearch/bit
Code repo for the paper BiT Robustly Binarized Multi-distilled Transformer
Python
If the problem persists, check the GitHub status page or contact support.