Qubitium

🙌

....

Qubitium-ModelCloud Qubitium

🙌

....

Golang, Python, Kotlin. GPTQModel maintainer and OSS contributor to SGLang, vLLM, and others. @ModelCloudAi founder

87 followers · 93 following

ModelCloud.ai
Earth/Epoch 2.0
https://modelcloud.ai
@qubitium

Achievements

x4 x3 x3

Achievements

x4 x3 x3

flash-attention Public
Forked from Dao-AILab/flash-attention

Fast and memory-efficient exact attention

Python BSD 3-Clause "New" or "Revised" License Updated Dec 17, 2025
transformers Public
Forked from huggingface/transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python Apache License 2.0 Updated Dec 12, 2025
huggingface_hub Public
Forked from huggingface/huggingface_hub

The official Python client for the Hugging Face Hub.

Python Apache License 2.0 Updated Nov 12, 2025
flash-linear-attention Public
Forked from fla-org/flash-linear-attention

🚀 Efficient implementations of state-of-the-art linear attention models

Python MIT License Updated Nov 1, 2025
triton Public
Forked from triton-lang/triton

Development repository for the Triton language and compiler

MLIR MIT License Updated Oct 25, 2025
BitBLAS Public
Forked from microsoft/BitBLAS

BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment.

Python 1 MIT License Updated Oct 23, 2025
nanochat Public
Forked from karpathy/nanochat

The best ChatGPT that $100 can buy.

Python MIT License Updated Oct 22, 2025
accelerate Public
Forked from huggingface/accelerate

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Python Apache License 2.0 Updated Oct 14, 2025
hf_transfer Public
Forked from huggingface/hf_transfer

Rust Apache License 2.0 Updated Oct 10, 2025
xet-core Public
Forked from huggingface/xet-core

xet client tech, used in huggingface_hub

Rust Apache License 2.0 Updated Oct 3, 2025
vllm Public
Forked from vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python Apache License 2.0 Updated Oct 3, 2025
dill Public
Forked from uqfoundation/dill

serialize all of Python

Python Other Updated Oct 2, 2025
lm-evaluation-harness Public
Forked from EleutherAI/lm-evaluation-harness

A framework for few-shot evaluation of language models.

Python MIT License Updated Sep 26, 2025
h2 Public
Forked from python-hyper/h2

Pure-Python HTTP/2 protocol implementation

Python MIT License Updated Sep 20, 2025
duskpilot-c3-clone Public

Updated Sep 5, 2025
tokenizers Public
Forked from huggingface/tokenizers

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

Rust Apache License 2.0 Updated May 27, 2025
threadpoolctl Public
Forked from joblib/threadpoolctl

Python helpers to limit the number of threads used in native libraries that handle their own internal threadpool (BLAS and OpenMP implementations)

Python BSD 3-Clause "New" or "Revised" License Updated May 8, 2025
datasets Public
Forked from huggingface/datasets

🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools

Python Apache License 2.0 Updated May 3, 2025
mav Public
Forked from attentionmech/mav

model activation visualiser

Python MIT License Updated Mar 28, 2025
pytorch Public
Forked from ROCm/pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python Other Updated Mar 17, 2025
clod-code Public
Forked from qpwo/clod-code

rot13 version of claw code

Grammatical Framework Updated Mar 12, 2025
ethos-paper Public
Forked from ipolharvard/ethos-paper

Jupyter Notebook MIT License Updated Mar 8, 2025
QQQ Public
Forked from HandH1998/QQQ

QQQ is an innovative and hardware-optimized W4A8 quantization solution for LLMs.

Python Updated Feb 18, 2025
GPTQModel Public
Forked from 1096125073/GPTQModel

Production ready LLM model compression/quantization toolkit with accelerated inference support for both cpu/gpu via HF, vLLM, and SGLang.

Python Apache License 2.0 Updated Jan 20, 2025
sglang Public
Forked from sgl-project/sglang

SGLang is a structured generation language designed for large language models (LLMs). It makes your interaction with models faster and more controllable.

Python 1 Apache License 2.0 Updated Jan 3, 2025
evalplus Public
Forked from evalplus/evalplus

Rigourous evaluation of LLM-synthesized code - NeurIPS 2023 & COLM 2024

Python Apache License 2.0 Updated Dec 22, 2024
unsloth Public
Forked from unslothai/unsloth

5X faster 60% less memory QLoRA finetuning

Python Apache License 2.0 Updated Aug 30, 2024
auto-round Public
Forked from intel/auto-round

SOTA Weight-only Quantization Algorithm for LLMs

Python Apache License 2.0 Updated Jul 23, 2024
hqq Public
Forked from dropbox/hqq

Official implementation of Half-Quadratic Quantization (HQQ)

Python Apache License 2.0 Updated Jul 22, 2024
AutoGPTQ Public
Forked from AutoGPTQ/AutoGPTQ

An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.

Python 2 Apache License 2.0 Updated Jun 27, 2024

Qubitium-ModelCloud Qubitium

Achievements

Achievements

flash-attention Public

Uh oh!

transformers Public

Uh oh!

huggingface_hub Public

Uh oh!

flash-linear-attention Public

Uh oh!

triton Public

Uh oh!

BitBLAS Public

Uh oh!

nanochat Public

Uh oh!

accelerate Public

Uh oh!

hf_transfer Public

Uh oh!

xet-core Public

Uh oh!

vllm Public

Uh oh!

dill Public

Uh oh!

lm-evaluation-harness Public

Uh oh!

h2 Public

Uh oh!

duskpilot-c3-clone Public

Uh oh!

tokenizers Public

Uh oh!

threadpoolctl Public

Uh oh!

datasets Public

Uh oh!

mav Public

Uh oh!

pytorch Public

Uh oh!

clod-code Public

Uh oh!

ethos-paper Public

Uh oh!

QQQ Public

Uh oh!

GPTQModel Public

Uh oh!

sglang Public

Uh oh!

evalplus Public

Uh oh!

unsloth Public

Uh oh!

auto-round Public

Uh oh!

hqq Public

Uh oh!

AutoGPTQ Public

Uh oh!