Stars
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
wake word engine benchmark framework
Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch
A minimal PyTorch implementation of probabilistic diffusion models for 2D datasets.
Verilog to Routing -- Open Source CAD Flow for FPGA Research
Open-source implementation of AlphaEvolve
Accessible large language models via k-bit quantization for PyTorch.
Minimal reproduction of DeepSeek R1-Zero
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.
An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.
Code for studying the super weight in LLM
galatolofederico / vanilla-llama
Forked from meta-llama/llamaPlain pytorch implementation of LLaMA
Implementation of normalizing flows from 1d to Nd
An introduction to ARM64 assembly on Apple Silicon Macs
📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉
A minimalistic full working bitcoin miner implemented in python.
GPU programming related news and material links
FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.
A Free and Open Source Python Library for Multiobjective Optimization