Stars
Algorithm powering the For You feed on X
Source code for the X Recommendation Algorithm
Pocket Flow: 100-line LLM framework. Let Agents build Agents!
Introduction to Machine Learning Systems
Domain-specific language designed to streamline the development of high-performance GPU/CPU/accelerator kernels
Accelerating MoE with IO and Tile-aware Optimizations
An extremely fast Python type checker and language server, written in Rust.
An extremely fast Python linter and code formatter, written in Rust.
An extremely fast Python package and project manager, written in Rust.
The simplest, fastest repository for training/finetuning medium-sized GPTs.
A toolchain for web projects that aims to provide the functionality needed to maintain them. Biome offers a formatter and a linter, usable via CLI and LSP.
Virtual whiteboard for sketching diagrams with a hand-drawn look
🚀🚀 Large language model: train a 26M-parameter GPT entirely from scratch in just 2 hours!
Fast and memory-efficient exact attention
Genai-bench is a powerful benchmark tool designed for comprehensive token-level performance evaluation of large language model (LLM) serving systems.
slime is an LLM post-training framework for RL Scaling.
A course on LLM inference serving on Apple Silicon for systems engineers: build a tiny vLLM + Qwen.
🚀 Efficient implementations of state-of-the-art linear attention models
verl: Volcano Engine Reinforcement Learning for LLMs
SGLang is a high-performance serving framework for large language models and multimodal models.
A framework for managing and maintaining multi-language pre-commit hooks.
Share terminal sessions via SVG and CSS
UI Library for Design Engineers. Animated components and effects you can copy and paste into your apps. Free. Open Source.
🤖 Powerful asynchronous state management, server-state utilities and data fetching for the web. TS/JS, React Query, Solid Query, Svelte Query and Vue Query.
Model Context Protocol Servers