- Bay Area
Starred repositories
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Breakout Detection via Robust E-Statistics
Chaos is a physics-based quantum computing simulator with GPU acceleration and NumPy support. It helps researchers model quantum systems and test algorithms 🐙.
Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
Post-training with Tinker
Senpai is an automated memory sizing tool for container applications.
Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, 17+ clouds, or on-prem).
A high-performance non-blocking I/O networking framework focusing on RPC scenarios.
a unified scheduler for online and offline tasks
《动手学深度学习》:面向中文读者、能运行、可讨论。中英文版被70多个国家的500多所大学用于教学。
FUSE-based file system backed by Amazon S3
Development repository for the Triton language and compiler
verl: Volcano Engine Reinforcement Learning for LLMs
GPUd automates monitoring, diagnostics, and issue identification for GPUs
[WIP] Resources for AI engineers. Also contains supporting materials for the book AI Engineering (Chip Huyen, 2025)
🤗 smolagents: a barebones library for agents that think in code.
ApeCloud's Data Transfer Suite, written in Rust. Provides ultra-fast data replication between MySQL, PostgreSQL, Redis, MongoDB, Kafka and ClickHouse, ideal for disaster recovery (DR) and migration…
a Docker + Kubernetes network trouble-shooting swiss-army container