vLLM · San Francisco Bay Area
https://zhuohan.li · @zhuohan123 · in/zhuohan-li
Stars
TPU inference for vLLM, with unified JAX and PyTorch support.
SkyRL: A Modular Full-stack RL Library for LLMs
Checkpoint-engine is a simple middleware to update model weights in LLM inference engines
This repo hosts code for vLLM CI & Performance Benchmark infrastructure.
🤖 A WeChat bot built on WeChaty with AI services such as DeepSeek / ChatGPT / Kimi / iFlytek; it can auto-reply to WeChat messages, manage WeChat groups and friends, detect zombie followers, and more...
Renderer for the harmony response format to be used with gpt-oss
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
Mirage Persistent Kernel: Compiling LLMs into a MegaKernel
From Images to High-Fidelity 3D Assets with Production-Ready PBR Material
A PyTorch native platform for training generative AI models
A domain-specific language designed to streamline the development of high-performance GPU/CPU/accelerator kernels
A program to read, merge, and write programs for the Breville Control °Freak®
Tensors and Dynamic neural networks in Python with strong GPU acceleration
The Startup CTO's Handbook, a book covering leadership, management and technical topics for leaders of software engineering teams
A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
The best OSS video generation models, created by Genmo
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
A throughput-oriented high-performance serving framework for LLMs
Dynamic Memory Management for Serving LLMs without PagedAttention
A framework for few-shot evaluation of language models.
A fast communication-overlapping library for tensor/expert parallelism on GPUs.