https://shuaills.github.io/
Stars
Zero-Config Code Flow for Claude code & Codex
Build a Claude Code–like CLI coding agent from scratch.
🔥 LLM-powered GPU kernel synthesis: Train models to convert PyTorch ops into optimized Triton kernels via SFT+RL. Multi-turn compilation feedback, cross-platform NVIDIA/AMD, Kernelbook + KernelBench
An AI model aggregation, management, and relay distribution system that converts multiple large models into a unified calling format; supports OpenAI, Claude, Gemini, and other API formats, and can be used by individuals or enterprises for internal management and channel distribution. 🍥 The next-generation LLM gateway and AI asset management system, with multi-language support.
Step-Audio 2 is an end-to-end multi-modal large language model designed for industry-grade audio understanding and speech conversation.
slime is an LLM post-training framework for RL Scaling.
gpt-oss-120b and gpt-oss-20b are two open-weight language models from OpenAI.
An open-source AI agent that brings the power of Gemini directly into your terminal.
3x faster inference; an unofficial implementation of EAGLE speculative decoding.
Train speculative decoding models effortlessly and port them smoothly to SGLang serving.
A fast communication-overlapping library for tensor/expert parallelism on GPUs.
The official Python SDK for Model Context Protocol servers and clients
Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.
A unified library of state-of-the-art model optimization techniques, including quantization, pruning, distillation, and speculative decoding. It compresses deep learning models for downstream deployment…
Ongoing research training transformer models at scale
Fast, Flexible and Portable Structured Generation
A bidirectional pipeline parallelism algorithm for computation-communication overlap in DeepSeek V3/R1 training.
My learning notes/codes for ML SYS.
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
FlashInfer: Kernel Library for LLM Serving
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models across text, vision, audio, and multimodal tasks, for both inference and training.
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Making large AI models cheaper, faster and more accessible
Open deep learning compiler stack for cpu, gpu and specialized accelerators
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs.