Ethan (Yusheng) Su yushengsu-thu

From the open-source community, to the open-source community

174 followers · 61 following

SGLang | AMD | Tsinghua University
California, USA
https://yushengsu-thu.github.io/
@thu_yushengsu

Achievements

x2 x3

Lists (3)

Stars

xlite-dev / LeetCUDA

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉

Cuda 9,654 956 Updated Feb 13, 2026

anthropics / claudes-c-compiler

Claude Opus 4.6 wrote a dependency-free C compiler in Rust, with backends targeting x86 (64- and 32-bit), ARM, and RISC-V, capable of compiling a booting Linux kernel.

Rust 2,114 128 Updated Feb 5, 2026

VikParuchuri / triton_tutorial

Tutorials for Triton, a language for writing gpu kernels

Jupyter Notebook 73 8 Updated Aug 23, 2023

yushengsu-thu / Megatron-Bridge

Forked from NVIDIA-NeMo/Megatron-Bridge

HuggingFace conversion and training library for Megatron-based models

Python 1 Updated Jan 10, 2026

NVIDIA-NeMo / Megatron-Bridge

Training library for Megatron-based models with bidirectional Hugging Face conversion capability

Python 427 172 Updated Feb 13, 2026

meta-pytorch / torchforge

PyTorch-native post-training at scale

Python 619 88 Updated Feb 12, 2026

sgl-project / mini-sglang

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 3,468 432 Updated Feb 11, 2026

dsl-learn / triton-tutorial

Getting Started with Triton: A Tutorial for Python Beginners

HTML 35 2 Updated Oct 21, 2025

GMISWE / tinker-cloud

Python 22 3 Updated Feb 11, 2026

AI-Hypercomputer / maxtext

A simple, performant and scalable Jax LLM!

Python 2,140 467 Updated Feb 13, 2026

microsoft / agent-lightning

The absolute trainer to light up AI agents.

Python 14,656 1,249 Updated Feb 11, 2026

Orchestra-Research / AI-Research-SKILLs

Comprehensive open-source library of AI research and engineering skills for any AI model. Package the skills and your claude code/codex/gemini agent will be an AI research agent with full horsepowe…

TeX 3,125 260 Updated Feb 9, 2026

radixark / miles

Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.

Python 875 108 Updated Feb 13, 2026

RLsys-Foundation / TritonForge

🔥 LLM-powered GPU kernel synthesis: Train models to convert PyTorch ops into optimized Triton kernels via SFT+RL. Multi-turn compilation feedback, cross-platform NVIDIA/AMD, Kernelbook + KernelBench

Python 120 2 Updated Nov 10, 2025