- University of California, Riverside
- Riverside, CA
- www.shixun404.com
Stars
verl: Volcano Engine Reinforcement Learning for LLMs
slime is an LLM post-training framework for RL Scaling.
Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.
Distributed Compiler based on Triton for Parallel Systems
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor… (see the Python API sketch after this list)
Applied AI experiments and examples for PyTorch
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory…
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
[RSS 2025] "ASAP: Aligning Simulation and Real-World Physics for Learning Agile Humanoid Whole-Body Skills"
Tile primitives for speedy kernels
SGLang is a fast serving framework for large language models and vision language models.
DeepEP: an efficient expert-parallel communication library
Fast and memory-efficient exact attention (see the calling sketch after this list)
FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/
High Performance Inter-Thread Messaging Library
An optimized ANS compressor for multi-byte integer data on NVIDIA GPUs.
DFloat11: Lossless LLM Compression for Efficient GPU Inference
Optimized FP16/BF16 x FP4 GPU kernels for AMD GPUs
Domain-specific language designed to streamline the development of high-performance GPU/CPU/accelerator kernels
FlashInfer: Kernel Library for LLM Serving
CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning
Development repository for the Triton language and compiler (see the kernel sketch after this list)
A PyTorch native platform for training generative AI models
Ongoing research training transformer models at scale
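
The TensorRT-LLM entry above mentions an easy-to-use Python API for defining LLMs. Below is a hedged sketch of that high-level API; the exact import path and arguments vary by release, and the model id and sampling parameters here are illustrative assumptions, not part of the starred description.

```python
# Hedged sketch of TensorRT-LLM's high-level Python API; details vary by
# release, and the model id below is an assumption for illustration.
from tensorrt_llm import LLM, SamplingParams

llm = LLM(model="TinyLlama/TinyLlama-1.1B-Chat-v1.0")  # HF model id (assumed)
params = SamplingParams(max_tokens=64, temperature=0.8)

for output in llm.generate(["Briefly explain tensor parallelism."], params):
    print(output.outputs[0].text)
```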
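The FlashAttention entry advertises fast, memory-efficient exact attention. A minimal calling sketch, assuming the flash-attn package, a CUDA device, and half-precision tensors laid out as (batch, seqlen, nheads, headdim):

```python
# Minimal sketch of calling FlashAttention's fused kernel; shapes and dtypes
# here are illustrative assumptions.
import torch
from flash_attn import flash_attn_func

q = torch.randn(2, 1024, 16, 64, device="cuda", dtype=torch.float16)
k = torch.randn(2, 1024, 16, 64, device="cuda", dtype=torch.float16)
v = torch.randn(2, 1024, 16, 64, device="cuda", dtype=torch.float16)

# Exact (not approximate) attention, computed tile-by-tile on-chip so the
# full seqlen x seqlen score matrix is never materialized in HBM.
out = flash_attn_func(q, k, v, causal=True)  # (2, 1024, 16, 64)
```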
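The Triton entry points at a Python-embedded language for writing GPU kernels. A minimal sketch of that style, a block-parallel vector addition; the kernel name and block size are arbitrary choices for illustration:

```python
# Minimal Triton kernel sketch: block-parallel vector addition.
import torch
import triton
import triton.language as tl

@triton.jit
def add_kernel(x_ptr, y_ptr, out_ptr, n_elements, BLOCK_SIZE: tl.constexpr):
    # Each program instance handles one BLOCK_SIZE-wide slice of the vectors.
    pid = tl.program_id(axis=0)
    offsets = pid * BLOCK_SIZE + tl.arange(0, BLOCK_SIZE)
    mask = offsets < n_elements  # guard the tail of the vector
    x = tl.load(x_ptr + offsets, mask=mask)
    y = tl.load(y_ptr + offsets, mask=mask)
    tl.store(out_ptr + offsets, x + y, mask=mask)

def add(x: torch.Tensor, y: torch.Tensor) -> torch.Tensor:
    out = torch.empty_like(x)
    n = x.numel()
    grid = (triton.cdiv(n, 1024),)  # one program per 1024-element block
    add_kernel[grid](x, y, out, n, BLOCK_SIZE=1024)
    return out

x = torch.randn(4096, device="cuda")
y = torch.randn(4096, device="cuda")
assert torch.allclose(add(x, y), x + y)
```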