Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View vwxyzjn's full-sized avatar
😃
😃

Block or report vwxyzjn

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 610 56 Updated Dec 19, 2025

Super fast serving stack for LLM on Windows/Linux/Macos

Cuda 9 1 Updated Dec 17, 2025

A single-file educational implementation for understanding vLLM's core concepts and running LLM inference.

Python 33 4 Updated Jun 22, 2025
HTML 4 Updated Nov 18, 2025

wheels for TransformerEngine

Python 5 2 Updated Nov 27, 2025

Achieve state of the art inference performance with modern accelerators on Kubernetes

Shell 2,199 267 Updated Dec 19, 2025

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 4,451 475 Updated Dec 19, 2025

Scalable toolkit for efficient model reinforcement

Python 1,147 199 Updated Dec 19, 2025
Python 26 1 Updated Jul 31, 2025

A simple Python sandbox for helpful LLM data agents

Python 299 50 Updated Jun 18, 2024

Async RL Training at Scale

Python 948 162 Updated Dec 19, 2025

🚴 Call stack profiler for Python. Shows you why your code is slow!

Python 7,539 256 Updated Nov 17, 2025

Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"

Jupyter Notebook 567 54 Updated Oct 7, 2025

APPS: Automated Programming Progress Standard (NeurIPS 2021)

Python 497 67 Updated Jun 19, 2024

Realtime log viewer for containers. Supports Docker, Swarm and K8s.

Go 10,558 456 Updated Dec 19, 2025
Python 86 8 Updated Nov 11, 2025

A lightweight data processing framework built on DuckDB and 3FS.

Python 4,871 430 Updated Mar 5, 2025

Toolkit for linearizing PDFs for LLM datasets/training

Python 16,226 1,254 Updated Dec 19, 2025

Fast reinforcement learning 💨

Cython 28 1 Updated Jul 15, 2025

A FlashAttention implementation for JAX with support for efficient document mask computation and context parallelism.

Python 152 10 Updated Nov 11, 2025

Our library for RL environments + evals

Python 3,646 454 Updated Dec 19, 2025

A PyTorch native platform for training generative AI models

Python 4,856 644 Updated Dec 19, 2025

nanoGRPO is a lightweight implementation of Group Relative Policy Optimization (GRPO)

Python 136 9 Updated May 8, 2025

JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs welcome).

Python 396 59 Updated Jun 10, 2025

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 11,926 918 Updated Dec 15, 2025

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,948 288 Updated May 15, 2025

Hardware-Accelerated Reinforcement Learning Algorithms in pure Jax!

Python 251 19 Updated Oct 31, 2025

[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Python 1,278 106 Updated Dec 15, 2025

🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!

Python 35,844 4,232 Updated Dec 14, 2025
Next