View yuxuandexter's full-sized avatar

slime is an LLM post-training framework for RL Scaling.

Python 3,378 427 Updated Jan 18, 2026

A project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/

JavaScript 4,368 901 Updated Sep 4, 2025

VoiceNavigator: AI-Powered Speech-to-Speech Web Interaction System

Python 1 Updated Mar 11, 2025

An Open Source implementation of Notebook LM with more flexibility and features

TypeScript 18,120 1,966 Updated Jan 17, 2026

An Open Phone Agent Model & Framework. Unlocking the AI Phone for Everyone

Python 22,170 3,512 Updated Jan 5, 2026

A Survey of Reinforcement Learning for Large Reasoning Models

TeX 2,259 127 Updated Nov 9, 2025

[NeurIPS 2025] Scaling Speculative Decoding with Lookahead Reasoning

Python 59 6 Updated Oct 31, 2025

AgentFlow: In-the-Flow Agentic System Optimization

Python 1,495 190 Updated Dec 17, 2025

A god-simulation sandbox game built on Godot 4 as a multi-agent AI social simulation system. In this virtual world, AI characters possess independent thinking and memory, capable of autonomous soci…

GDScript 2,052 352 Updated Dec 26, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 67,747 12,655 Updated Jan 18, 2026

Training API and CLI

Python 316 33 Updated Jan 16, 2026

Post-training with Tinker

Python 2,743 297 Updated Jan 17, 2026
Jupyter Notebook 318 28 Updated Sep 17, 2025

Build RL environments for LLM training

Python 603 52 Updated Jan 16, 2026

[COLM 2025] Official repository for R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agents

Python 226 42 Updated Jul 13, 2025

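🥢 Cook like Laoxiangji 🐔. The main part was completed in 2024; this is not an official Laoxiangji repository. The text comes from the "Laoxiangji Dish Traceability Report" and has been summarized, edited, and organized. CookLikeHOC.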

JavaScript 22,876 2,310 Updated Oct 17, 2025

Code and implementations for the paper "AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning" by Zhiheng Xi et al.

Python 559 60 Updated Sep 11, 2025

Matrix-Game 2.0: An Open-Source, Real-Time, and Streaming Interactive World Model

Python 1,807 191 Updated Oct 4, 2025

A curated list of awesome AI tools for game developers

882 66 Updated Nov 19, 2024

Multi-Turn RL Training System with AgentTrainer for Language Model Game Reinforcement Learning

Python 58 10 Updated Dec 18, 2025

A Lightweight LLM Post-Training Library

Python 2,108 224 Updated Jan 18, 2026

🙌 OpenHands: AI-Driven Development

Python 66,728 8,291 Updated Jan 18, 2026

A list of AI autonomous agents

25,251 2,141 Updated Feb 26, 2025

OctoTools: An agentic framework with extensible tools for complex reasoning

Python 1,402 182 Updated Oct 11, 2025

Democratizing Reinforcement Learning for LLMs

Python 4,995 487 Updated Jan 18, 2026

Learn how to design systems at scale and prepare for system design interviews

39,995 4,965 Updated Dec 15, 2025

Resources related to distributed systems, system design, microservices, scalability and performance, etc

1,146 135 Updated Jan 22, 2025

SkyRL: A Modular Full-stack RL Library for LLMs

Python 1,461 222 Updated Jan 17, 2026