yuxuandexter

Yuxuan Zhang yuxuandexter

I am interested in AI agents.

29 followers · 25 following

UCSD
21:25 (UTC -12:00)
https://yuxuandexter.github.io

Achievements

Highlights

Stars

THUDM / slime

slime is an LLM post-training framework for RL Scaling.

Python 3,378 427 Updated Jan 18, 2026

eliahuhorwitz / Academic-project-page-template

A project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/

JavaScript 4,368 901 Updated Sep 4, 2025

yuxuandexter / VoiceNavigator

VoiceNavigator: AI-Powered Speech-to-Speech Web Interaction System

Python 1 Updated Mar 11, 2025

lfnovo / open-notebook

An Open Source implementation of Notebook LM with more flexibility and features

TypeScript 18,120 1,966 Updated Jan 17, 2026

zai-org / Open-AutoGLM

An Open Phone Agent Model & Framework. Unlocking the AI Phone for Everyone

Python 22,170 3,512 Updated Jan 5, 2026

TsinghuaC3I / Awesome-RL-for-LRMs

A Survey of Reinforcement Learning for Large Reasoning Models

TeX 2,259 127 Updated Nov 9, 2025

hao-ai-lab / LookaheadReasoning

[NeurIPS 2025] Scaling Speculative Decoding with Lookahead Reasoning

Python 59 6 Updated Oct 31, 2025

lupantech / AgentFlow

AgentFlow: In-the-Flow Agentic System Optimization

Python 1,495 190 Updated Dec 17, 2025

xhyumiracle / Awesome-AgenticLLM-RL-Papers

1,440 64 Updated Jan 17, 2026

KsanaDock / Microverse

A god-simulation sandbox game built on Godot 4 as a multi-agent AI social simulation system. In this virtual world, AI characters possess independent thinking and memory, capable of autonomous soci…

GDScript 2,052 352 Updated Dec 26, 2025

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 67,747 12,655 Updated Jan 18, 2026

thinking-machines-lab / tinker

Training API and CLI

Python 316 33 Updated Jan 16, 2026

thinking-machines-lab / tinker-cookbook

Post-training with Tinker

Python 2,743 297 Updated Jan 17, 2026

agentica-project / rllm

Jupyter Notebook 318 28 Updated Sep 17, 2025

NVIDIA-NeMo / Gym

Build RL environments for LLM training

Python 603 52 Updated Jan 16, 2026

R2E-Gym / R2E-Gym

[COLM 2025] Official repository for R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agents

Python 226 42 Updated Jul 13, 2025

Gar-b-age / CookLikeHOC

🥢像老乡鸡🐔那样做饭。主要部分于2024年完工，非老乡鸡官方仓库。文字来自《老乡鸡菜品溯源报告》，并做归纳、编辑与整理。CookLikeHOC.

JavaScript 22,876 2,310 Updated Oct 17, 2025

WooooDyy / AgentGym-RL

Code and implementations for the paper "AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning" by Zhiheng Xi et al.

Python 559 60 Updated Sep 11, 2025