Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View zhaoxu98's full-sized avatar

Highlights

  • Pro

Block or report zhaoxu98

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

An interface library for RL post training with environments.

Python 852 136 Updated Dec 19, 2025

🌎💪 BrowserGym, a Gym environment for web task automation

Python 1,044 143 Updated Dec 16, 2025

🔥 Stay motivated and show off your contribution streak! 🌟 Display your total contributions, current streak, and longest streak on your GitHub profile README

PHP 6,344 1,175 Updated Oct 24, 2025

A note taking application that is good both for outlining and long-form writing.

179 3 Updated Dec 12, 2025

Native Multimodal Models are World Learners

Python 1,367 52 Updated Nov 28, 2025

Start your own digital garden using this Jekyll template 🌱

HTML 1,208 850 Updated Dec 1, 2025

This is the official GitHub repository for our survey paper "Beyond Single-Turn: A Survey on Multi-Turn Interactions with Large Language Models".

156 2 Updated May 14, 2025

🐹 Deep clean and optimize your Mac.

Shell 11,528 381 Updated Dec 20, 2025

Automatic Video Generation from Scientific Papers

Python 2,005 297 Updated Oct 20, 2025

MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal search tools.

Python 368 18 Updated Aug 26, 2025

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 3,677 309 Updated Nov 13, 2025

[NeurIPS2025] MM-Agent: LLM as Agents for Real-world Mathematical Modeling Problem

Python 96 14 Updated Dec 9, 2025

The official GitHub repo for the survey paper "A Survey on Diffusion Language Models".

587 27 Updated Dec 16, 2025

A collection of resources and papers on Diffusion Models

HTML 12,207 1,011 Updated Aug 1, 2024

dLLM: Simple Diffusion Language Modeling

Python 1,476 149 Updated Dec 19, 2025

[R]einforcement [L]earning from [M]odel-rewarded [T]hinking - code for the paper "Language Models That Think, Chat Better"

Python 122 6 Updated Oct 27, 2025

MCP-Universe is a comprehensive framework designed for developing, testing, and benchmarking AI agents

Python 527 64 Updated Dec 9, 2025

MCP-Zero: Active Tool Discovery for Autonomous LLM Agents

Python 423 46 Updated Jul 2, 2025

Audio Normalization for Python/ffmpeg

HTML 1,458 125 Updated Nov 9, 2025

A benchmark for LLMs on complicated tasks in the terminal

Python 1,237 439 Updated Dec 20, 2025

τ²-Bench: Evaluating Conversational Agents in a Dual-Control Environment

Python 553 121 Updated Dec 18, 2025

Zero Academic Homepage is a clean, modern and responsive theme for academic personal websites.

CSS 37 4 Updated Jun 6, 2025

AI+ Lab, HKUST(Guangzhou)

HTML 1 Updated Dec 17, 2025

A project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/

JavaScript 4,263 879 Updated Sep 4, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 21,831 3,815 Updated Dec 21, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,650 2,859 Updated Dec 21, 2025

本仓库包含对 Claude Code v1.0.33 进行逆向工程的完整研究和分析资料。包括对混淆源代码的深度技术分析、系统架构文档,以及重构 Claude Code agent 系统的实现蓝图。主要发现包括实时 Steering 机制、多 Agent 架构、智能上下文管理和工具执行管道。该项目为理解现代 AI agent 系统设计和实现提供技术参考。

JavaScript 11,685 3,036 Updated Jul 19, 2025

🔥 Comprehensive survey on Context Engineering: from prompt engineering to production-grade AI systems. hundreds of papers, frameworks, and implementation guides for LLMs and AI agents.

2,748 182 Updated Aug 5, 2025

A repo lists papers related to LLM based agent

Python 2,157 133 Updated Jul 12, 2025
Next