-
CS@UCLA
- Los Angeles
-
18:26
(UTC -08:00) - https://zhaohsu.top/
- @BillHsu98
- in/bill-hsu
Highlights
- Pro
Lists (15)
Sort Name ascending (A-Z)
Stars
An interface library for RL post training with environments.
🌎💪 BrowserGym, a Gym environment for web task automation
🔥 Stay motivated and show off your contribution streak! 🌟 Display your total contributions, current streak, and longest streak on your GitHub profile README
A note taking application that is good both for outlining and long-form writing.
Native Multimodal Models are World Learners
Start your own digital garden using this Jekyll template 🌱
This is the official GitHub repository for our survey paper "Beyond Single-Turn: A Survey on Multi-Turn Interactions with Large Language Models".
Automatic Video Generation from Scientific Papers
MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal search tools.
Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL
[NeurIPS2025] MM-Agent: LLM as Agents for Real-world Mathematical Modeling Problem
The official GitHub repo for the survey paper "A Survey on Diffusion Language Models".
A collection of resources and papers on Diffusion Models
[R]einforcement [L]earning from [M]odel-rewarded [T]hinking - code for the paper "Language Models That Think, Chat Better"
MCP-Universe is a comprehensive framework designed for developing, testing, and benchmarking AI agents
MCP-Zero: Active Tool Discovery for Autonomous LLM Agents
A benchmark for LLMs on complicated tasks in the terminal
τ²-Bench: Evaluating Conversational Agents in a Dual-Control Environment
Zero Academic Homepage is a clean, modern and responsive theme for academic personal websites.
A project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/
SGLang is a fast serving framework for large language models and vision language models.
verl: Volcano Engine Reinforcement Learning for LLMs
本仓库包含对 Claude Code v1.0.33 进行逆向工程的完整研究和分析资料。包括对混淆源代码的深度技术分析、系统架构文档,以及重构 Claude Code agent 系统的实现蓝图。主要发现包括实时 Steering 机制、多 Agent 架构、智能上下文管理和工具执行管道。该项目为理解现代 AI agent 系统设计和实现提供技术参考。
🔥 Comprehensive survey on Context Engineering: from prompt engineering to production-grade AI systems. hundreds of papers, frameworks, and implementation guides for LLMs and AI agents.
A repo lists papers related to LLM based agent