😆
Ph.D. Candidate@PennState
-
Pennsylvania State University
- State College, PA
-
20:28
(UTC -05:00) - https://shuzhao.me/
- in/shu-zhao-532ab8250
Stars
An extensible RL framework for training LLM agents with advanced search capabilities, built on VERL and supporting state-of-the-art search strategies.
AgentFlow: In-the-Flow Agentic System Optimization
A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.
Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL
verl: Volcano Engine Reinforcement Learning for LLMs