-
Sun Yat-sen University
- China
-
03:42
(UTC +08:00) - https://necolizer.github.io/
- https://orcid.org/0000-0001-6644-4075
- https://scholar.google.com/citations?user=fxBaCW8AAAAJ
Highlights
- Pro
Lists (2)
Sort Name ascending (A-Z)
Stars
Agent0 Series: Self-Evolving Agents from Zero Data
An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & TIS & vLLM & Ray & Dynamic Sampling & Async Agentic RL)
Tongyi Deep Research, the Leading Open-source Deep Research Agent
Search Self-Play: Pushing the Frontier of Agent Capability without Supervision
slime is an LLM post-training framework for RL Scaling.
MedSoft-Diffusion was early accepted to MICCAI 2025 (top 9%, scores: 5/4/4).
🥨 Lobe Icons - Brings AI/LLM brand logos to your React & React Native apps — static SVG/PNG/WebP, no dependencies.
This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."
Scaling Deep Research via Reinforcement Learning in Real-world Environments.
Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL
ZeroSearch: Incentivize the Search Capability of LLMs without Searching
SkyRL: A Modular Full-stack RL Library for LLMs
Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning
Training VLM agents with multi-turn reinforcement learning
High-velocity, monorepo-scale workflow for Git
Megvii FILE Library - Working with Files in Python same as the standard library
A Python package with CLI designed to accelerate the calculation and analysis of materials’︁ transport and thermoelectric properties
Kimi-VL: Mixture-of-Experts Vision-Language Model for Multimodal Reasoning, Long-Context Understanding, and Strong Agent Capabilities
A curated list of reinforcement learning (RL) for agents.
Computer Agent Arena: Test & compare AI agents in real desktop apps & web environments. Code/data coming soon!
Build effective agents using Model Context Protocol and simple workflow patterns
Hierarchical Cross-Modal Alignment for Open-Vocabulary 3D Object Detection (AAAI 2025)
Official Repo for Open-Reasoner-Zero
RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.
No fortress, purely open ground. OpenManus is Coming.
verl: Volcano Engine Reinforcement Learning for LLMs