Necolizer

Necolizer

SYSU | Alibaba QuarkLLM | Moonshot AI | Working on RL for agents

25 followers · 19 following

Sun Yat-sen University
China
03:42 (UTC +08:00)
https://necolizer.github.io/
https://orcid.org/0000-0001-6644-4075
https://scholar.google.com/citations?user=fxBaCW8AAAAJ

Achievements

Highlights

Lists (2)

Sort

🎮 RL for Agents

12 repositories

🧰 tools

6 repositories

Stars

aiming-lab / Agent0

Agent0 Series: Self-Evolving Agents from Zero Data

Python 894 100 Updated Dec 21, 2025

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & TIS & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,634 838 Updated Dec 18, 2025

Alibaba-NLP / DeepResearch

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 17,675 1,356 Updated Dec 17, 2025

Alibaba-Quark / SSP

Search Self-Play: Pushing the Frontier of Agent Capability without Supervision

Python 76 5 Updated Nov 13, 2025

THUDM / slime

slime is an LLM post-training framework for RL Scaling.

Python 2,925 353 Updated Dec 21, 2025

DaniellaHe / MedSoft_Diffusion

MedSoft-Diffusion was early accepted to MICCAI 2025 (top 9%, scores: 5/4/4).

Python 41 Updated Mar 1, 2025

lobehub / lobe-icons

🥨 Lobe Icons - Brings AI/LLM brand logos to your React & React Native apps — static SVG/PNG/WebP, no dependencies.

TypeScript 1,311 130 Updated Dec 20, 2025

qwqqaqqwq00 / siren-music-player-vsc-ext

TypeScript 1 Updated May 29, 2025

MathFoundationRL / Book-Mathematical-Foundation-of-Reinforcement-Learning

This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."

MATLAB 13,897 1,303 Updated Oct 28, 2025

GAIR-NLP / DeepResearcher

Scaling Deep Research via Reinforcement Learning in Real-world Environments.

Python 675 46 Updated Oct 15, 2025

openai / simple-evals

Python 4,242 458 Updated Jul 31, 2025

PeterGriffinJin / Search-R1

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 3,679 309 Updated Nov 13, 2025

Alibaba-NLP / ZeroSearch

ZeroSearch: Incentivize the Search Capability of LLMs without Searching

Python 1,213 112 Updated Aug 16, 2025

NovaSky-AI / SkyRL

SkyRL: A Modular Full-stack RL Library for LLMs

Python 1,394 204 Updated Dec 20, 2025

0russwest0 / Agent-R1

Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning

Python 1,049 75 Updated Nov 25, 2025

mll-lab-nu / VAGEN

Training VLM agents with multi-turn reinforcement learning

Python 349 42 Updated Dec 1, 2025

arxanas / git-branchless

High-velocity, monorepo-scale workflow for Git

Rust 3,950 101 Updated Nov 24, 2025

megvii-research / megfile

Megvii FILE Library - Working with Files in Python same as the standard library

Python 164 18 Updated Dec 17, 2025

TepLabCode / Tepkit

A Python package with CLI designed to accelerate the calculation and analysis of materials’︁ transport and thermoelectric properties

Python 2 Updated Oct 30, 2025

MoonshotAI / Kimi-VL

Kimi-VL: Mixture-of-Experts Vision-Language Model for Multimodal Reasoning, Long-Context Understanding, and Strong Agent Capabilities

1,132 66 Updated Jul 15, 2025

Necolizer / awesome-rl-for-agents

A curated list of reinforcement learning (RL) for agents.

55 1 Updated Dec 19, 2025

xlang-ai / computer-agent-arena

Computer Agent Arena: Test & compare AI agents in real desktop apps & web environments. Code/data coming soon!

51 4 Updated Apr 7, 2025

0russwest0 / Awesome-Agent-RL

452 18 Updated Oct 11, 2025

lastmile-ai / mcp-agent

Build effective agents using Model Context Protocol and simple workflow patterns

Python 7,874 792 Updated Dec 13, 2025

YoujunZhao / HCMA

Hierarchical Cross-Modal Alignment for Open-Vocabulary 3D Object Detection (AAAI 2025)

6 1 Updated Nov 8, 2025

Open-Reasoner-Zero / Open-Reasoner-Zero

Official Repo for Open-Reasoner-Zero

Python 2,084 119 Updated Jun 2, 2025

mll-lab-nu / RAGEN

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Jupyter Notebook 2,447 194 Updated Dec 3, 2025

FoundationAgents / OpenManus

No fortress, purely open ground. OpenManus is Coming.

Python 51,390 8,967 Updated Nov 17, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,663 2,861 Updated Dec 21, 2025

bytedance / UI-TARS

Python 8,615 608 Updated Nov 12, 2025