Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View Necolizer's full-sized avatar

Highlights

  • Pro

Block or report Necolizer

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Agent0 Series: Self-Evolving Agents from Zero Data

Python 894 100 Updated Dec 21, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & TIS & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,634 838 Updated Dec 18, 2025

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 17,675 1,356 Updated Dec 17, 2025

Search Self-Play: Pushing the Frontier of Agent Capability without Supervision

Python 76 5 Updated Nov 13, 2025

slime is an LLM post-training framework for RL Scaling.

Python 2,925 353 Updated Dec 21, 2025

MedSoft-Diffusion was early accepted to MICCAI 2025 (top 9%, scores: 5/4/4).

Python 41 Updated Mar 1, 2025

🥨 Lobe Icons - Brings AI/LLM brand logos to your React & React Native apps — static SVG/PNG/WebP, no dependencies.

TypeScript 1,311 130 Updated Dec 20, 2025
TypeScript 1 Updated May 29, 2025

This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."

MATLAB 13,897 1,303 Updated Oct 28, 2025

Scaling Deep Research via Reinforcement Learning in Real-world Environments.

Python 675 46 Updated Oct 15, 2025
Python 4,242 458 Updated Jul 31, 2025

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 3,679 309 Updated Nov 13, 2025

ZeroSearch: Incentivize the Search Capability of LLMs without Searching

Python 1,213 112 Updated Aug 16, 2025

SkyRL: A Modular Full-stack RL Library for LLMs

Python 1,394 204 Updated Dec 20, 2025

Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning

Python 1,049 75 Updated Nov 25, 2025

Training VLM agents with multi-turn reinforcement learning

Python 349 42 Updated Dec 1, 2025

High-velocity, monorepo-scale workflow for Git

Rust 3,950 101 Updated Nov 24, 2025

Megvii FILE Library - Working with Files in Python same as the standard library

Python 164 18 Updated Dec 17, 2025

A Python package with CLI designed to accelerate the calculation and analysis of materials’︁ transport and thermoelectric properties

Python 2 Updated Oct 30, 2025

Kimi-VL: Mixture-of-Experts Vision-Language Model for Multimodal Reasoning, Long-Context Understanding, and Strong Agent Capabilities

1,132 66 Updated Jul 15, 2025

A curated list of reinforcement learning (RL) for agents.

55 1 Updated Dec 19, 2025

Computer Agent Arena: Test & compare AI agents in real desktop apps & web environments. Code/data coming soon!

51 4 Updated Apr 7, 2025

Build effective agents using Model Context Protocol and simple workflow patterns

Python 7,874 792 Updated Dec 13, 2025

Hierarchical Cross-Modal Alignment for Open-Vocabulary 3D Object Detection (AAAI 2025)

6 1 Updated Nov 8, 2025

Official Repo for Open-Reasoner-Zero

Python 2,084 119 Updated Jun 2, 2025

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Jupyter Notebook 2,447 194 Updated Dec 3, 2025

No fortress, purely open ground. OpenManus is Coming.

Python 51,390 8,967 Updated Nov 17, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,663 2,861 Updated Dec 21, 2025
Python 8,615 608 Updated Nov 12, 2025
Next