Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View wyxscir's full-sized avatar
🍒
🍒

Block or report wyxscir

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning

Python 1,069 76 Updated Nov 25, 2025
Jupyter Notebook 171 2 Updated Dec 19, 2025

💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.

1,027 57 Updated Aug 17, 2025

The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra

TypeScript 20,097 1,918 Updated Dec 15, 2025

Kortix – build, manage and train AI Agents.

TypeScript 18,867 3,249 Updated Dec 25, 2025

The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

Python 192 15 Updated Dec 24, 2025

τ²-Bench: Evaluating Conversational Agents in a Dual-Control Environment

Python 565 125 Updated Dec 18, 2025

Code and Data for Tau-Bench

Python 1,028 164 Updated Aug 28, 2025

Public repository for Agent Skills

Python 26,710 2,463 Updated Dec 20, 2025

A Python example project showcasing the capabilities of **DeepSeek-V3.2** models combining "Thinking Mode" (Reasoning) with **Tool Calling**.

Python 19 1 Updated Dec 5, 2025

Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/

TypeScript 9,543 739 Updated May 8, 2025

Scaling Agentic Environments Automatically.

Python 40 2 Updated Dec 5, 2025

AIInfra(AI 基础设施)指AI系统从底层芯片等硬件,到上层软件栈支持AI大模型训练和推理。

Jupyter Notebook 5,517 764 Updated Dec 22, 2025

An early research stage expert-parallel load balancer for MoE models based on linear programming.

Python 475 27 Updated Nov 19, 2025

Kimi K2 Thinking Agentic Search Unofficial Implementation

HTML 10 Updated Nov 9, 2025

A Collection of Papers about Memory for Language Agents

215 8 Updated Dec 16, 2025

Code and implementations for the ACL 2025 paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi et al.

Python 673 98 Updated Sep 11, 2025

Democratizing Reinforcement Learning for LLMs

Python 4,907 469 Updated Dec 24, 2025

Code repository for "RL Grokking Recipe: How RL Unlocks and Transfers New Algorithms in LLMs""

Python 23 Updated Oct 12, 2025

The official code of ARPO & AEPO

Python 832 38 Updated Dec 20, 2025

An Open-Source Large-Scale Reinforcement Learning Project for Search Agents

Python 517 34 Updated Nov 26, 2025

Langflow is a powerful tool for building and deploying AI-powered agents and workflows.

Python 141,952 8,209 Updated Dec 25, 2025

verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"

Python 1,314 117 Updated Dec 11, 2025

[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning

Python 3,000 290 Updated Jan 14, 2025
Jupyter Notebook 1,153 143 Updated Dec 22, 2025

MiniMax-M2, a model built for Max coding & agentic workflows.

2,114 162 Updated Nov 13, 2025

Code for "APTBench: Benchmarking Agentic Potential of Base LLMs During Pre-Training"

Python 28 2 Updated Dec 23, 2025

🛠️ DeepAgent: A General Reasoning Agent with Scalable Toolsets

Python 877 111 Updated Nov 2, 2025

主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题

HTML 11,478 1,157 Updated Apr 30, 2025
Next