Thanks to visit codestin.com
Credit goes to github.com

wyxscir

Follow

🍒

wyxscir

🍒

Follow

[email protected]

11 followers · 49 following

beijing

Lists (4)

Sort

efficient

14 repositories

largemodel

65 repositories

papercode

tools

Stars

0russwest0 / Agent-R1

Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning

Python 1,069 76 Updated Nov 25, 2025

ByteDance-Seed / Seed-1.8

Jupyter Notebook 171 2 Updated Dec 19, 2025

showlab / Awesome-GUI-Agent

💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.

1,027 57 Updated Aug 17, 2025

bytedance / UI-TARS-desktop

The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra

TypeScript 20,097 1,918 Updated Dec 15, 2025

kortix-ai / suna

Kortix – build, manage and train AI Agents.

TypeScript 18,867 3,249 Updated Dec 25, 2025

hkust-nlp / Toolathlon

The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

Python 192 15 Updated Dec 24, 2025

sierra-research / tau2-bench

τ²-Bench: Evaluating Conversational Agents in a Dual-Control Environment

Python 565 125 Updated Dec 18, 2025

sierra-research / tau-bench

Code and Data for Tau-Bench

Python 1,028 164 Updated Aug 28, 2025

anthropics / skills

Public repository for Agent Skills

Python 26,710 2,463 Updated Dec 20, 2025

freedomkk-qfeng / DeepSeek-ReAct-Native-example

A Python example project showcasing the capabilities of **DeepSeek-V3.2** models combining "Thinking Mode" (Reasoning) with **Tool Calling**.

Python 19 1 Updated Dec 5, 2025

jina-ai / reader

Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/

TypeScript 9,543 739 Updated May 8, 2025

FoundationAgents / AutoEnv

Scaling Agentic Environments Automatically.

Python 40 2 Updated Dec 5, 2025

deepseek-ai / DeepSeek-Math-V2

Python 1,497 122 Updated Dec 1, 2025

Infrasys-AI / AIInfra

AIInfra（AI 基础设施）指AI系统从底层芯片等硬件，到上层软件栈支持AI大模型训练和推理。

Jupyter Notebook 5,517 764 Updated Dec 22, 2025

deepseek-ai / LPLB

An early research stage expert-parallel load balancer for MoE models based on linear programming.

Python 475 27 Updated Nov 19, 2025

prnake / kimi-deepresearch

Kimi K2 Thinking Agentic Search Unofficial Implementation

HTML 10 Updated Nov 9, 2025

TsinghuaC3I / Awesome-Memory-for-Agents

A Collection of Papers about Memory for Language Agents

215 8 Updated Dec 16, 2025

WooooDyy / AgentGym

Code and implementations for the ACL 2025 paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi et al.

Python 673 98 Updated Sep 11, 2025

rllm-org / rllm

Democratizing Reinforcement Learning for LLMs

Python 4,907 469 Updated Dec 24, 2025

sunblaze-ucb / rl-grok-recipe

Code repository for "RL Grokking Recipe: How RL Unlocks and Transfers New Algorithms in LLMs""

Python 23 Updated Oct 12, 2025

RUC-NLPIR / ARPO

The official code of ARPO & AEPO

Python 832 38 Updated Dec 20, 2025

inclusionAI / ASearcher

An Open-Source Large-Scale Reinforcement Learning Project for Search Agents

Python 517 34 Updated Nov 26, 2025

langflow-ai / langflow

Langflow is a powerful tool for building and deploying AI-powered agents and workflows.

Python 141,952 8,209 Updated Dec 25, 2025

langfengQ / verl-agent

verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"

Python 1,314 117 Updated Dec 11, 2025

noahshinn / reflexion

[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning

Python 3,000 290 Updated Jan 14, 2025

NousResearch / Hermes-Function-Calling

Jupyter Notebook 1,153 143 Updated Dec 22, 2025

MiniMax-AI / MiniMax-M2

MiniMax-M2, a model built for Max coding & agentic workflows.

2,114 162 Updated Nov 13, 2025

TencentYoutuResearch / APTBench

Code for "APTBench: Benchmarking Agentic Potential of Base LLMs During Pre-Training"

Python 28 2 Updated Dec 23, 2025

RUC-NLPIR / DeepAgent

🛠️ DeepAgent: A General Reasoning Agent with Scalable Toolsets

Python 877 111 Updated Nov 2, 2025

wdndev / llm_interview_note

主要记录大语言大模型（LLMs）算法（应用）工程师相关的知识及面试题

HTML 11,478 1,157 Updated Apr 30, 2025