- China
-
08:16
(UTC -12:00)
Stars
A Survey of Reinforcement Learning for Large Reasoning Models
MiroMind Research Agent: Fully Open-Source Deep Research Agent with Reproducible State-of-the-Art Performance on FutureX, GAIA, HLE, BrowserComp and xBench.
香蕉超市|各种玩法一键生成,无需提示词,支持局部涂选、连续编辑
AgentScope: Agent-Oriented Programming for Building LLM Applications
Recursive-Open-Meta-Agent v0.1 (Beta). A meta-agent framework to build high-performance multi-agent systems.
A simple yet powerful agent framework that delivers with open-source models
Letta is the platform for building stateful agents: open AI with advanced memory that can learn and self-improve over time.
Official Code of Memento: Fine-tuning LLM Agents without Fine-tuning LLMs
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
slime is an LLM post-training framework for RL Scaling.
Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!
In-depth tutorials on LLMs, RAGs and real-world AI agent applications.
Multi-agent framework, runtime and control plane. Built for speed, privacy, and scale.
Free Bootstrap 5 based admin dashboard template - open source and licensed under MIT license
[NeurIPS 2025] Thinkless: LLM Learns When to Think
Learning To Parse Excel Tables, Generate Docx Files And Convert To PDF Files
Dedoc is a library (service) for automate documents parsing and bringing to a uniform format. It automatically extracts content, logical structure, tables, and meta information from textual electro…
这是一个简单的技术科普教程项目,主要聚焦于解释一些有趣的,前沿的技术概念和原理。每篇文章都力求在 5 分钟内阅读完成。
Video-based AI memory library. Store millions of text chunks in MP4 files with lightning-fast semantic search. No database needed.
A customer segmentation project can be approached in multiple ways. In this repository, we will explore advanced techniques for defining clusters and analyzing the results.
An LLM agent that conducts deep research (local and web) on any given topic and generates a long report with citations.
Scaling Deep Research via Reinforcement Learning in Real-world Environments.