Lists (2)
Sort Name ascending (A-Z)
Stars
DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents
A comprehensive code domain benchmark review of LLM researches.
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现了包括增量预训练(PT)、有监督微调(SFT)、RLHF、DPO、ORPO、GRPO。
PaSa -- an advanced paper search agent powered by large language models. It can autonomously make a series of decisions, including invoking search tools, reading papers, and selecting relevant refe…
[NeurIPS 2025 D&B] 🚀 SWE-bench Goes Live!
SWE-Bench Pro: Can AI Agents Solve Long-Horizon Software Engineering Tasks?
RepoMaster: The open-source AI agent that masters GitHub. It turns any code repository into a powerful tool, achieving a new level of autonomous task-solving. An open alternative to Claude-Code.
SE-Agent is a self-evolution framework for LLM Code agents. It enables trajectory-level evolution to exchange information across reasoning paths via Revision, Recombination, and Refinement, expandi…
Socratic-Zero is a fully autonomous framework that generates high-quality training data for mathematical reasoning
📄 Awesome CV is LaTeX template for your outstanding job application
Master programming by recreating your favorite technologies from scratch.
Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL.
Awesome LLM compression research papers and tools.
Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL
Code search MCP for Claude Code. Make entire codebase the context for any coding agent.
(ACL2025 Findings) Official code for the paper "STeCa: Step-level Trajectory Calibration for LLM Agent Learning"
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
Trae Agent is an LLM-based agent for general purpose software engineering tasks.
"Context engineering is the delicate art and science of filling the context window with just the right information for the next step." — Andrej Karpathy. A frontier, first-principles handbook inspi…
A repo lists papers related to LLM based agent
BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation,…
SWE-Exp: Experience-Driven Software Issue Resolution
SWE-Debate: Competitive Multi-Agent Debate for Software Issue Resolution
🔥 Comprehensive survey on Context Engineering: from prompt engineering to production-grade AI systems. hundreds of papers, frameworks, and implementation guides for LLMs and AI agents.