The 100-line AI agent that solves GitHub issues or helps you in your command line. Radically simple, with no huge configs and no giant monorepo, yet it scores >70% on SWE-bench Verified!

Python 1,916 200 Updated Oct 21, 2025

google-gemini / gemini-cli

An open-source AI agent that brings the power of Gemini directly into your terminal.

TypeScript 80,141 8,812 Updated Oct 23, 2025

codefuse-ai / Awesome-Code-LLM

[TMLR] A curated list of language modeling research for code (and other software engineering activities), plus related datasets.

2,975 193 Updated Oct 15, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 14,642 2,332 Updated Oct 23, 2025

OS-Copilot / ScienceBoard

Code, benchmark and environment for "ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows"

Python 112 10 Updated Aug 28, 2025

RUCKBReasoning / OmniSQL

[VLDB '25] Synthesizing High-quality Text-to-SQL Data at Scale. SynSQL-2.5M is the first million-scale cross-domain text-to-SQL dataset.

Python 357 40 Updated Sep 8, 2025

bytebase / dbhub

Universal database MCP server connecting to MySQL, PostgreSQL, SQL Server, MariaDB.

TypeScript 1,453 129 Updated Oct 12, 2025

All-Hands-AI / OpenHands

🙌 OpenHands: Code Less, Make More

Python 64,399 7,808 Updated Oct 22, 2025

bird-bench / BIRD-Interact

[BIRD-INTERACT] Re-imagines Text-to-SQL evaluation through the lens of dynamic interactions.

Python 319 12 Updated Oct 21, 2025

bird-bench / livesqlbench

Python 109 5 Updated Oct 21, 2025

decisionintelligence / TFB

[PVLDB 2024 Best Paper Nomination] TFB: Towards Comprehensive and Fair Benchmarking of Time Series Forecasting Methods

Shell 1,049 77 Updated Oct 15, 2025

qixucen / atom

[NeurIPS 2025] Atom of Thoughts for Markov LLM Test-Time Scaling

Python 590 51 Updated Jun 16, 2025

mll-lab-nu / RAGEN

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Python 2,364 183 Updated Oct 23, 2025

microsoft / UFO

The Desktop AgentOS.

Python 7,668 934 Updated Sep 5, 2025

pingcap / tidb

TiDB - the open-source, cloud-native, distributed SQL database designed for modern applications.

Go 39,202 6,038 Updated Oct 23, 2025

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,211 804 Updated Oct 23, 2025

langgenius / dify

Production-ready platform for agentic workflow development.

TypeScript 117,085 18,088 Updated Oct 23, 2025

xlang-ai / OSWorld

[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Python 2,258 313 Updated Oct 23, 2025

Shawn Xu Tebmer

Lists (32)

Agents

Algorithm💶

BigModelTraining

ChatGPT

Code

Conversation

DB

Finetune-Lightweight

Graph

Inc

interact

IR

KGC

KGE

Knowledge Summary

llama

LLM-Benchmark

LLM+Data

llm dataset

LLM Framework

LLM Tutorial

LM-RL

mm

NLP+KG📚

Paper

PLM

Pretraining

RAG

structureddata

Tokenizer

Tools

work

Starred repositories

Tensorflow