Stars
Baby Dragon Hatchling (BDH) – Architecture and Code
A Survey of Attributions for Large Language Models
Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels
This is a fun, new monospaced font that includes programming ligatures and is designed to enhance the modern look and feel of the Windows Terminal.
If you live in the terminal, kitty is made for you! Cross-platform, fast, feature-rich, GPU based.
Renderer for the harmony response format to be used with gpt-oss
Application that allows streaming with Inochi2D puppets
💖🧸 Self hosted, you owned Grok Companion, a container of souls of waifu, cyber livings to bring them into our worlds, wishing to achieve Neuro-sama's altitude. Capable of realtime voice chat, Minec…
A real-time motion capture system for 3D virtual character animating.
Implementation for FP8/INT8 Rollout for RL training without performence drop.
An Automatic Prompt Optimization Framework for Large Language Models
Task-Aware Agent-driven Prompt Optimization Framework
An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.
GPU & Accelerator process monitoring for AMD, Apple, Huawei, Intel, NVIDIA and Qualcomm
[EMNLP'25] s3 - ⚡ Efficient & Effective Search Agent Training via RL for RAG (RLVR for Search with Minimal Data)
Tongyi Deep Research, the Leading Open-source Deep Research Agent
ZeroSearch: Incentivize the Search Capability of LLMs without Searching
RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.
Official repository for Mi:dm 2.0, the large language model developed by KT.
Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!
LEAKED SYSTEM PROMPTS FOR CHATGPT, GEMINI, GROK, CLAUDE, PERPLEXITY, CURSOR, DEVIN, REPLIT, AND MORE! - AI SYSTEMS TRANSPARENCY FOR ALL! 👐
🔧Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning
Build resilient language agents as graphs.
Official PyTorch implementation for "Large Language Diffusion Models"
Autonomously train research-agent LLMs on custom data using reinforcement learning and self-verification.
Collection of extracted System Prompts from popular chatbots like ChatGPT, Claude & Gemini
🤗 smolagents: a barebones library for agents that think in code.