Stars
Efficient implementations of state-of-the-art linear attention models
Efficient implementations of Native Sparse Attention
Fast and memory-efficient exact attention
Witness the aha moment of VLM with less than $3.
An open-source AI agent that brings the power of Gemini directly into your terminal.
Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…
Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual information for complex reasoning, planning, and generation.
The official repo of the paper "MMLongBench: Benchmarking Long-Context Vision-Language Models Effectively and Thoroughly"
SGLang is a high-performance serving framework for large language models and multimodal models.
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
OpenHands: AI-Driven Development
Recipes to train the self-rewarding reasoning LLMs.
Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.
Agent S: an open agentic framework that uses computers like a human
My learning notes for ML SYS.
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
Reading notes about Multimodal Large Language Models, Large Language Models, and Diffusion Models
Skywork-R1V is an advanced multimodal AI model series developed by Skywork AI, specializing in vision-language reasoning.
Train transformer language models with reinforcement learning.
RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.
Make websites accessible for AI agents. Automate tasks online with ease.
Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.
Must-read papers and blogs on LLM-based Long Context Modeling
This repository provides a valuable reference for researchers in the field of multimodality. Start your exploration of RL-based reasoning MLLMs here!
No fortress, purely open ground. OpenManus is Coming.
This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."