Zhejiang University
- Shanghai China
Stars
🌴 ARES is an open-source framework for adaptive multimodal reasoning, featuring a two-stage pipeline—Adaptive Cold-Start and Entropy-Shaped Policy Optimization—to balance reasoning depth and effici…
MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal search tools.
Qwen3-VL is the multimodal large language model series developed by the Qwen team, Alibaba Cloud.
Reading notes about Multimodal Large Language Models, Large Language Models, and Diffusion Models
Official Code for "Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search"
Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…
Tongyi Deep Research, the Leading Open-source Deep Research Agent
Scalable RL solution for advanced reasoning of language models
A curated collection of fun and creative examples generated with Nano Banana🍌, Gemini-2.5-flash-image based model. We also release Nano-consistent-150K openly to support the community's development…
A version of verl to support diverse tool use
A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.
SDAR (Synergy of Diffusion and AutoRegression), a large diffusion language model (1.7B, 4B, 8B, 30B)
Official repo of paper "Reconstruction Alignment Improves Unified Multimodal Models". Unlocking the Massive Zero-shot Potential in Unified Multimodal Models through Self-supervised Learning.
HunyuanImage-2.1: An Efficient Diffusion Model for High-Resolution (2K) Text-to-Image Generation
An early exploration that introduces Interleaving Reasoning to the text-to-image generation field and achieves SoTA benchmark performance. It also significantly improves the quality, fine-grain…
Democratizing Reinforcement Learning for LLMs
🚀 MassGen: An Open-Source Multi-Agent Scaling System for Collaborative AI with the Goal of Continuous Self-Improvement. Featuring parallel agent orchestration across frontier open and closed weight…
Interleaving Reasoning: Next-Generation Reasoning Systems for AGI
Code for the paper "VTool-R1: VLMs Learn to Think with Images via Reinforcement Learning on Multimodal Tool Use"
An official implementation of DanceGRPO: Unleashing GRPO on Visual Generation
The official repository of the dots.vlm1 instruct models proposed by rednote-hilab.
[NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
SkyRL: A Modular Full-stack RL Library for LLMs