Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View CSfufu's full-sized avatar
  • Zhejiang University
  • Shanghai China
  • 08:15 (UTC +08:00)

Highlights

  • Pro

Block or report CSfufu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

🌴 ARES is an open-source framework for adaptive multimodal reasoning, featuring a two-stage pipeline—Adaptive Cold-Start and Entropy-Shaped Policy Optimization—to balance reasoning depth and effici…

Python 9 Updated Oct 13, 2025

MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal search tools.

Python 336 17 Updated Aug 26, 2025

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 15,382 1,198 Updated Oct 22, 2025

A Gym for Agentic LLMs

Python 332 13 Updated Oct 22, 2025

Reading notes about Multimodal Large Language Models, Large Language Models, and Diffusion Models

683 26 Updated Sep 13, 2025

Official Code for "Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search"

Python 343 15 Updated Sep 15, 2025
Python 7,987 560 Updated Oct 23, 2025

Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…

TypeScript 40,134 2,580 Updated Oct 21, 2025

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 16,337 1,229 Updated Oct 18, 2025

Scalable RL solution for advanced reasoning of language models

Python 1,754 99 Updated Mar 18, 2025

Scaling RL on advanced reasoning models

Python 621 39 Updated Oct 20, 2025

A curated collection of fun and creative examples generated with Nano Banana🍌, Gemini-2.5-flash-image based model. We also release Nano-consistent-150K openly to support the community's development…

14,919 1,559 Updated Sep 24, 2025

A version of verl to support diverse tool use

Python 624 44 Updated Oct 23, 2025

A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.

Python 748 55 Updated Jul 31, 2025

SDAR (Synergy of Diffusion and AutoRegression), a large diffusion language model(1.7B, 4B, 8B, 30B)

Python 251 11 Updated Oct 19, 2025

Official repo of paper "Reconstruction Alignment Improves Unified Multimodal Models". Unlocking the Massive Zero-shot Potential in Unified Multimodal Models through Self-supervised Learning.

Python 290 10 Updated Oct 16, 2025

HunyuanImage-2.1: An Efficient Diffusion Model for High-Resolution (2K) Text-to-Image Generation​

Python 650 48 Updated Oct 14, 2025

This is an early exploration to introduce Interleaving Reasoning to Text-to-image Generation field and achieve the SoTA benchmark performance. It also significantly improves the quality, fine-grain…

Python 64 Updated Sep 14, 2025

Democratizing Reinforcement Learning for LLMs

Jupyter Notebook 4,554 425 Updated Oct 23, 2025

🚀 MassGen: An Open-Source Multi-Agent Scaling System for Collaborative AI with the Goal of Continuous Self-Improvement. Featuring parallel agent orchestration across frontier open and closed weight…

Python 570 83 Updated Oct 23, 2025

Interleaving Reasoning: Next-Generation Reasoning Systems for AGI

187 9 Updated Oct 17, 2025

Code for the paper "VTool-R1: VLMs Learn to Think with Images via Reinforcement Learning on Multimodal Tool Use"

Python 133 2 Updated Aug 10, 2025

An official implementation of DanceGRPO: Unleashing GRPO on Visual Generation

Python 1,079 50 Updated Oct 16, 2025

The official repository of the dots.vlm1 instruct models proposed by rednote-hilab.

Dockerfile 261 6 Updated Sep 26, 2025

[NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL

Python 1,476 76 Updated Oct 17, 2025

Bob 是一款 macOS 平台的翻译和 OCR 软件。

9,446 525 Updated Jan 24, 2025

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 18,925 1,867 Updated Oct 23, 2025

SkyRL: A Modular Full-stack RL Library for LLMs

Python 1,087 143 Updated Oct 23, 2025
Python 303 13 Updated May 24, 2025
Next