凪 CSfufu

[See you at the next intersection. Pleased to meet you.]

46 followers · 38 following

Zhejiang University
Shanghai China
08:15 (UTC +08:00)

Lists (2)

efficient reasoning

2 repositories

🚀 My stack

2 repositories

Stars

YsTvT / Awesome-Agentic-RL-Papers

69 stars · 5 forks · Updated Oct 22, 2025

shawn0728 / ARES

🌴 ARES is an open-source framework for adaptive multimodal reasoning, featuring a two-stage pipeline—Adaptive Cold-Start and Entropy-Shaped Policy Optimization—to balance reasoning depth and effici…

Python · 9 stars · Updated Oct 13, 2025

EvolvingLMMs-Lab / multimodal-search-r1

MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal search tools.

Python · 336 stars · 17 forks · Updated Aug 26, 2025

QwenLM / Qwen3-VL

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook · 15,382 stars · 1,198 forks · Updated Oct 22, 2025

axon-rl / gem

A Gym for Agentic LLMs

Python · 332 stars · 13 forks · Updated Oct 22, 2025

yfzhang114 / Awesome-Multimodal-Large-Language-Models

Reading notes about Multimodal Large Language Models, Large Language Models, and Diffusion Models

683 stars · 26 forks · Updated Sep 13, 2025

Mini-o3 / Mini-o3

Official Code for "Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search"

Python · 343 stars · 15 forks · Updated Sep 15, 2025

bytedance / UI-TARS

Python · 7,987 stars · 560 forks · Updated Oct 23, 2025

anthropics / claude-code

Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…

TypeScript · 40,134 stars · 2,580 forks · Updated Oct 21, 2025

Alibaba-NLP / DeepResearch

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python · 16,337 stars · 1,229 forks · Updated Oct 18, 2025

PRIME-RL / PRIME

Scalable RL solution for advanced reasoning of language models

Python · 1,754 stars · 99 forks · Updated Mar 18, 2025

ChenxinAn-fdu / POLARIS

Scaling RL on advanced reasoning models

Python · 621 stars · 39 forks · Updated Oct 20, 2025

PicoTrex / Awesome-Nano-Banana-images

A curated collection of fun and creative examples generated with Nano Banana🍌, Gemini-2.5-flash-image based model. We also release Nano-consistent-150K openly to support the community's development…

14,919 stars · 1,559 forks · Updated Sep 24, 2025

TIGER-AI-Lab / verl-tool

A version of verl to support diverse tool use

Python · 624 stars · 44 forks · Updated Oct 23, 2025

BytedTsinghua-SIA / MemAgent

A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.

Python · 748 stars · 55 forks · Updated Jul 31, 2025

JetAstra / SDAR

SDAR (Synergy of Diffusion and AutoRegression), a large diffusion language model (1.7B, 4B, 8B, 30B)

Python · 251 stars · 11 forks · Updated Oct 19, 2025

HorizonWind2004 / reconstruction-alignment

Official repo of paper "Reconstruction Alignment Improves Unified Multimodal Models". Unlocking the Massive Zero-shot Potential in Unified Multimodal Models through Self-supervised Learning.

Python · 290 stars · 10 forks · Updated Oct 16, 2025

Tencent-Hunyuan / HunyuanImage-2.1

HunyuanImage-2.1: An Efficient Diffusion Model for High-Resolution (2K) Text-to-Image Generation

Python · 650 stars · 48 forks · Updated Oct 14, 2025

Osilly / Interleaving-Reasoning-Generation

This is an early exploration to introduce Interleaving Reasoning to Text-to-image Generation field and achieve the SoTA benchmark performance. It also significantly improves the quality, fine-grain…

Python · 64 stars · Updated Sep 14, 2025

rllm-org / rllm

Democratizing Reinforcement Learning for LLMs

Jupyter Notebook · 4,554 stars · 425 forks · Updated Oct 23, 2025

Leezekun / MassGen

🚀 MassGen: An Open-Source Multi-Agent Scaling System for Collaborative AI with the Goal of Continuous Self-Improvement. Featuring parallel agent orchestration across frontier open and closed weight…

Python · 570 stars · 83 forks · Updated Oct 23, 2025