gogoczh

Zihui Cheng gogoczh

0 followers · 1 following

Central South University
Central South University

Achievements

Lists (10)

Sort

Stars

Haochen-Wang409 / ross

[ICLR'25] Reconstructive Visual Instruction Tuning

Python 122 6 Updated Apr 9, 2025

Osilly / Vision-R1

This is the first paper to explore how to effectively use R1-like RL for MLLMs and introduce Vision-R1, a reasoning MLLM that leverages cold-start initialization and RL training to incentivize reas…

Python 720 19 Updated Sep 10, 2025

FYYDCC / IVT-LR

Official repository for “Reasoning in the Dark: Interleaved Vision-Text Reasoning in Latent Space”

Python 5 Updated Oct 17, 2025

gotcha / ipdb

Integration of IPython pdb

Python 1,944 150 Updated Jul 28, 2025

facebookresearch / coconut

Training Large Language Model to Reason in a Continuous Latent Space

Python 1,313 135 Updated Aug 12, 2025

DoubtedSteam / MM-GCoT

The official implement of "Grounded Chain-of-Thought for Multimodal Large Language Models"

Python 18 1 Updated Jul 21, 2025

Open-Reasoner-Zero / Open-Vision-Reasoner

[NeurIPS 2025] The official repository for our paper, "Open Vision Reasoner: Transferring Linguistic Cognitive Behavior for Visual Reasoning".

143 1 Updated Sep 12, 2025

SUSTechBruce / SRPO_MLLMs

[NeurIPS 2025🔥]Main source code of SRPO framework.

Python 176 18 Updated Sep 21, 2025

xiaomi-research / colar

[NeurIPS 2025] Think Silently, Think Fast: Dynamic Latent Compression of LLM Reasoning Chains

Python 56 4 Updated Jul 29, 2025

Gorilla-Lab-SCUT / PaDT

The official implementation of "Patch-as-Decodable-Token: Towards Unified Multi-Modal Vision Tasks in MLLMs"

Python 187 7 Updated Oct 9, 2025

cythu / PeBR-R1

Python 7 1 Updated Oct 21, 2025

lian-tian-mo-zun / Pro_Reason

ProReason: Multi-Modal Proactive Reasoning with Decoupled Eyesight and Wisdom

Python 1 Updated Mar 27, 2025

gogoczh / CoMT

code for "CoMT: A Novel Benchmark for Chain of Multi-modal Thought on Large Vision-Language Models"

Python 19 Updated Mar 10, 2025

tongjingqi / Game-RL

Game-RL: Synthesizing Multimodal Verifiable Game Data to Boost VLMs' General Reasoning

Python 103 2 Updated Oct 16, 2025

EvolvingLMMs-Lab / lmms-eval

One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

Python 3,230 405 Updated Oct 23, 2025

VincentLeebang / lvr

Official codebase for the paper Latent Visual Reasoning

Python 27 Updated Oct 22, 2025

shawnricecake / Heima

Code for Heima

Python 56 4 Updated Apr 21, 2025

AntResearchNLP / ViLaSR

Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual Drawing

Python 75 2 Updated Jul 27, 2025

InternLM / SIM-CoT

An official implementation of "SIM-CoT: Supervised Implicit Chain-of-Thought"

Python 94 3 Updated Sep 28, 2025

TIGER-AI-Lab / Pixel-Reasoner

Pixel-Level Reasoning Model trained with RL [NeuIPS25]

Python 243 9 Updated Sep 10, 2025

Hanhpt23 / OmniMod

MCOUT: Multimodal Chain of Continuous Thought for Latent Reasoning

Python 9 1 Updated Oct 4, 2025

TIGER-AI-Lab / VL-Rethinker

The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning" [NeurIPS25]

Python 161 5 Updated Jun 5, 2025

mahtabbigverdi / Aurora-perception

Python 33 1 Updated Sep 8, 2025

Visual-Agent / DeepEyes

Python 891 53 Updated Oct 20, 2025

UMass-Embodied-AGI / Mirage

Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens (arXiv 2025)

Python 185 13 Updated Aug 2, 2025

ByteDance-BandAI / LLM-I

🚀 LLM-I: Transform LLMs into natural interleaved multimodal creators! ✨ Tool-use framework supporting image search, generation, code execution & editing

Python 30 1 Updated Oct 20, 2025

zjunlp / LightThinker

[EMNLP 2025] LightThinker: Thinking Step-by-Step Compression

Python 115 5 Updated Apr 12, 2025

hiyouga / EasyR1

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 3,895 294 Updated Oct 24, 2025

MikeWangWZHL / PAPO

Official repo for "PAPO: Perception-Aware Policy Optimization for Multimodal Reasoning"

Python 94 5 Updated Aug 26, 2025

zhengkid / Parallel-R1

The offical repo for "Parallel-R1: Towards Parallel Thinking via Reinforcement Learning"

Python 229 15 Updated Oct 18, 2025

Zihui Cheng gogoczh

Lists (10)

bboxing

dataset

GRPO-optimization

latent-image-reasoning

latent-reasoning

perception

perception-and-reasoning

reasoning

think-with-image

visual-RL

Stars