Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View 01yzzyu's full-sized avatar
🏠
Working
🏠
Working

Block or report 01yzzyu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[ICML 2025 Oral] The official repository for the paper "Can MLLMs Reason in Multimodality? EMMA: An Enhanced MultiModal ReAsoning Benchmark"

Python 70 Updated Jul 17, 2025

MME-CoT: Benchmarking Chain-of-Thought in LMMs for Reasoning Quality, Robustness, and Efficiency

Python 136 5 Updated Aug 5, 2025

XModBench: Benchmarking Cross-Modal Capabilities and Consistency in Omni-Language Models

Python 4 Updated Oct 23, 2025

Awesome Unified Multimodal Models

998 31 Updated Aug 17, 2025

Official code implementation of the paper: QuCo-RAG: Quantifying Uncertainty from the Pre-training Corpus for Dynamic Retrieval-Augmented Generation

Python 16 2 Updated Dec 25, 2025

A collection of awesome image inpainting studies.

TeX 357 25 Updated Dec 11, 2025

[CVPR 2025] Video Narration as Vocabulary & Video as Long Document

Python 582 31 Updated Mar 13, 2025

Official eval code for ROVER: Benchmarking Reciprocal Cross-Modal Reasoning for Omnimodal Generation

Python 27 Updated Dec 12, 2025

Uni-CoT: Towards Unified Chain-of-Thought Reasoning Across Text and Vision

Python 185 3 Updated Dec 23, 2025

[NIPS 2025 DB Oral] Official Repository of paper: Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing

Python 130 7 Updated Dec 17, 2025

UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation

Python 822 25 Updated Dec 23, 2025

Unified Multimodal Model for image generation/editing/understanding

Python 820 38 Updated Sep 8, 2025

OmniGen2: Exploration to Advanced Multimodal Generation.

Jupyter Notebook 3,975 12 Updated Dec 2, 2025

Official implementation of BLIP3o-Series

Python 1,613 73 Updated Nov 29, 2025
Python 36 2 Updated Dec 11, 2025

Next-Token Prediction is All You Need

Python 2,271 91 Updated Nov 19, 2025

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 17,646 2,233 Updated Feb 1, 2025

[ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.

Python 1,832 81 Updated Dec 15, 2025

Video Generation, Physical Commonsense, Semantic Adherence, VideoCon-Physics

Python 166 12 Updated May 6, 2025

[arXiv 2025] MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence

Python 67 Updated Dec 23, 2025
Python 114 3 Updated Nov 1, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 21,959 3,863 Updated Dec 25, 2025
Python 20 1 Updated Dec 10, 2025

📖 This is a repository for organizing papers, codes, and other resources related to unified multimodal models.

339 15 Updated Oct 16, 2025

[ACL 2025] The Role of Visual Modality in Multimodal Mathematical Reasoning: Challenges and Insights

Python 7 1 Updated Jun 12, 2025

R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization

Python 446 Updated Dec 16, 2025

This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space.

243 5 Updated Dec 23, 2025

Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual information for complex reasoning, planning, and generation.

1,227 40 Updated Dec 23, 2025

MedEvalKit: A Unified Medical Evaluation Framework

Python 193 17 Updated Oct 23, 2025

ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical Reasoning

Python 104 6 Updated Oct 28, 2025
Next