Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View pyh-129's full-sized avatar

Highlights

  • Pro

Block or report pyh-129

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

The code for PixelRefer & VideoRefer

Jupyter Notebook 332 18 Updated Nov 16, 2025

Envision: Benchmarking Unified Understanding & Generation for Causal World Process Insights

JavaScript 28 1 Updated Dec 2, 2025
HTML 49 1 Updated Dec 8, 2025

[ICCV 2025] Object-centric Video Question Answering with Visual Grounding and Referring

Python 22 Updated Aug 8, 2025

This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space.

239 4 Updated Dec 20, 2025

[CVPR 2025 Highlight] VideoScene: Distilling Video Diffusion Model to Generate 3D Scenes in One Step

Python 328 9 Updated Jul 4, 2025

Lyra: Generative 3D Scene Reconstruction via Video Diffusion Model Self-Distillation

Python 634 34 Updated Oct 2, 2025

Thinking with Videos from Open-Source Priors. We reproduce chain-of-frames visual reasoning by fine-tuning open-source video models. Give it a star 🌟 if you find it useful.

Python 193 7 Updated Oct 12, 2025

[ArXiv 2025] A survey about controllable video generation: This repo is the official awesome of "Controllable video generation: A survey"

604 38 Updated Nov 11, 2025

[CVPR 2025] Official Implementation of MotionPro: A Precise Motion Controller for Image-to-Video Generation

Python 138 15 Updated Aug 11, 2025

[CVPR 2025] Science-T2I: Addressing Scientific Illusions in Image Synthesis

Python 62 4 Updated Apr 27, 2025

Translate Unreal Engine Blueprints to C++ in seconds. Not hours.

C++ 441 61 Updated Jun 6, 2025

Lumina-DiMOO - An Open-Sourced Multi-Modal Large Diffusion Language Model

Python 911 57 Updated Nov 26, 2025

LBM: Latent Bridge Matching for Fast Image-to-Image Translation ✨ (ICCV 2025 Highlight)

Python 801 49 Updated Jul 24, 2025

The official GitHub repository of the paper "Recent advances in large langauge model benchmarks against data contamination: From static to dynamic evaluation"

48 2 Updated Sep 13, 2025

[TMLR 2025] Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models

711 34 Updated Oct 20, 2025

WeTok: Powerful Discrete Tokenization for High-Fidelity Visual Reconstruction

Python 57 1 Updated Sep 3, 2025

MM-EUREKA: Exploring the Frontiers of Multimodal Reasoning with Rule-based Reinforcement Learning

Python 764 29 Updated Sep 7, 2025

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Python 3,318 348 Updated Dec 3, 2025

[ACM MM 2025] ViTCoT: Video-Text Interleaved Chain-of-Thought for Boosting Video Understanding in Large Language Models

Python 17 Updated Jul 15, 2025

ComfyMind: Toward General-Purpose Generation via Tree-Based Planning and Reactive Feedback

Python 121 4 Updated Sep 20, 2025

Large World Model -- Modeling Text and Video with Millions Context

Python 7,389 560 Updated Oct 19, 2024

FlexTok: Resampling Images into 1D Token Sequences of Flexible Length

Jupyter Notebook 277 14 Updated Jun 2, 2025

This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."

MATLAB 13,889 1,303 Updated Oct 28, 2025
Python 1,036 63 Updated Nov 20, 2025

A scalable, end-to-end training pipeline for general-purpose agents

Python 362 54 Updated Jul 4, 2025

Rethinking High-Quality Aesthetic Poster Generation in a Unified Framework

Python 876 47 Updated Jul 1, 2025

[CVPR‘ 2025 ] JarvisIR: Elevating Autonomous Driving Perception with Intelligent Image Restoration

Python 248 14 Updated Dec 14, 2025

Official Implementation of Paper Transfer between Modalities with MetaQueries

Python 279 9 Updated Oct 12, 2025
Next