ppTanya pyh-129

Student of Wuhan University

5 followers · 14 following

Wuhan, Hubei Province, China
https://www.whu.edu.cn/

Stars

alibaba-damo-academy / PixelRefer

The code for PixelRefer & VideoRefer

Jupyter Notebook 332 18 Updated Nov 16, 2025

opendatalab-raiser / Envision

Envision: Benchmarking Unified Understanding & Generation for Causal World Process Insights

JavaScript 28 1 Updated Dec 2, 2025

RyannDaGreat / MotionV2V

HTML 49 1 Updated Dec 8, 2025

qirui-chen / RGA3-release

[ICCV 2025] Object-centric Video Question Answering with Visual Grounding and Referring

Python 22 Updated Aug 8, 2025

EIT-NLP / Awesome-Latent-CoT

This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space.

239 4 Updated Dec 20, 2025

hanyang-21 / VideoScene

[CVPR 2025 Highlight] VideoScene: Distilling Video Diffusion Model to Generate 3D Scenes in One Step

Python 328 9 Updated Jul 4, 2025

nv-tlabs / lyra

Lyra: Generative 3D Scene Reconstruction via Video Diffusion Model Self-Distillation

Python 634 34 Updated Oct 2, 2025

thuml / MiniVeo3-Reasoner

Thinking with Videos from Open-Source Priors. We reproduce chain-of-frames visual reasoning by fine-tuning open-source video models. Give it a star 🌟 if you find it useful.

Python 193 7 Updated Oct 12, 2025

mayuelala / Awesome-Controllable-Video-Generation

[ArXiv 2025] A survey of controllable video generation: this repo is the official awesome list for "Controllable video generation: A survey"

604 38 Updated Nov 11, 2025

HiDream-ai / MotionPro

[CVPR 2025] Official Implementation of MotionPro: A Precise Motion Controller for Image-to-Video Generation

Python 138 15 Updated Aug 11, 2025

Jialuo-Li / Science-T2I

[CVPR 2025] Science-T2I: Addressing Scientific Illusions in Image Synthesis

Python 62 4 Updated Apr 27, 2025

protospatial / NodeToCode

Translate Unreal Engine Blueprints to C++ in seconds. Not hours.

C++ 441 61 Updated Jun 6, 2025

Alpha-VLLM / Lumina-DiMOO

Lumina-DiMOO - An Open-Sourced Multi-Modal Large Diffusion Language Model

Python 911 57 Updated Nov 26, 2025

gojasper / LBM

LBM: Latent Bridge Matching for Fast Image-to-Image Translation ✨ (ICCV 2025 Highlight)

Python 801 49 Updated Jul 24, 2025

SeekingDream / Static-to-Dynamic-LLMEval

The official GitHub repository of the paper "Recent advances in large language model benchmarks against data contamination: From static to dynamic evaluation"

48 2 Updated Sep 13, 2025