Stars
[ICLR 2024] SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction
Mixture-of-Groups Attention for End-to-End Long Video Generation
Official Pytorch Implementation for "Time-to-Move: Training-Free Motion Controlled Video Generation via Dual-Clock Denoising"
Official Code for MotionCtrl [SIGGRAPH 2024]
Efficient Long Video Generation via Next-Frame-Rate Prediction
[ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think
利用HuggingFace的官方下载工具从镜像网站进行高速下载。
Official code for StoryMem: Multi-shot Long Video Storytelling with Memory
[ICLR 2024] Code for FreeNoise based on VideoCrafter
(CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models
[CVPR 2025] StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text
[ICCV 2025] Free4D: Tuning-free 4D Scene Generation with Spatial-Temporal Consistency
TurboDiffusion: 100–200× Acceleration for Video Diffusion Models
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
Long-horizon, spatially consistent video generation enabled by persistent 3D scene point clouds and dynamic-static disentanglement.
Official repo for paper "IC-Effect: Precise and Efficient Video Effects Editing via In-Context Learning"
HY-World 1.5: A Systematic Framework for Interactive World Modeling with Real-Time Latency and Geometric Consistency
[ICCV 2025] GameFactory: Creating New Games with Generative Interactive Videos
Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis (ECCV 2024 Oral) - Official Implementation
[ICLR'25] SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints
[CVPR 2025 Highlight] GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control
[NeurIPS 2025] WorldMem: Long-term Consistent World Simulation with Memory
[ICCV 2025 ⭐highlight⭐] Implementation of VMem: Consistent Interactive Video Scene Generation with Surfel-Indexed View Memory
[TPAMI 2025] ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis
Official implementation of Geometry Forcing: Marrying Video Diffusion and 3D Representation for Consistent World Modeling
Official implementation for "RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers" (ICML 2025) and "UltraViCo: Breaking Extrapolation Limits in Video Diffusion Transformers"