Thanks to visit codestin.com
Credit goes to github.com

Skip to content

KlingAIResearch/VideoCanvas

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 

Repository files navigation

VideoCanvas: Unified Video Completion from Arbitrary Spatiotemporal Patches via In-Context Conditioning

Minghong Cai1 †, Qiulin Wang2 ✉, Zongli Ye1, Wenze Liu1, Quande Liu2, Weicai Ye2, Xintao Wang2, Pengfei Wan2, Kun Gai2, Xiangyu Yue1 ✉
1MMLab, The Chinese University of Hong Kong 2Kling Team, Kuaishou Technology
†: Intern at Kuaishou Technology, ✉: Corresponding Authors

📋 News

  • [2025.10.9] Release Arxiv paper.

📖 Introduction

VideoCanvas has two key contributions:

  • 🎯 Unified Tasks: VideoCanvas introduces a unified paradigm for arbitrary spatio-temporal video generation, seamlessly integrating diverse capabilities including image/patch-to-video conditioning at any timestamp, inpainting/outpainting, camera control, scene transitions, and video extension.
  • 🛠️ Simple Solution: Our technical innovation leverages In-Context Conditioning with zero-padding for spatial control and Temporal RoPE Interpolation for temporal alignment, achieving frame-precise video generation without fine-tuning VAEs or adding parameters.
teaser.mp4

📖 VideoCanvasBench

We will release this benchmark, including intra-scene and inter-scene evaluation data.

⚙️ Code (Coming soon)

Citation

 @article{cai2025videocanvas,
    title={VideoCanvas: Unified Video Completion from Arbitrary Spatiotemporal Patches via In-Context Conditioning},
    author={Minghong Cai, Qiulin Wang, Zongli Ye, Wenze Liu, Quande Liu, Weicai Ye, Xintao Wang, Pengfei Wan, Kun Gai, Xiangyu Yue},
    journal={arXiv preprint arXiv:2510.08555},
    year={2025}
}

About

Official Code of "VideoCanvas: Unified Video Completion from Arbitrary Spatiotemporal Patches via In-Context Conditioning"

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors