Highlights
- Pro
Starred repositories
Elevate your AI research writing, no more tedious polishing ✨
Claude Code VS Code extension patched for Force Local mode — run CLI locally, proxy file ops to remote server via VS Code Remote SSH
Easy and fast 2d human and animal multi pose estimation using SOTA ViTPose [Y. Xu et al., 2022] Real-time performances and multiple skeletons supported.
[CVPR 2025] Hallo3: Highly Dynamic and Realistic Portrait Image Animation with Video Diffusion Transformer
WiLoR: End-to-end 3D hand localization and reconstruction in-the-wild
Project page for "3D-Aware Implicit Motion Control for View-Adaptive Human Video Generation"
[SIGGRAPH Asia 2025] The official repo for the conference paper "MV-Performer: Taming Video Diffusion Model for Faithful and Synchronized Multi-view Performer Synthesis".
🔥(CVPR 2025 Highlight) Dyn-HaMR: Recovering 4D Interacting Hand Motion from a Dynamic Camera
Masked Depth Modeling for Spatial Perception
Official implementation of "VideoMaMa: Mask-Guided Video Matting via Generative Prior", CVPR 2026
Official implementation of the paper "Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Video Content".
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
Pythonic bindings for FFmpeg's libraries.
VerseCrafter: Dynamic Realistic Video World Model with 4D Geometric Control
Official Codebase for our CVPR 2026 paper UniSH: Unifying Scene and Human Reconstruction in a Feed-Forward Pass
[CVPR 2026] InfiniDepth: Arbitrary-Resolution and Fine-Grained Depth Estimation with Neural Implicit Fields
HY-Motion model for 3D human motion or 3D character animation generation.
Lossy PNG compressor — pngquant command based on libimagequant library
Long-horizon, spatially consistent video generation enabled by persistent 3D scene point clouds and dynamic-static disentanglement.
[CVPR 2026] Towards Real-Time Diffusion-Based Streaming Video Super-Resolution — An efficient one-step diffusion framework for streaming VSR with locality-constrained sparse attention and a tiny co…
Code for ICCV 2021 paper "HuMoR: 3D Human Motion Model for Robust Pose Estimation"
Official repository for the paper "CAP4D: Creating Animatable 4D Portrait Avatars with Morphable Multi-View Diffusion Models"
An unified model for 4D human-scene reconstruction
[CVPR 2025] Official code for Using Diffusion Priors for Video Amodal Segmentation
4DHumans: Reconstructing and Tracking Humans with Transformers