Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View Kevin-thu's full-sized avatar

Block or report Kevin-thu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A paper list for spatial reasoning

550 31 Updated Dec 20, 2025

Unified KV Cache Compression Methods for Auto-Regressive Models

Python 1,291 159 Updated Jan 4, 2025

A Curated List of Awesome Works in World Modeling, Aiming to Serve as a One-stop Resource for Researchers, Practitioners, and Enthusiasts Interested in World Modeling.

1,528 63 Updated Dec 18, 2025

Official repo for paper "Video-As-Prompt: Unified Semantic Control for Video Generation"

Python 330 19 Updated Nov 2, 2025

Official Implementations for Paper - HoloCine: Holistic Generation of Cinematic Multi-Shot Long Video Narratives

Python 563 105 Updated Nov 26, 2025

[CVPR'25] Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System

Python 3,649 382 Updated Dec 3, 2025

Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"

Python 1,644 55 Updated Nov 15, 2025

[ArXiv 25] Stable Video Infinity: Infinite-Length Video Generation with Error Recycling

Python 807 54 Updated Dec 16, 2025

Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment

Python 1,465 94 Updated Sep 11, 2025

Lynx: Towards High-Fidelity Personalized Video Generation

Python 297 38 Updated Sep 26, 2025

Official implementation of HPSv3: Towards Wide-Spectrum Human Preference Score (ICCV2025)

Python 243 13 Updated Dec 5, 2025

Student version of Assignment 1 for Stanford CS336 - Language Modeling From Scratch

Python 1,012 1,338 Updated Aug 29, 2025

๐ŸŒ 3D and 4D World Modeling: A Survey

HTML 722 41 Updated Dec 17, 2025

Collection of extracted System Prompts from popular chatbots like ChatGPT, Claude & Gemini

Roff 24,404 3,724 Updated Dec 22, 2025

A Comprehensive Benchmark Suite for AI Story Visualization

Python 106 5 Updated Dec 22, 2025

๐Ÿ”ฅ [ICCV 2025 Highlight] InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity

Python 2,650 286 Updated Aug 22, 2025

Stand-In is a lightweight, plug-and-play framework for identity-preserving video generation.

Python 692 49 Updated Dec 21, 2025

[NeurIPS 2025] Official implementation of "XVerse: Consistent Multi-Subject Control of Identity and Semantic Attributes via DiT Modulation".

Python 616 46 Updated Oct 22, 2025

[SIGGRAPH Asia 2025] DreamO: A Unified Framework for Image Customization

Python 1,735 129 Updated Aug 14, 2025

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 17,329 1,450 Updated Nov 28, 2025

Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels with Hunyuan3D World Model

Python 2,564 221 Updated Dec 17, 2025

Official inference code and LongText-Bench benchmark for our paper X-Omni (https://arxiv.org/pdf/2507.22058).

Python 402 10 Updated Aug 26, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 12,987 1,510 Updated Dec 17, 2025

Official repository for LTX-Video

Python 8,919 837 Updated Oct 25, 2025

A unified inference and post-training framework for accelerated video generation.

Python 2,842 228 Updated Dec 22, 2025

Official codebase for "Self Forcing: Bridging Training and Inference in Autoregressive Video Diffusion" (NeurIPS 2025 Spotlight)

Python 2,991 220 Updated Sep 12, 2025

Official implementation of ICCV 2025 paper - CharaConsist: Fine-Grained Consistent Character Generation

Jupyter Notebook 139 9 Updated Jul 22, 2025

Official implementation of Geometry Forcing: Marrying Video Diffusion and 3D Representation for Consistent World Modeling

Python 118 1 Updated Nov 26, 2025

[Arxiv'25] MGVQ: Could VQ-VAE Beat VAE? A Generalizable Tokenizer with Multi-group Quantization

Python 53 2 Updated Sep 16, 2025
Python 8 Updated Jul 27, 2025
Next