z-jiaming

Starred repositories

[ArXiv 25] Stable Video Infinity: Infinite-Length Video Generation with Error Recycling

Python 380 19 Updated Oct 27, 2025

Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"

Python 1,380 33 Updated Oct 15, 2025

A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related webs…

619 15 Updated Oct 24, 2025

VideoNSA: Native Sparse Attention Scales Video Understanding

Python 51 1 Updated Oct 8, 2025

Official Repo for Self-Forcing++ High Quality Long Video Generation

175 3 Updated Oct 13, 2025

LongLive: Real-time Interactive Long Video Generation

Python 748 47 Updated Oct 13, 2025

HunyuanImage-3.0: A Powerful Native Multimodal Model for Image Generation

Python 2,317 97 Updated Oct 14, 2025

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 15,600 1,216 Updated Oct 27, 2025
C++ 25 Updated Jul 16, 2025

SpatialVID: A Large-Scale Video Dataset with Spatial Annotations

Python 394 12 Updated Oct 22, 2025

Tracking the latest and greatest research papers on video generation.

78 7 Updated Oct 18, 2025

Unofficial extension implementation of Self-Forcing to support I2V && 14B training.

Python 222 15 Updated Sep 29, 2025

4-steps distilled version of Wan2.2-TI2V-5B

Python 101 6 Updated Sep 12, 2025

A collection of papers/projects that train flow matching models/policies via RL.

277 9 Updated Oct 9, 2025

Pusa: Thousands Timesteps Video Diffusion Model

Python 659 47 Updated Sep 5, 2025

Model analyzer in PyTorch

Python 90 12 Updated Aug 31, 2025

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 3,893 293 Updated Oct 24, 2025

A unified inference and post-training framework for accelerated video generation.

Python 2,478 189 Updated Oct 27, 2025

(CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models

Python 991 51 Updated Aug 7, 2025

code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"

Python 1,054 56 Updated Apr 1, 2025

Official codebase for "Self Forcing: Bridging Training and Inference in Autoregressive Video Diffusion" (NeurIPS 2025 Spotlight)

Python 2,746 195 Updated Sep 12, 2025

T2I-ReasonBench: Benchmarking Reasoning-Informed Text-to-Image Generation

Jupyter Notebook 30 2 Updated Sep 16, 2025

SkyReels-V2: Infinite-length Film Generative model

Python 4,827 674 Updated Aug 11, 2025

Use Claude Code or Cursor CLI on mobile and web with Claude Code UI. Claude Code UI is a free, open-source web UI/GUI that helps you manage your Claude Code sessions and projects remotely.

JavaScript 4,528 552 Updated Oct 8, 2025

An open-source AI agent that brings the power of Gemini directly into your terminal.

TypeScript 80,547 8,902 Updated Oct 27, 2025

Matrix-Game 2.0: An Open-Source, Real-Time, and Streaming Interactive World Model

Python 1,695 175 Updated Oct 4, 2025

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 7,956 523 Updated Oct 27, 2025

Stand-In is a lightweight, plug-and-play framework for identity-preserving video generation.

Python 659 48 Updated Sep 1, 2025
Python 47 1 Updated Mar 24, 2025

[Siggraph '23] NeRSemble: Neural Radiance Field Reconstruction of Human Heads

Python 236 11 Updated Apr 29, 2025