
GongXinyuu
🏡 WFH


A curated list of recent diffusion models for video generation, editing, and various other applications.

5,142 317 Updated Oct 15, 2025

[ICCV 2025] Official implementations for paper: VACE: All-in-One Video Creation and Editing

Python 3,362 243 Updated Oct 17, 2025

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Python 3,321 276 Updated Jul 17, 2025

The best ChatGPT that $100 can buy.

Python 33,134 3,647 Updated Oct 25, 2025

[NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL

Python 1,489 77 Updated Oct 17, 2025

An open-source implementation for fine-tuning Qwen-VL series by Alibaba Cloud.

Python 1,311 163 Updated Oct 22, 2025

Official GitHub repository for FLUX.1 Krea [dev].

Python 348 30 Updated Aug 2, 2025

DiffusionNFT: Online Diffusion Reinforcement with Forward Process

Python 349 10 Updated Sep 22, 2025

VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo

Python 1,243 81 Updated Oct 23, 2025

[Preprint] On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification.

Python 480 19 Updated Oct 20, 2025

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 15,543 1,210 Updated Oct 22, 2025

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 18,955 1,877 Updated Oct 23, 2025

Enjoy the magic of Diffusion models!

Python 10,449 975 Updated Oct 27, 2025

The collection of awesome papers on alignment of diffusion models.

348 16 Updated Oct 24, 2025

[CVPR 2025] HMAR: Efficient Hierarchical Masked Auto-Regressive Image Generation

Python 54 Updated Jul 8, 2025

Long-RL: Scaling RL to Long Sequences (NeurIPS 2025)

Python 646 23 Updated Sep 24, 2025

A unified inference and post-training framework for accelerated video generation.

Python 2,475 189 Updated Oct 27, 2025

My learning notes/codes for ML SYS.

Python 3,984 240 Updated Oct 6, 2025

Official Implementation of Paper Transfer between Modalities with MetaQueries

Python 256 6 Updated Oct 12, 2025

An official implementation of DanceGRPO: Unleashing GRPO on Visual Generation

Python 1,101 51 Updated Oct 16, 2025

[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer

Python 11,397 1,165 Updated Oct 11, 2025

Uni3C: Unifying Precisely 3D-Enhanced Camera and Human Motion Controls for Video Generation [SIGGRAPH Asia 2025]

Python 425 23 Updated Sep 21, 2025

APOLLO: SGD-like Memory, AdamW-level Performance; MLSys'25 Outstanding Paper Honorable Mention

Python 258 13 Updated Apr 25, 2025

[ICML2025, NeurIPS2025 Spotlight] Sparse VideoGen 1 & 2: Accelerating Video Diffusion Transformers with Sparse Attention

Python 524 29 Updated Oct 5, 2025

MAGI-1: Autoregressive Video Generation at Scale

Python 3,524 208 Updated Jun 17, 2025

Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment

Python 1,442 89 Updated Sep 11, 2025

Official repository for LTX-Video

Python 8,511 766 Updated Oct 25, 2025

Let's make video diffusion practical!

Python 16,021 1,530 Updated Oct 16, 2025

[NeurIPS 2025] Improving Video Generation with Human Feedback

Python 310 5 Updated Sep 24, 2025

SkyReels-A2: Compose anything in video diffusion transformers

Python 676 62 Updated Jun 3, 2025