Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View Lil-Shake's full-sized avatar

Block or report Lil-Shake

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official codebase for "Self Forcing: Bridging Training and Inference in Autoregressive Video Diffusion" (NeurIPS 2025 Spotlight)

Python 2,726 195 Updated Sep 12, 2025

Official Repo for Self-Forcing++ High Quality Long Video Generation

170 3 Updated Oct 13, 2025

Making Flux go brrr on GPUs.

Python 148 14 Updated Jul 18, 2025

MixGRPO: Unlocking Flow-based GRPO Efficiency with Mixed ODE-SDE

Python 1,013 42 Updated Oct 13, 2025

Official inference repo for FLUX.1 models

Python 24,524 1,799 Updated Jul 31, 2025

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 1,880 91 Updated Aug 15, 2024

[CVPR 2025 (Oral)] Open implementation of "RandAR"

Python 197 6 Updated Jul 14, 2025

This is an early exploration to introduce Interleaving Reasoning to Text-to-image Generation field and achieve the SoTA benchmark performance. It also significantly improves the quality, fine-grain…

Python 64 Updated Sep 14, 2025

[ICCV2025]Code Release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation

Python 176 5 Updated May 21, 2025

[ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.

Python 1,749 74 Updated Oct 22, 2025

Pytorch implementation for the paper titled "SimpleAR: Pushing the Frontier of Autoregressive Visual Generation"

Python 411 19 Updated Jun 20, 2025

TokLIP: Marry Visual Tokens to CLIP for Multimodal Comprehension and Generation

Python 225 5 Updated Aug 18, 2025

[NeurIPS 2025 Spotlight] A Unified Tokenizer for Visual Generation and Understanding

Python 429 8 Updated Sep 22, 2025

Official repo of paper "Reconstruction Alignment Improves Unified Multimodal Models". Unlocking the Massive Zero-shot Potential in Unified Multimodal Models through Self-supervised Learning.

Python 289 10 Updated Oct 16, 2025

HunyuanImage-2.1: An Efficient Diffusion Model for High-Resolution (2K) Text-to-Image Generation​

Python 650 48 Updated Oct 14, 2025

Implementation of Flow Policy Optimization (FPO)

Python 267 10 Updated Sep 29, 2025

Official implementation of Diffusion Policy Policy Optimization, arxiv 2024

Python 660 76 Updated Feb 4, 2025

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 7,871 513 Updated Oct 22, 2025

[ICCV 2025 Highlight] OminiControl: Minimal and Universal Control for Diffusion Transformer

Python 1,803 137 Updated Jul 3, 2025

[CVPR 2025 Oral] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models

Python 1,228 42 Updated Jun 12, 2025
Python 151 9 Updated Oct 15, 2025

[NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization

Python 389 36 Updated Aug 13, 2024

16-fold memory access reduction with nearly no loss

Python 105 8 Updated Mar 26, 2025

Enjoy the magic of Diffusion models!

Python 10,410 972 Updated Oct 22, 2025

(CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models

Python 980 51 Updated Aug 7, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 10,422 1,116 Updated Oct 12, 2025

[NeurIPS 2025] Radial Attention: O(nlogn) Sparse Attention with Energy Decay for Long Video Generation

Python 525 29 Updated Sep 18, 2025

Code for ICML 2025 Paper "Highly Compressed Tokenizer Can Generate Without Training"

Jupyter Notebook 179 11 Updated Jun 10, 2025

[NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL

Python 1,471 76 Updated Oct 17, 2025

VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models

Python 4,978 393 Updated Jul 10, 2024
Next