Lil-Shake

4 followers · 4 following

Stars

guandeh17 / Self-Forcing

Official codebase for "Self Forcing: Bridging Training and Inference in Autoregressive Video Diffusion" (NeurIPS 2025 Spotlight)

Python 2,726 195 Updated Sep 12, 2025

justincui03 / Self-Forcing-Plus-Plus

Official Repo for Self-Forcing++ High Quality Long Video Generation

170 3 Updated Oct 13, 2025

huggingface / flux-fast

Making Flux go brrr on GPUs.

Python 148 14 Updated Jul 18, 2025

Tencent-Hunyuan / MixGRPO

MixGRPO: Unlocking Flow-based GRPO Efficiency with Mixed ODE-SDE

Python 1,013 42 Updated Oct 13, 2025

black-forest-labs / flux

Official inference repo for FLUX.1 models

Python 24,524 1,799 Updated Jul 31, 2025

FoundationVision / LlamaGen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 1,880 91 Updated Aug 15, 2024

ziqipang / RandAR

[CVPR 2025 (Oral)] Open implementation of "RandAR"

Python 197 6 Updated Jul 14, 2025

Osilly / Interleaving-Reasoning-Generation

This is an early exploration to introduce Interleaving Reasoning to Text-to-image Generation field and achieve the SoTA benchmark performance. It also significantly improves the quality, fine-grain…

Python 64 Updated Sep 14, 2025

wusize / Harmon

[ICCV 2025] Code Release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation

Python 176 5 Updated May 21, 2025

showlab / Show-o

[ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.

Python 1,749 74 Updated Oct 22, 2025

wdrink / SimpleAR

Pytorch implementation for the paper titled "SimpleAR: Pushing the Frontier of Autoregressive Visual Generation"

Python 411 19 Updated Jun 20, 2025

TencentARC / TokLIP

TokLIP: Marry Visual Tokens to CLIP for Multimodal Comprehension and Generation

Python 225 5 Updated Aug 18, 2025

FoundationVision / UniTok

[NeurIPS 2025 Spotlight] A Unified Tokenizer for Visual Generation and Understanding

Python 429 8 Updated Sep 22, 2025

HorizonWind2004 / reconstruction-alignment

Official repo of paper "Reconstruction Alignment Improves Unified Multimodal Models". Unlocking the Massive Zero-shot Potential in Unified Multimodal Models through Self-supervised Learning.

Python 289 10 Updated Oct 16, 2025

Tencent-Hunyuan / HunyuanImage-2.1

HunyuanImage-2.1: An Efficient Diffusion Model for High-Resolution (2K) Text-to-Image Generation

Python 650 48 Updated Oct 14, 2025

akanazawa / fpo

Implementation of Flow Policy Optimization (FPO)

Python 267 10 Updated Sep 29, 2025

irom-princeton / dppo

Official implementation of Diffusion Policy Policy Optimization, arxiv 2024

Python 660 76 Updated Feb 4, 2025

facebookresearch / dinov3

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 7,871 513 Updated Oct 22, 2025

Yuanshi9815 / OminiControl

[ICCV 2025 Highlight] OminiControl: Minimal and Universal Control for Diffusion Transformer

Python 1,803 137 Updated Jul 3, 2025

hustvl / LightningDiT

[CVPR 2025 Oral] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models

Python 1,228 42 Updated Jun 12, 2025

CompVis / tread

Python 151 9 Updated Oct 15, 2025

SqueezeAILab / KVQuant

[NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization

Python 389 36 Updated Aug 13, 2024

andy-yang-1 / DoubleSparse

16-fold memory access reduction with nearly no loss

Python 105 8 Updated Mar 26, 2025

modelscope / DiffSynth-Studio

Enjoy the magic of Diffusion models!

Python 10,410 972 Updated Oct 22, 2025

tianweiy / CausVid

(CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models

Python 980 51 Updated Aug 7, 2025

Wan-Video / Wan2.2

Wan: Open and Advanced Large-Scale Video Generative Models

Python 10,422 1,116 Updated Oct 12, 2025

mit-han-lab / radial-attention

[NeurIPS 2025] Radial Attention: O(nlogn) Sparse Attention with Energy Decay for Long Video Generation

Python 525 29 Updated Sep 18, 2025

lukaslaobeyer / token-opt

Code for ICML 2025 Paper "Highly Compressed Tokenizer Can Generate Without Training"

Jupyter Notebook 179 11 Updated Jun 10, 2025

yifan123 / flow_grpo

[NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL

Python 1,471 76 Updated Oct 17, 2025

AILab-CVC / VideoCrafter

VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models

Python 4,978 393 Updated Jul 10, 2024