Stars
GLM-Image: Auto-regressive for Dense-knowledge and High-fidelity Image Generation.
State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!
[CVPR 2025 Oral] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models
[ACM Computing Surveys] The collection of awesome papers on alignment of diffusion models.
[CVPR 2025] Diffusion-4K: Ultra-High-Resolution Image Synthesis with Latent Diffusion Models
A curated collection of fun and creative examples generated with Nano Banana & Nano Banana Pro🍌, Gemini-2.5-flash-image based model. We also release Nano-consistent-150K openly to support the commu…
Directly Aligning the Full Diffusion Trajectory with Fine-Grained Human Preference
Official Pytorch implementation of "StyleKeeper: Prevent Content Leakage using Negative Visual Query Guidance"
[CVPR 2024] X-Adapter: Adding Universal Compatibility of Plugins for Upgraded Diffusion Model
[Image 2 Text Para] Transform Image into Unique Paragraph with ChatGPT, BLIP2, OFA, GRIT, Segment Anything, ControlNet.
minLoRA: a minimal PyTorch library that allows you to apply LoRA to any PyTorch model.
fast-stable-diffusion + DreamBooth
A compendium of informations regarding Stable Diffusion (SD)
Unofficial implementation of Palette: Image-to-Image Diffusion Models by Pytorch
[WACV 2023] Keys to Better Image Inpainting: Structure and Texture Go Hand in Hand
A latent text-to-image diffusion model
Official implementation of Diffusion Autoencoders
Official implementation for "Blended Diffusion for Text-driven Editing of Natural Images" [CVPR 2022]
[CVPR 2022] Pastiche Master: Exemplar-Based High-Resolution Portrait Style Transfer
A toolbox for receptive field analysis and visualizing neural network architectures
Official Implementation for "HyperStyle: StyleGAN Inversion with HyperNetworks for Real Image Editing" (CVPR 2022) https://arxiv.org/abs/2111.15666
Deformable Style Transfer (ECCV 2020)