Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View crystallee-ai's full-sized avatar
  • hangzhou

Highlights

  • Pro

Organizations

@fudan-generative-vision

Block or report crystallee-ai

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

rCM: SOTA Diffusion Distillation & Few-Step Video Generation

Python 240 12 Updated Oct 19, 2025

Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"

Python 1,351 32 Updated Oct 15, 2025

[ICLR 2025] Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation

Python 3,652 528 Updated Feb 27, 2025

HunyuanImage-3.0: A Powerful Native Multimodal Model for Image Generation

Python 2,296 96 Updated Oct 14, 2025

Pytorch implementation of MeanFlow on ImageNet and CIFAR10

Python 315 17 Updated Aug 23, 2025
Python 660 24 Updated Dec 5, 2024

A mini-library for training consistency models.

Jupyter Notebook 249 26 Updated Dec 26, 2023

Unofficial extension implementation of Self-Forcing to support I2V && 14B training.

Python 220 15 Updated Sep 29, 2025

Light Video Generation Inference Framework

Python 686 41 Updated Oct 24, 2025

Qwen-Image-Lightning: Speed up Qwen-Image model with distillation

Python 862 35 Updated Oct 14, 2025

A pipeline parallel training script for diffusion models.

Python 1,657 223 Updated Sep 14, 2025

Elucidating the Design Space of Diffusion-Based Generative Models (EDM)

Python 1,806 177 Updated Mar 16, 2024
Python 5 Updated Jul 20, 2025

[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)

Jupyter Notebook 1,824 100 Updated Feb 1, 2025
Python 260 19 Updated Oct 14, 2025

Official implementation of BLIP3o-Series

Python 1,553 68 Updated Oct 20, 2025

The AI Scientist-v2: Workshop-Level Automated Scientific Discovery via Agentic Tree Search

Python 1,686 322 Updated Oct 24, 2025

PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO

Python 7,260 1,003 Updated Jul 3, 2024

[ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think

Python 1,382 63 Updated Mar 16, 2025

[CVPR 2025 Oral] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models

Python 1,232 42 Updated Jun 12, 2025

Benchmarking physical understanding in generative video models

Python 208 19 Updated Sep 29, 2025

[CVPR 2025 Oral]Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

Python 1,475 79 Updated Jun 24, 2025

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Python 7,948 707 Updated May 31, 2024

[ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling

Python 3,102 307 Updated Dec 21, 2024

[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…

Jupyter Notebook 8,447 541 Updated May 18, 2025

[ECCV 2024, Oral] FMBoost: Boosting Latent Diffusion with Flow Matching

Python 247 6 Updated Oct 17, 2025

[CVPR2025] PyTorch-based reimplementation of CrossFlow, as proposed in 'Flowing from Words to Pixels: A Noise-Free Framework for Cross-Modality Evolution'

Python 312 3 Updated Jun 8, 2025

A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.

Python 3,603 242 Updated Sep 25, 2025

✨✨Latest Advances on Multimodal Large Language Models

16,533 1,069 Updated Oct 25, 2025
Next