Thanks to visit codestin.com
Credit goes to github.com

hwenjun18

Follow

Wenjun Huang hwenjun18

Follow

Rookie Programmer ^o^

3 followers · 16 following

Stars

dc-ai-projects / DC-Gen

DC-Gen: Post-Training Diffusion Acceleration with Deeply Compressed Latent Space

Python 259 8 Updated Oct 5, 2025

shiml20 / SVG

Official PyTorch Implementation of "Latent Diffusion Model Without Variational Autoencoder".

Python 214 2 Updated Oct 20, 2025

karpathy / nanochat

The best ChatGPT that $100 can buy.

Python 33,862 3,752 Updated Oct 25, 2025

GCYZSL / MoLA

Python 161 11 Updated Jul 22, 2024

WeChatCV / Stand-In

Stand-In is a lightweight, plug-and-play framework for identity-preserving video generation.

Python 658 48 Updated Sep 1, 2025

JyChen9811 / FaithDiff

[CVPR 2025] FaithDiff for Classic Film Rejuvenation, Old Photo Revival, Social Media Restoration, Image Enhancement and AIGC Enhancement.

Python 191 11 Updated Aug 11, 2025

TencentARC / IC-Custom

[Arxiv'25] IC-Custom: Diverse Image Customization via In-Context Learning

Python 144 3 Updated Sep 15, 2025

ali-vilab / TeaCache

Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model

Python 1,148 44 Updated Jun 8, 2025

FlyMyAI / flymyai-lora-trainer

Qwen-Image text to image lora trainer

Python 526 42 Updated Oct 16, 2025

Eyeline-Labs / CineScale

Code for CineScale, higher-resolution video generation based on Wan

Python 170 2 Updated Aug 25, 2025

jamez-bondos / awesome-gpt4o-images

Awesome curated collection of images and prompts generated by GPT-4o and gpt-image-1. Explore AI generated visuals created with ChatGPT and Sora, showcasing OpenAI’s advanced image generation capab…

JavaScript 7,582 1,461 Updated May 26, 2025

PicoTrex / Awesome-Nano-Banana-images

A curated collection of fun and creative examples generated with Nano Banana🍌, Gemini-2.5-flash-image based model. We also release Nano-consistent-150K openly to support the community's development…

15,051 1,570 Updated Sep 24, 2025

littlejuyan / FusingGlobalandLocal

Python 31 4 Updated Jul 10, 2022

feizc / DiT-MoE

Scaling Diffusion Transformers with Mixture of Experts

Python 393 19 Updated Sep 9, 2024

maidacundo / MoE-LoRA

Adapt an LLM model to a Mixture-of-Experts model using Parameter Efficient finetuning (LoRA), injecting the LoRAs in the FFN.

Python 62 2 Updated Oct 21, 2025

yushuiwx / Mixture-of-LoRA-Experts

Python 54 6 Updated Dec 2, 2024

XueZeyue / DanceGRPO

An official implementation of DanceGRPO: Unleashing GRPO on Visual Generation

Python 1,111 52 Updated Oct 16, 2025

zhang0jhon / diffusion-4k

[CVPR 2025] Diffusion-4K: Ultra-High-Resolution Image Synthesis with Latent Diffusion Models

Python 310 11 Updated Jun 3, 2025

FilippoBotti / MambaST

Python 49 7 Updated Jun 24, 2025

nv-tlabs / cosmos-transfer1-diffusion-renderer

Cosmos-Transfer1-DiffusionRenderer: High-quality video de-lighting and re-lighting based on Cosmos video diffusion framework

Jupyter Notebook 732 53 Updated Oct 2, 2025

microsoft / markitdown

Python tool for converting files and office documents to Markdown.

Python 82,198 4,616 Updated Oct 20, 2025

haidog-yaqub / MeanFlow

Pytorch Implementation (unofficial) of the paper "Mean Flows for One-step Generative Modeling" by Geng et al.

Python 911 52 Updated Oct 16, 2025

noxsine / LDP

Jupyter Notebook 40 1 Updated Sep 3, 2025

PieceZhang / MPT-CataBlur

Python 20 5 Updated Jun 3, 2025

bryanswkim / Chain-of-Zoom

[NeurIPS'25 Spotlight] Official repository for "Chain-of-Zoom: Extreme Super-Resolution via Scale Autoregression and Preference Alignment"

Python 732 75 Updated Sep 27, 2025

stepfun-ai / Step1X-Edit

A SOTA open-source image editing model, which aims to provide comparable performance against the closed-source models like GPT-4o and Gemini 2 Flash.

Python 1,694 78 Updated Sep 8, 2025

xlite-dev / Awesome-DiT-Inference

📚A curated list of Awesome Diffusion Inference Papers with Codes: Sampling, Cache, Quantization, Parallelism, etc.🎉

Python 434 21 Updated Aug 19, 2025

showlab / Show-o

[ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.

Python 1,753 75 Updated Oct 22, 2025

FoundationVision / LlamaGen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 1,880 89 Updated Aug 15, 2024

Visual-Agent / DeepEyes

Python 892 53 Updated Oct 20, 2025