wizardbob

wizard wizardbob

1 follower · 59 following

Starred repositories

XLabs-AI / x-flux

Python 2,229 162 Updated Nov 8, 2024

PRIV-Creation / In-domain-Generation-Diffusion

The official code of "Image is All You Need to Empower Large-scale Diffusion Models for In-Domain Generation". [CVPR2025]

Python 24 Updated Mar 17, 2025

X-GenGroup / Flow-Factory

A unified framework for easy reinforcement learning in Flow-Matching models

Python 182 8 Updated Feb 27, 2026

Kenkenzaii / PrefPaint

Python 44 1 Updated Nov 13, 2025

tgxs002 / HPSv2

Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis

Jupyter Notebook 647 27 Updated May 24, 2024

tgxs002 / align_sd

Better Aligning Text-to-Image Models with Human Preference. ICCV 2023

Python 294 10 Updated Jul 14, 2023

yuvalkirstain / PickScore

Python 579 31 Updated Dec 21, 2024

Kwai-Kolors / MPS

Python 199 9 Updated Jul 12, 2024

facebookresearch / multimodal_rewardbench

Multimodal RewardBench

Python 62 1 Updated Feb 21, 2025

bytedance / OneReward

Python 329 16 Updated Sep 15, 2025

zai-org / GLM-Image

GLM-Image: Auto-regressive for Dense-knowledge and High-fidelity Image Generation.

Python 798 50 Updated Feb 2, 2026

MizzenAI / HPSv3

Official implementation of HPSv3: Towards Wide-Spectrum Human Preference Score (ICCV2025)

Python 268 13 Updated Dec 5, 2025

facebookresearch / MMRB2

Data and sample evaluation codes for Multimodal Rewardbench 2

Python 136 10 Updated Dec 20, 2025

TIGER-AI-Lab / Pixel-Reasoner

Pixel-Level Reasoning Model trained with RL [NeurIPS25]

Python 278 11 Updated Nov 6, 2025

qunzhongwang / vr-thinker

Python 42 1 Updated Oct 20, 2025

appletea233 / EditThinker

Unlocking Iterative Reasoning for Any Image Editor

Python 89 3 Updated Jan 18, 2026

meituan-longcat / LongCat-Image

Python 624 52 Updated Feb 24, 2026

QwenLM / Qwen-Image-Layered

Qwen-Image-Layered: Layered Decomposition for Inherent Editability

Python 1,586 121 Updated Dec 31, 2025

ZiyuGuo99 / MME-CoF

Are Video Models Ready as Zero-shot Reasoners?

Python 84 4 Updated Nov 24, 2025

ZiyuGuo99 / Thinking-while-Generating

The first Interleaved framework for textual reasoning within the visual generation process

158 1 Updated Nov 21, 2025

GVCLab / Awesome-Reasoning-via-VDM

6 1 Updated Jan 13, 2026

XueZeyue / Awesome-Visual-Generation-Alignment-Survey

A survey for visual generation alignment

121 7 Updated Nov 9, 2025

karpathy / nanochat

The best ChatGPT that $100 can buy.

Python 44,145 5,809 Updated Feb 20, 2026

ThreeSR / Awesome-Inference-Time-Scaling

Paper List of Inference/Test Time Scaling/Computing

Python 354 11 Updated Feb 27, 2026

NVlabs / DiffusionNFT

[ICLR 2026 Oral] DiffusionNFT: Online Diffusion Reinforcement with Forward Process

Python 668 25 Updated Feb 10, 2026

PKU-YuanGroup / Edit-R1

Edit-R1: Reinforce Image Editing with Diffusion Negative-Aware Finetuning and MLLM Implicit Feedback

Python 237 8 Updated Jan 24, 2026

Shredded-Pork / TempFlow-GRPO

[ICLR 26] TempFlow-GRPO (Temporal Flow GRPO), a principled GRPO framework that captures and exploits the temporal structure inherent in flow-based generation.

Python 782 42 Updated Nov 24, 2025

VectorSpaceLab / OmniGen2

OmniGen2: Exploration to Advanced Multimodal Generation. https://arxiv.org/abs/2506.18871

Jupyter Notebook 4,031 17 Updated Dec 2, 2025

VectorSpaceLab / EditScore

EditScore: Unlocking Online RL for Image Editing via High-Fidelity Reward Modeling

Python 214 6 Updated Feb 3, 2026

adobe-research / EditVerse

Official repo for paper "EditVerse: Unifying Image and Video Editing and Generation with In-Context Learning"

Python 129 4 Updated Oct 9, 2025

Starred topics

image-animation

video-captioning

image-captioning