Thanks to visit codestin.com
Credit goes to github.com

sharkDDD

Follow

sharkDDD

Follow

3 followers · 65 following

Stars

timothybrooks / instruct-pix2pix

Python 6,829 574 Updated Mar 3, 2024

NVlabs / edm

Elucidating the Design Space of Diffusion-Based Generative Models (EDM)

Python 1,811 180 Updated Mar 16, 2024

sihyun-yu / REPA

[ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think

Python 1,400 64 Updated Mar 16, 2025

deepseek-ai / DeepSeek-OCR

Contexts Optical Compression

Python 19,652 1,381 Updated Oct 25, 2025

ByteDance-Seed / Bagel

Open-source unified multimodal model

Python 5,252 454 Updated Oct 27, 2025

ikostrikov / rlpd

Python 356 34 Updated Feb 13, 2023

LituRout / RF-Inversion

Rectified Flow Inversion (RF-Inversion) - ICLR 2025

Python 460 18 Updated Mar 19, 2025

LTH14 / mar

PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838

Python 1,782 107 Updated Sep 27, 2024

yang-song / score_sde_pytorch

PyTorch implementation for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)

Jupyter Notebook 2,031 340 Updated Jul 14, 2024

bcmi / Granular-GRPO

G2RPO: Granular GRPO for precise reward in flow models

Python 34 Updated Oct 11, 2025

FoundationVision / VAR

[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…

Jupyter Notebook 8,467 541 Updated May 18, 2025

HorizonWind2004 / reconstruction-alignment

Official repo of paper "Reconstruction Alignment Improves Unified Multimodal Models". Unlocking the Massive Zero-shot Potential in Unified Multimodal Models through Self-supervised Learning.

Python 300 10 Updated Oct 16, 2025

NVlabs / DiffusionNFT

DiffusionNFT: Online Diffusion Reinforcement with Forward Process

Python 392 11 Updated Sep 22, 2025

River-Zhang / ICEdit

[NeurIPS 2025] Image editing is worth a single LoRA! 0.1% training data for fantastic image editing! Surpasses GPT-4o in ID persistence~ MoE ckpt released! Only 4GB VRAM is enough to run!

Python 2,013 111 Updated Oct 29, 2025

AccumulateMore / CV

✔（已完结）最全面的深度学习笔记【土堆 Pytorch】【李沐动手学深度学习】【吴恩达深度学习】

Jupyter Notebook 14,268 1,682 Updated Nov 6, 2025

yifan123 / reward-server

Python 42 4 Updated Jul 10, 2025

TsinghuaC3I / Awesome-RL-for-LRMs

A Survey of Reinforcement Learning for Large Reasoning Models

1,991 111 Updated Nov 5, 2025

google-deepmind / deepmind-research

This repository contains implementations and illustrative code to accompany DeepMind publications

Jupyter Notebook 14,430 2,778 Updated Aug 22, 2025

QwenLM / Qwen3-VL

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 16,001 1,266 Updated Oct 27, 2025

OpenGVLab / InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 9,428 735 Updated Sep 22, 2025

bghira / SimpleTuner

A general fine-tuning kit geared toward diffusion models.

Python 2,593 254 Updated Nov 5, 2025

feifeiobama / RectifID

[NeurIPS 2024] RectifID: Personalizing Rectified Flow with Anchored Classifier Guidance

Jupyter Notebook 129 8 Updated Oct 13, 2024

Kunbyte-AI / OmniTry

Official Repository of "OmniTry: Virtual Try-On Anything without Masks"

Python 223 27 Updated Aug 29, 2025

hustvl / LightningDiT

[CVPR 2025 Oral] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models

Python 1,265 42 Updated Jun 12, 2025

xiaomoguhz / DeCLIP

[CVPR 2025] DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception

Python 140 5 Updated Jun 10, 2025

rizavelioglu / tryoffdiff

[CVPR'25-Demo] Official repository of "TryOffDiff: Virtual-Try-Off via High-Fidelity Garment Reconstruction using Diffusion Models".

Python 136 24 Updated Oct 3, 2025

openai / CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 31,435 3,819 Updated Jul 23, 2024

showlab / DiffSim

[ICCV 2025] Official repository of DiffSim: Taming Diffusion Models for Evaluating Visual Similarity

Python 25 1 Updated Jul 14, 2025

Saquib764 / omini-kontext

An inference and training framework for multiple image input in Flux Kontext dev

Jupyter Notebook 416 29 Updated Sep 1, 2025

runjiali-rl / vmem

[ICCV 2025 ⭐highlight⭐] Implementation of VMem: Consistent Interactive Video Scene Generation with Surfel-Indexed View Memory

Python 388 14 Updated Jul 25, 2025