Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View sharkDDD's full-sized avatar

Block or report sharkDDD

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Elucidating the Design Space of Diffusion-Based Generative Models (EDM)

Python 1,811 180 Updated Mar 16, 2024

[ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think

Python 1,400 64 Updated Mar 16, 2025

Contexts Optical Compression

Python 19,652 1,381 Updated Oct 25, 2025

Open-source unified multimodal model

Python 5,252 454 Updated Oct 27, 2025
Python 356 34 Updated Feb 13, 2023

Rectified Flow Inversion (RF-Inversion) - ICLR 2025

Python 460 18 Updated Mar 19, 2025

PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838

Python 1,782 107 Updated Sep 27, 2024

PyTorch implementation for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)

Jupyter Notebook 2,031 340 Updated Jul 14, 2024

G2RPO: Granular GRPO for precise reward in flow models

Python 34 Updated Oct 11, 2025

[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…

Jupyter Notebook 8,467 541 Updated May 18, 2025

Official repo of paper "Reconstruction Alignment Improves Unified Multimodal Models". Unlocking the Massive Zero-shot Potential in Unified Multimodal Models through Self-supervised Learning.

Python 300 10 Updated Oct 16, 2025

DiffusionNFT: Online Diffusion Reinforcement with Forward Process

Python 392 11 Updated Sep 22, 2025

[NeurIPS 2025] Image editing is worth a single LoRA! 0.1% training data for fantastic image editing! Surpasses GPT-4o in ID persistence~ MoE ckpt released! Only 4GB VRAM is enough to run!

Python 2,013 111 Updated Oct 29, 2025

✔(已完结)最全面的 深度学习 笔记【土堆 Pytorch】【李沐 动手学深度学习】【吴恩达 深度学习】

Jupyter Notebook 14,268 1,682 Updated Nov 6, 2025
Python 42 4 Updated Jul 10, 2025

A Survey of Reinforcement Learning for Large Reasoning Models

1,991 111 Updated Nov 5, 2025

This repository contains implementations and illustrative code to accompany DeepMind publications

Jupyter Notebook 14,430 2,778 Updated Aug 22, 2025

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 16,001 1,266 Updated Oct 27, 2025

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 9,428 735 Updated Sep 22, 2025

A general fine-tuning kit geared toward diffusion models.

Python 2,593 254 Updated Nov 5, 2025

[NeurIPS 2024] RectifID: Personalizing Rectified Flow with Anchored Classifier Guidance

Jupyter Notebook 129 8 Updated Oct 13, 2024

Official Repository of "OmniTry: Virtual Try-On Anything without Masks"

Python 223 27 Updated Aug 29, 2025

[CVPR 2025 Oral] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models

Python 1,265 42 Updated Jun 12, 2025

[CVPR 2025] DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception

Python 140 5 Updated Jun 10, 2025

[CVPR'25-Demo] Official repository of "TryOffDiff: Virtual-Try-Off via High-Fidelity Garment Reconstruction using Diffusion Models".

Python 136 24 Updated Oct 3, 2025

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 31,435 3,819 Updated Jul 23, 2024

[ICCV 2025] Official repository of DiffSim: Taming Diffusion Models for Evaluating Visual Similarity

Python 25 1 Updated Jul 14, 2025

An inference and training framework for multiple image input in Flux Kontext dev

Jupyter Notebook 416 29 Updated Sep 1, 2025

[ICCV 2025 ⭐highlight⭐] Implementation of VMem: Consistent Interactive Video Scene Generation with Surfel-Indexed View Memory

Python 388 14 Updated Jul 25, 2025
Next