Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View RockeyCoss's full-sized avatar
😧
😧
  • the Solar System

Highlights

  • Pro

Block or report RockeyCoss

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official code for paper Advantage Weighted Matching: Aligning RL with Pretraining in Diffusion Models

Python 17 Updated Jan 16, 2026

Scalable group inference for generating high quality and diverse images with diffusion models.

Python 38 1 Updated Aug 31, 2025

LoRA fine-tuning for FLUX.2 to improve virtual try-on (VTON) capabilities

Python 2 Updated Dec 9, 2025

Implementation of Particle Guidance: non-I.I.D. Diverse Sampling with Diffusion Models

Python 77 3 Updated Oct 23, 2023

[Tutorial] Few-Step Distillation for Text-to-Image Generation: A Practical Guide

Python 325 21 Updated Dec 31, 2025

Cosmos-Predict2.5, the latest version of the Cosmos World Foundation Models (WFMs) family, specialized for simulating and predicting the future state of the world in the form of video.

Python 685 59 Updated Jan 5, 2026

Official implementation for What matters for Representation Alignment: Global Information or Spatial Structure?

Python 191 9 Updated Dec 15, 2025

Implementation of "Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length"

Python 1,420 141 Updated Dec 30, 2025
Python 9,115 559 Updated Jan 7, 2026

Official PyTorch Implementation of "Optimal Stepsize for Diffusion Sampling".

Python 194 12 Updated Apr 13, 2025

PyTorch implementation of JiT https://arxiv.org/abs/2511.13720

Python 1,998 124 Updated Dec 8, 2025

Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.

Jupyter Notebook 3,885 311 Updated Jun 12, 2025

Enhancing Reward Models for High-quality Image Generation: Beyond Text-Image Alignment [ICCV 2025] - Official implementation

Python 41 1 Updated Aug 5, 2025

Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"

Python 1,700 59 Updated Dec 26, 2025

SigLIP-based Aesthetic Score Predictor

Python 373 9 Updated Dec 18, 2024

Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.

Python 7,046 407 Updated Dec 31, 2025

DiffusionNFT: Online Diffusion Reinforcement with Forward Process

Python 551 21 Updated Jan 6, 2026

Control and limit battery charging on Apple Silicon MacBooks.

Go 1,394 55 Updated Jan 13, 2026

Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.

Jupyter Notebook 3,281 205 Updated Jan 8, 2026

Directly Aligning the Full Diffusion Trajectory with Fine-Grained Human Preference

Python 1,244 41 Updated Oct 26, 2025

An official implementation of Coefficients-Preserving Sampling for Reinforcement Learning with Flow Matching

Python 59 3 Updated Sep 11, 2025

A unified inference and post-training framework for accelerated video generation.

Python 2,964 241 Updated Jan 18, 2026

The official UniVerse-1 code.

Python 118 8 Updated Oct 13, 2025

Official implementation of Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning

Python 208 8 Updated Jan 14, 2026

NextStep-1: SOTA Autogressive Image Generation with Continuous Tokens. A research project developed by the StepFun’s Multimodal Intelligence team.

Python 597 18 Updated Dec 25, 2025

Pytorch implementation for MeanFlow

Jupyter Notebook 295 24 Updated Jul 30, 2025

TempFlow-GRPO (Temporal Flow GRPO), a principled GRPO framework that captures and exploits the temporal structure inherent in flow-based generation.

Python 844 45 Updated Nov 24, 2025

Enjoy the magic of Diffusion models!

Python 11,488 1,096 Updated Jan 15, 2026
Next