Diffusion Prism: Enhancing Diversity and Morphology Consistency in Mask-to-Image Diffusion

News: This paper is accepted by the WACV 2024 4th Workshop on Image/Video/Audio Quality in Computer Vision and Generative AI

For more details, visit the Project Page.

Introduction

Diffusion Prism is a training-free framework that efficiently transforms binary masks into realistic and diverse samples while preserving morphological features. We explored that a small amount of artificial noise will significantly assist the image-denoising process. To prove this novel mask-to-image concept, we use nano-dendritic patterns as an example to demonstrate the merit of our method compared to existing controllable diffusion models. We also extend the proposed framework to other biological patterns, highlighting its potential applications across various fields.

Quick Tutorial

First, please download stable-diffusion-v1-5 model file from: https://huggingface.co/stable-diffusion-v1-5/stable-diffusion-v1-5/resolve/main/v1-5-pruned.ckpt and place it into the diffusion_prism\models\ldm\stable-diffusion-v1\ folder.
Run mask_diffuser_demo.py as a demo to show the proposed 'perlin_mask' method from the paper.
Go to exp to check the evaluation-related functions such as random_forest.py
Run other_dataset_eval.py to produce the results of FID, CLIP Score, and SSIM. It will generate annotation for the test folder as well.

We will update more details later according to the request. Please contact us anytime if you have questions.

Sample Dataset

Dataset: Download from Google Drive

Key Features

Training-Free Diffusion Framework: Generates images from binary skeletons without the need for model training or fine-tuning.
Diverse Backgrounds: Creates images with varied and realistic backgrounds, enhancing model generalizability.

Methodology

Diffusion Process:

Combines masks with controllable noise, processed through a Variational Autoencoder (VAE) to generate latent variables.
The denoising U-Net refines these variables to produce realistic images guided by text prompts.

Experimental Results

High-Quality: Lowest FID score compared to other methods, indicating better realistic styles.
Consistency: Morphology preserving, the skeleton shape is well-kept in synthesized images.

For more details, visit the Project Page.

Citation

Diffusion Prism: Enhancing Diversity and Morphology Consistency in Mask-to-Image Diffusion

@article{wang2025diffusion, title={Diffusion Prism: Enhancing Diversity and Morphology Consistency in Mask-to-Image Diffusion}, author={Wang, Hao and Chen, Xiwen and Bastola, Ashish and Qin, Jiayou and Razi, Abolfazl}, journal={arXiv preprint arXiv:2501.00944}, year={2025} }

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
Figure		Figure
configs		configs
exp		exp
ldm		ldm
models		models
.gitignore		.gitignore
README.md		README.md
download_first_stages.sh		download_first_stages.sh
download_models.sh		download_models.sh
environment.yaml		environment.yaml
latent_imagenet_diffusion.ipynb		latent_imagenet_diffusion.ipynb
main.py		main.py
mask_diffuser_demo.py		mask_diffuser_demo.py
notebook_helpers.py		notebook_helpers.py
requirements.py		requirements.py
requirements.txt		requirements.txt
sample_diffusion.py		sample_diffusion.py
setup.py		setup.py
train_searcher.py		train_searcher.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Diffusion Prism: Enhancing Diversity and Morphology Consistency in Mask-to-Image Diffusion

Introduction

Quick Tutorial

Sample Dataset

Key Features

Methodology

Experimental Results

Citation

About

Uh oh!

Releases

Packages

Languages

AIS-Clemson/diffusion_prism

Folders and files

Latest commit

History

Repository files navigation

Diffusion Prism: Enhancing Diversity and Morphology Consistency in Mask-to-Image Diffusion

Introduction

Quick Tutorial

Sample Dataset

Key Features

Methodology

Experimental Results

Citation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages