
CAP6412

Advanced Computer Vision


Mubarak Shah
[email protected]
HEC-245
Lecture 4: Diffusion Models



Diffusion models in vision: A survey
https://arxiv.org/pdf/2209.04747.pdf

Alin Croitoru, Vlad Hondru, Radu Tudor Ionescu (University of Bucharest, Romania)
Mubarak Shah (University of Central Florida, US)
Agenda

1. Motivation
2. High-level overview
3. Denoising diffusion probabilistic models
4. Noise Conditioned Score Network
5. Conditional Generation
6. Stochastic Differential Equations
7. Applications
8. Research directions
Outline

1. Motivation
2. High-level overview
3. Denoising diffusion probabilistic models
4. Noise Conditioned Score Network
5. Conditional Generation
6. Stochastic Differential Equations
7. Applications
8. Research directions
Motivation

[Figure: images generated by text-to-image diffusion models, from the following prompts:]

• "A hedgehog using a calculator."
• "A corgi wearing a red bowtie and a purple party hat."
• "A transparent sculpture of a duck made out of glass."
• "A photo of a Corgi dog riding a bike in Times Square. It is wearing sunglasses and a beach hat."
• "Pomeranian king with tiger soldiers."
• "Zebras roaming in the field."
Outline

1. Motivation
2. High-level overview
3. Denoising diffusion probabilistic models
4. Noise Conditioned Score Network
5. Conditional Generation
6. Stochastic Differential Equations
7. Applications
8. Research directions
High-level overview

• Diffusion models are probabilistic models used for image generation
• They involve reversing the process of gradually degrading the data
• They consist of two processes:
  – The forward process: data is progressively destroyed by adding noise across multiple time steps
  – The reverse process: using a neural network, noise is sequentially removed to recover the original data

[Diagram: the forward process maps the data distribution to a standard Gaussian; the reverse process maps it back.]
High-level overview

• Three categories:
  – Denoising Diffusion Probabilistic Models (DDPM)
  – Noise Conditioned Score Networks (NCSN)
  – Stochastic Differential Equations (SDE)


Outline

1. Motivation
2. High-level overview
3. Denoising diffusion probabilistic models
4. Noise Conditioned Score Network
5. Conditional Generation
6. Stochastic Differential Equations
7. Applications
8. Research directions
Notations

$p(x)$ – the data distribution

$\mathcal{N}(x;\, \mu,\, \sigma^2 \cdot I)$ – Gaussian distribution over the random variable (image) $x$, with mean vector $\mu$ and covariance matrix $\sigma^2 I$, where $I$ is the identity matrix

$x = \mu + \sigma \cdot z, \quad z \sim \mathcal{N}(0, I)$ – a sample from this distribution (the reparameterization trick)
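As a quick illustration of this reparameterization, here is a minimal NumPy sketch (function and variable names are our own, not from the lecture):

    import numpy as np

    def sample_gaussian(mu, sigma):
        # x = mu + sigma * z, with z ~ N(0, I)
        z = np.random.randn(*mu.shape)
        return mu + sigma * z

    # e.g., a 3x3 "image" with mean 0.5 and standard deviation 0.1
    x = sample_gaussian(np.full((3, 3), 0.5), 0.1)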


Denoising Diffusion Probabilistic Models (DDPMs)

Forward process: $x_0 \sim p(x_0) \;\to\; x_1 \;\to\; \dots \;\to\; x_T \sim \mathcal{N}(0, I)$
Denoising Diffusion Probabilistic Models (DDPMs)

Reverse process: $x_T \sim \mathcal{N}(0, I) \;\to\; x_{T-1} \;\to\; \dots \;\to\; x_0 \sim p(x_0)$


Denoising Diffusion Probabilistic Models (DDPMs)

Forward process (iterative): the image is gradually replaced with noise.

$x_0 \sim p(x_0), \qquad x_t \sim p(x_t \mid x_{t-1}) = \mathcal{N}\big(x_t;\, \sqrt{1-\beta_t}\, x_{t-1},\, \beta_t I\big), \qquad \beta_t \ll 1,\ t = \overline{1, T}$

$x_0 \to x_1 \to \dots \to x_{T-1} \to x_T$
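A minimal sketch of one iterative forward step, assuming the image is a NumPy array and beta_t a small scalar (all names are ours):

    import numpy as np

    def forward_step(x_prev, beta_t):
        # p(x_t | x_{t-1}) = N(x_t; sqrt(1 - beta_t) * x_{t-1}, beta_t * I)
        z = np.random.randn(*x_prev.shape)
        return np.sqrt(1.0 - beta_t) * x_prev + np.sqrt(beta_t) * z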
Denoising Diffusion Probabilistic Models (DDPMs)

Forward process, ancestral sampling (one shot):

$x_t \sim p(x_t \mid x_0) = \mathcal{N}\big(x_t;\, \sqrt{\hat{\beta}_t} \cdot x_0,\, (1-\hat{\beta}_t)\, I\big)$

Notations: $\alpha_t = 1 - \beta_t, \qquad \hat{\beta}_t = \prod_{i=1}^{t} \alpha_i$
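The same corruption can be applied in one shot via the cumulative products above; a sketch, assuming a linear beta schedule (the schedule choice is an assumption, not from these slides):

    import numpy as np

    T = 1000
    betas = np.linspace(1e-4, 0.02, T)        # assumed linear schedule
    alphas = 1.0 - betas                      # alpha_t = 1 - beta_t
    beta_hat = np.cumprod(alphas)             # beta_hat[t] = product of alpha_i up to t (0-indexed)

    def forward_oneshot(x0, t):
        # p(x_t | x_0) = N(x_t; sqrt(beta_hat_t) * x_0, (1 - beta_hat_t) * I)
        z = np.random.randn(*x0.shape)
        return np.sqrt(beta_hat[t]) * x0 + np.sqrt(1.0 - beta_hat[t]) * z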
DDPMs. Properties of the forward process

1. $\beta_t \ll 1,\ t = \overline{1, T}$

$x_t \sim p(x_t \mid x_{t-1}) = \mathcal{N}\big(x_t;\, \sqrt{1-\beta_t}\, x_{t-1},\, \beta_t I\big)$

$x_t$ is created from $x_{t-1}$ with a small step whose size is modeled by $\beta_t$. Hence $x_{t-1}$ comes from a region close to $x_t$, and therefore we can model the reverse step with a Gaussian as well:

$x_{t-1} \sim p(x_{t-1} \mid x_t) = \mathcal{N}\big(x_{t-1};\, \mu(x_t, t),\, \Sigma(x_t, t)\big)$
DDPMs. Properties of the forward process

1. $\beta_t \ll 1,\ t = \overline{1, T}$

$x_t \sim p(x_t \mid x_{t-1}) = \mathcal{N}\big(x_t;\, \sqrt{1-\beta_t}\, x_{t-1},\, \beta_t I\big)$

Without the small-step assumption, we are less certain where $x_{t-1}$ was, because we could have reached $x_t$ from many more regions.
DDPMs. Properties of the forward process

1. $\beta_t \ll 1,\ t = \overline{1, T}$ $\;\Longrightarrow\;$ 2. $T$ is large

After $T$ iterations, $x_T$ is pure noise: $x_0 \;\xrightarrow{\ T\ \text{iterations}\ }\; x_T$
DDPMs. Training objective

Remember that in the reverse process ($x_T \to \dots \to x_0$), each step is approximated by a neural network with weights $\theta$:

$p(x_{t-1} \mid x_t) \approx p_\theta(x_{t-1} \mid x_t) = \mathcal{N}\big(x_{t-1};\, \mu_\theta(x_t, t),\, \Sigma_\theta(x_t, t)\big)$
DDPMs. Training objective

Simplification: fix the variance instead of learning it, and predict/learn only the mean:

$p(x_{t-1} \mid x_t) \approx p_\theta(x_{t-1} \mid x_t) = \mathcal{N}\big(x_{t-1};\, \mu_\theta(x_t, t),\, \sigma_t^2\, I\big)$
DDPMs. Training objective

A U-Net-like neural network takes the noisy image $x_t$ and the time step $t$, and outputs $\mu_\theta(x_t, t)$; the next image is then sampled as

$x_{t-1} \sim \mathcal{N}\big(x_{t-1};\, \mu_\theta(x_t, t),\, \sigma_t^2\, I\big)$

Slide from: Denoising Diffusion-based Generative Modeling: Foundations and Applications – Karsten Kreis, Ruiqi Gao, Arash Vahdat
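The lecture uses a U-Net; as a stand-in, here is a deliberately tiny PyTorch sketch of a time-conditioned denoiser (a toy convolutional network of our own, not the actual architecture) mapping $(x_t, t)$ to an output with the same shape as $x_t$:

    import torch
    import torch.nn as nn

    class TinyDenoiser(nn.Module):
        """Toy stand-in for the U-Net: predicts a tensor shaped like x_t from (x_t, t)."""
        def __init__(self, channels=1, hidden=64, T=1000):
            super().__init__()
            self.t_embed = nn.Embedding(T, hidden)       # simple learned time-step embedding
            self.inp = nn.Conv2d(channels, hidden, 3, padding=1)
            self.mid = nn.Conv2d(hidden, hidden, 3, padding=1)
            self.out = nn.Conv2d(hidden, channels, 3, padding=1)

        def forward(self, x_t, t):
            h = torch.relu(self.inp(x_t))
            h = h + self.t_embed(t)[:, :, None, None]    # inject the time step
            h = torch.relu(self.mid(h))
            return self.out(h)

    model = TinyDenoiser()
    x_t = torch.randn(8, 1, 28, 28)                      # a batch of noisy "images"
    t = torch.randint(0, 1000, (8,))
    out = model(x_t, t)                                  # same shape as x_t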
DDPMs. Training Objective

Cross entropy and KL (Kullback-Leibler) divergence:

• Entropy: $E(P) = -\sum_i P(i)\,\log P(i)$
• Cross entropy: $C(P, Q) = -\sum_i P(i)\,\log Q(i)$
• KL divergence: $D_{KL}(P \,\|\, Q) = \sum_i P(i)\,\log\frac{P(i)}{Q(i)} = \sum_i P(i)\,[\log P(i) - \log Q(i)]$

Slides from Ming Li, University of Waterloo, CS 886 Deep Learning and NLP
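A small NumPy check (with example distributions of our own) that these quantities relate as expected, i.e. $D_{KL}(P \| Q) = C(P, Q) - E(P)$:

    import numpy as np

    P = np.array([0.7, 0.2, 0.1])               # example discrete distributions
    Q = np.array([0.5, 0.3, 0.2])

    entropy = -np.sum(P * np.log(P))            # E(P)
    cross_entropy = -np.sum(P * np.log(Q))      # C(P, Q)
    kl = np.sum(P * np.log(P / Q))              # D_KL(P || Q)

    assert np.isclose(kl, cross_entropy - entropy)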
DDPMs. Training Objective

$\min_\theta\ \mathbb{E}_{x_0 \sim p(x_0)}\Big[-\log p_\theta(x_0 \mid x_1) \;+\; KL\big(p(x_T \mid x_0)\,\|\,p(x_T)\big) \;+\; \sum_{t=2}^{T} KL\big(p(x_{t-1} \mid x_t, x_0)\,\|\,p_\theta(x_{t-1} \mid x_t)\big)\Big]$

• The middle term can be ignored because $p(x_T)$ is $\mathcal{N}(0, I)$ and does not depend on $\theta$.
• The last term pushes $p_\theta(x_{t-1} \mid x_t)$, at each time step $t$, to be as close as possible to the true posterior of the forward process when conditioned on the original image.
DDPMs. Training Objective. Simplifications

$\min_\theta\ \mathbb{E}_{x_0 \sim p(x_0)}\Big[-\log p_\theta(x_0 \mid x_1) \;+\; \sum_{t=2}^{T} KL\big(p(x_{t-1} \mid x_t, x_0)\,\|\,p_\theta(x_{t-1} \mid x_t)\big)\Big]$

• The KL divergence between two Gaussians with fixed variances reduces to the L2 distance between their means.
• The first term measures the reconstruction error and can be addressed with an independent decoder.
• The DDPM paper introduced two simplifications that lead to a much simpler objective, based on the noise in the image.
DDPMs. Training Objective. Simplifications

Tractable posterior (the target of the KL term above):

$p(x_{t-1} \mid x_t, x_0) = \mathcal{N}\big(x_{t-1};\, \tilde{\mu}(x_t, x_0),\, \tilde{\beta}_t\, I\big)$

$\tilde{\mu}(x_t, x_0) = \frac{1}{\sqrt{\alpha_t}}\Big(x_t - \frac{1-\alpha_t}{\sqrt{1-\hat{\beta}_t}}\, z_t\Big), \qquad z_t \sim \mathcal{N}(0, I)$

Notations: $\alpha_t = 1 - \beta_t, \qquad \hat{\beta}_t = \prod_{i=1}^{t} \alpha_i, \qquad \tilde{\beta}_t = \frac{1-\hat{\beta}_{t-1}}{1-\hat{\beta}_t} \cdot \beta_t$
DDPMs. Training Objective. Simplifications

First simplification: give $\mu_\theta$ the same form as $\tilde{\mu}$, with a network $z_\theta(x_t, t)$ that predicts the noise:

$\mu_\theta(x_t, t) = \frac{1}{\sqrt{\alpha_t}}\Big(x_t - \frac{1-\alpha_t}{\sqrt{1-\hat{\beta}_t}}\, z_\theta(x_t, t)\Big)$

$\Rightarrow\ KL\big(p(x_{t-1} \mid x_t, x_0)\,\|\,p_\theta(x_{t-1} \mid x_t)\big) = \mathbb{E}_{z_t \sim \mathcal{N}(0, I)}\Big[\frac{\beta_t^2}{2\sigma_t^2\, \alpha_t\, (1-\hat{\beta}_t)}\, \big\|z_t - z_\theta(x_t, t)\big\|_2^2\Big]$
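To make the schedule notations above concrete, a short NumPy sketch computing $\alpha_t$, $\hat{\beta}_t$ and $\tilde{\beta}_t$ from an assumed linear $\beta_t$ schedule:

    import numpy as np

    T = 1000
    betas = np.linspace(1e-4, 0.02, T)                      # assumed linear schedule
    alphas = 1.0 - betas                                    # alpha_t = 1 - beta_t
    beta_hat = np.cumprod(alphas)                           # beta_hat_t = product of alpha_i up to t

    # beta_tilde_t = (1 - beta_hat_{t-1}) / (1 - beta_hat_t) * beta_t,
    # with the empty product beta_hat_0 := 1
    beta_hat_prev = np.concatenate(([1.0], beta_hat[:-1]))
    beta_tilde = (1.0 - beta_hat_prev) / (1.0 - beta_hat) * betas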
DDPMs. Training Objective. Simplifications

Second simplification: the weighting factor $\frac{\beta_t^2}{2\sigma_t^2\, \alpha_t\, (1-\hat{\beta}_t)}$ in front of the squared error is ignored, leaving a plain noise-prediction loss:

$KL\big(p(x_{t-1} \mid x_t, x_0)\,\|\,p_\theta(x_{t-1} \mid x_t)\big)\ \propto\ \mathbb{E}_{z_t \sim \mathcal{N}(0, I)}\big[\,\|z_t - z_\theta(x_t, t)\|_2^2\,\big]$
DDPMs. Training Algorithm

$\min_\theta\ \mathbb{E}_{x_0 \sim p(x_0),\ t \sim \mathcal{U}(\{1,\dots,T\}),\ z_t \sim \mathcal{N}(0, I)}\big[\,\|z_t - z_\theta(x_t, t)\|_2^2\,\big]$

Training algorithm (with $\hat{\beta}_t = \prod_{i=1}^{t} \alpha_i$):

Repeat
    $x_0 \sim p(x_0)$
    $t \sim \mathcal{U}(\{1, \dots, T\})$
    $z_t \sim \mathcal{N}(0, I)$
    $x_t = \sqrt{\hat{\beta}_t} \cdot x_0 + \sqrt{1-\hat{\beta}_t}\, z_t$
    $\theta = \theta - lr \cdot \nabla_\theta \mathcal{L}$
Until convergence
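A hedged PyTorch sketch of one step of this loop; the model is any noise-prediction network such as the toy one above, and the schedule, optimizer and names are illustrative assumptions:

    import torch

    def train_step(model, opt, x0, betas, beta_hat):
        # One gradient step on ||z - z_theta(x_t, t)||^2 (t is 0-indexed here)
        T = betas.shape[0]
        t = torch.randint(0, T, (x0.shape[0],))              # t ~ U{1, ..., T}
        z = torch.randn_like(x0)                             # z ~ N(0, I)
        bh = beta_hat[t][:, None, None, None]                # broadcast over image dims
        x_t = bh.sqrt() * x0 + (1.0 - bh).sqrt() * z         # one-shot forward sample
        loss = ((z - model(x_t, t)) ** 2).mean()
        opt.zero_grad()
        loss.backward()
        opt.step()
        return loss.item()

    # assumed usage:
    # betas = torch.linspace(1e-4, 0.02, 1000)
    # beta_hat = torch.cumprod(1.0 - betas, dim=0)
    # opt = torch.optim.Adam(model.parameters(), lr=2e-4)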
DDPMs. Sampling

• Pass the current noisy image $x_t$, along with $t$, to the neural network to obtain $z_\theta(x_t, t)$
• From the predicted noise, compute the mean of the Gaussian distribution
DDPMs. Sampling

Sample the image for the next iteration:

$x_{t-1} \sim \mathcal{N}\Big(x_{t-1};\ \underbrace{\frac{1}{\sqrt{\alpha_t}}\Big(x_t - \frac{1-\alpha_t}{\sqrt{1-\hat{\beta}_t}}\, z_\theta(x_t, t)\Big)}_{\mu_\theta(x_t,\, t)},\ \sigma_t^2\, I\Big)$
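Finally, a matching sketch of the full sampling loop, with the assumed choice $\sigma_t^2 = \beta_t$ (one common option; the lecture only fixes the variance without specifying it):

    import torch

    @torch.no_grad()
    def sample(model, betas, shape=(1, 1, 28, 28)):
        # betas: 1-D tensor, e.g. torch.linspace(1e-4, 0.02, T)
        T = betas.shape[0]
        beta_hat = torch.cumprod(1.0 - betas, dim=0)
        x = torch.randn(shape)                                    # x_T ~ N(0, I)
        for t in reversed(range(T)):
            t_batch = torch.full((shape[0],), t, dtype=torch.long)
            z_pred = model(x, t_batch)                            # z_theta(x_t, t)
            coef = betas[t] / (1.0 - beta_hat[t]).sqrt()          # (1 - alpha_t) / sqrt(1 - beta_hat_t)
            mean = (x - coef * z_pred) / (1.0 - betas[t]).sqrt()  # mu_theta(x_t, t)
            noise = torch.randn_like(x) if t > 0 else torch.zeros_like(x)
            x = mean + betas[t].sqrt() * noise                    # sigma_t^2 = beta_t (assumption)
        return x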
Thank You
