Clustering and
Gaussian Mixture Model
Dr. Sayak Roychowdhury
Department of Industrial & Systems Engineering,
IIT Kharagpur
Reference
• Bishop, C. M. (2006). Pattern Recognition and Machine Learning. Springer.
Application of K-means Clustering
• Image segmentation and compression
• The goal of segmentation is to partition an image into regions each of which has a
reasonably homogeneous visual appearance or which corresponds to objects or
parts of objects
• Each pixel in an image is a point in a 3-dimensional space comprising the intensities
of the RGB channels
• After running K-means to convergence for a particular value of K, the image can be re-drawn by replacing each pixel vector with the {R, G, B} intensity triplet of the centre 𝜇𝑘 to which that pixel has been assigned
• Data compression: K-means can be used for lossy data compression
• Each data point is approximated by its nearest cluster centre 𝜇𝑘
• This framework is often called vector quantization, and the vectors 𝜇𝑘 are called code-book vectors (a short code sketch follows below)
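As an illustration of the vector-quantization idea above, here is a minimal sketch of K-means image compression. It assumes scikit-learn and NumPy are available and that `image` is an (H, W, 3) RGB array; the function name and these choices are illustrative, not part of the original slides.

```python
import numpy as np
from sklearn.cluster import KMeans

def compress_image(image: np.ndarray, K: int) -> np.ndarray:
    """Replace each pixel by the RGB triplet of its nearest K-means centre."""
    h, w, c = image.shape                      # expect an (H, W, 3) RGB array
    pixels = image.reshape(-1, c).astype(float)

    km = KMeans(n_clusters=K, n_init=10, random_state=0).fit(pixels)
    codebook = km.cluster_centers_             # the K "code-book vectors" mu_k
    labels = km.labels_                        # index of the nearest centre per pixel

    # Each pixel is approximated by its code-book vector (lossy compression).
    return codebook[labels].reshape(h, w, c).astype(image.dtype)
```

Only the K code-book vectors and one cluster index per pixel need to be stored, which is where the (lossy) compression comes from.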
Image Segmentation with K-means
[Figure: image segmentation with K-means.]
Bishop, C. M. (2006). Pattern Recognition and Machine Learning. Springer.
Gaussian Distribution
• Univariate Gaussian Distribution:
• f(x \mid \mu, \sigma) = \frac{1}{\sigma \sqrt{2\pi}} \exp\left( -\frac{(x - \mu)^2}{2\sigma^2} \right)
• Multivariate Gaussian Distribution:
f(x \mid \mu, \Sigma) = \frac{1}{(2\pi)^{p/2} \, |\Sigma|^{1/2}} \exp\left( -\frac{1}{2} (x - \mu)^T \Sigma^{-1} (x - \mu) \right)
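As a quick numerical check of the density above, the following sketch evaluates the multivariate Gaussian both via scipy.stats and directly from the formula; the library choice and variable names are assumptions made for this example.

```python
import numpy as np
from scipy.stats import multivariate_normal

mu = np.array([0.0, 0.0])                 # mean vector (p = 2 here)
Sigma = np.array([[1.0, 0.5],
                  [0.5, 2.0]])            # covariance matrix (symmetric, positive definite)
x = np.array([1.0, -0.5])

# Density from scipy: f(x | mu, Sigma)
pdf_value = multivariate_normal(mean=mu, cov=Sigma).pdf(x)

# The same value computed directly from the formula, as a check.
p = len(mu)
diff = x - mu
direct = np.exp(-0.5 * diff @ np.linalg.solve(Sigma, diff)) / np.sqrt((2 * np.pi) ** p * np.linalg.det(Sigma))
print(pdf_value, direct)                  # the two numbers agree
```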
Gaussian Mixture
[Figure: data points generated from 3 Gaussian distributions, and the clustering obtained from the estimated posterior probabilities of the clusters using a GMM.]
Bishop, C. M. (2006). Pattern Recognition and Machine Learning. Springer.
Maximum Likelihood for Parameter Estimation
\ln f_k(x \mid \mu_k, \Sigma_k) = -\frac{1}{2} \ln |\Sigma_k| - \frac{1}{2} (x - \mu_k)^T \Sigma_k^{-1} (x - \mu_k) - \frac{p}{2} \ln(2\pi)
Differentiating and equating to 0:
\hat{\mu}_k = \frac{\sum_{g_i = k} x_i}{N_k}
\hat{\Sigma}_k = \frac{\sum_{g_i = k} (x_i - \hat{\mu}_k)(x_i - \hat{\mu}_k)^T}{N_k}
where N_k is the number of data points in the k-th cluster and g_i denotes the cluster to which x_i is assigned
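A minimal sketch of these closed-form estimates, assuming the data are in a NumPy array `X` of shape (N, p) and the hard cluster assignments g_i are in an integer array `g` (the names are hypothetical):

```python
import numpy as np

def mle_per_cluster(X: np.ndarray, g: np.ndarray, K: int):
    """Maximum-likelihood mean and covariance for each cluster, given hard labels g."""
    means, covs = [], []
    for k in range(K):
        Xk = X[g == k]                      # points assigned to cluster k
        Nk = len(Xk)
        mu_k = Xk.mean(axis=0)              # mu_hat_k = sum_{g_i=k} x_i / N_k
        diff = Xk - mu_k
        Sigma_k = diff.T @ diff / Nk        # Sigma_hat_k = sum (x_i - mu)(x_i - mu)^T / N_k
        means.append(mu_k)
        covs.append(Sigma_k)
    return np.array(means), np.array(covs)
```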
Gaussian Mixture
• Linear superposition of Gaussians:
f(x) = \sum_{k=1}^{K} w_k \, \mathcal{N}(x \mid \mu_k, \Sigma_k)
• Normalization and positivity of the weights (mixing coefficients):
0 \le w_k \le 1, \qquad \sum_{k=1}^{K} w_k = 1
• Log-likelihood:
\ln f(X \mid \mu, \Sigma, W) = \sum_{i=1}^{N} \ln f(x_i) = \sum_{i=1}^{N} \ln \sum_{k=1}^{K} w_k \, \mathcal{N}(x_i \mid \mu_k, \Sigma_k)
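The log-likelihood above can be evaluated directly. A small sketch, assuming the mixture parameters are held in arrays `w` (mixing coefficients), `mus` (means) and `Sigmas` (covariances), with scipy.stats supplying the Gaussian densities (all of these are illustrative choices):

```python
import numpy as np
from scipy.stats import multivariate_normal

def gmm_log_likelihood(X, w, mus, Sigmas):
    """ln f(X) = sum_i ln sum_k w_k N(x_i | mu_k, Sigma_k)."""
    K = len(w)
    # dens[i, k] = N(x_i | mu_k, Sigma_k)
    dens = np.column_stack([
        multivariate_normal(mean=mus[k], cov=Sigmas[k]).pdf(X) for k in range(K)
    ])
    return np.sum(np.log(dens @ w))
```

In practice a log-sum-exp formulation is preferred to avoid numerical underflow when the individual densities are very small.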
Responsibilities
• The mixing coefficients can be thought of as prior probabilities of the components
• For a given value of x, the posterior probabilities of the components, also called “responsibilities”, can be calculated
• Using Bayes' rule:
\gamma_k(x) = f(k \mid x) = \frac{f(x \mid k)\, f(k)}{f(x)} = \frac{w_k f_k(x)}{\sum_l w_l f_l(x)} = \frac{w_k \, \mathcal{N}(x \mid \mu_k, \Sigma_k)}{\sum_{l=1}^{K} w_l \, \mathcal{N}(x \mid \mu_l, \Sigma_l)}
where w_k = N_k / N
\gamma_k(x) is the posterior probability (i.e. the expected value) of the latent indicator variable z_k associated with component k.
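A minimal sketch of the responsibility computation, using the same hypothetical `w`, `mus`, `Sigmas` arrays as in the previous sketch; this is exactly the E-step of the EM algorithm described next.

```python
import numpy as np
from scipy.stats import multivariate_normal

def responsibilities(X, w, mus, Sigmas):
    """gamma[i, k] = w_k N(x_i | mu_k, Sigma_k) / sum_l w_l N(x_i | mu_l, Sigma_l)."""
    K = len(w)
    weighted = np.column_stack([
        w[k] * multivariate_normal(mean=mus[k], cov=Sigmas[k]).pdf(X) for k in range(K)
    ])
    return weighted / weighted.sum(axis=1, keepdims=True)
```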
Expectation Maximization (EM) Algorithm
• The EM algorithm is an iterative optimization technique
• Expectation step (E-step): for the given parameter values, compute the expected values of the latent variables
• Maximization step (M-step): update the parameters of the model based on the computed expected values of the latent variables
Expectation Maximization (EM) Algorithm
• Given a Gaussian mixture model, the goal is to maximize the likelihood function with respect to the means, the covariances and the mixing coefficients
• Initialize 𝜇𝑘, Σ𝑘 and the mixing coefficients 𝑤𝑘, and evaluate the initial log-likelihood
• Expectation step: Evaluate responsibilities using current parameter
values:
\gamma_k(x) = \frac{w_k \, \mathcal{N}(x \mid \mu_k, \Sigma_k)}{\sum_{l=1}^{K} w_l \, \mathcal{N}(x \mid \mu_l, \Sigma_l)}
Expectation Maximization (EM) Algorithm
• Maximization step: Reestimate the parameters using current
responsibilities:
\mu_k^{new} = \frac{1}{N_k} \sum_{n=1}^{N} \gamma(z_{nk})\, x_n, \quad \text{where } N_k = \sum_{n=1}^{N} \gamma(z_{nk})
• The mean 𝜇𝑘 for the kth Gaussian component is obtained by taking a
weighted mean of all of the points in the data set, in which the
weighting factor for data point 𝒙𝒏 is given by the posterior probability
𝛾 𝑧𝑛𝑘 that component k was responsible for generating 𝒙𝒏 .
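As a sketch, this weighted-mean update can be written compactly with a responsibility matrix `gamma` of shape (N, K), as returned by the earlier `responsibilities` helper (both names are hypothetical):

```python
import numpy as np

def update_means(X, gamma):
    """mu_k_new = (1/N_k) * sum_n gamma[n, k] * x_n, with N_k = sum_n gamma[n, k]."""
    Nk = gamma.sum(axis=0)                  # effective number of points per component
    return (gamma.T @ X) / Nk[:, None]      # array of shape (K, p)
```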
Expectation Maximization (EM) Algorithm
• Setting derivative of ln 𝑓(𝑋|𝜇, Σ, 𝑊) equal to 0 w.r.t. Σ𝑘
• \Sigma_k^{new} = \frac{1}{N_k} \sum_{n=1}^{N} \gamma(z_{nk}) (x_n - \mu_k^{new})(x_n - \mu_k^{new})^T
• Finally, maximize \ln f(X \mid \mu, \Sigma, W) with respect to w_k subject to the constraint
\sum_{k=1}^{K} w_k = 1
• This can be achieved using a Lagrange multiplier, maximizing
\ln f(X \mid \mu, \Sigma, W) + \lambda \left( \sum_{k=1}^{K} w_k - 1 \right)
resulting in w_k^{new} = \frac{N_k}{N}, where N_k = \sum_{n=1}^{N} \gamma(z_{nk})
• Evaluate the log-likelihood \ln f(X \mid \mu, \Sigma, W) = \sum_{i=1}^{N} \ln \sum_{k=1}^{K} w_k \, \mathcal{N}(x_i \mid \mu_k, \Sigma_k)
• Iterate through the E-step and M-step until the log-likelihood (or the parameters) converges.
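Putting the pieces together, here is a compact sketch of one full EM pass, reusing the hypothetical `responsibilities` helper defined earlier; the update formulas are the ones given above.

```python
import numpy as np

def em_step(X, w, mus, Sigmas):
    """One EM iteration for a Gaussian mixture: returns updated parameters."""
    N, p = X.shape
    gamma = responsibilities(X, w, mus, Sigmas)      # E-step (see the sketch above)
    Nk = gamma.sum(axis=0)                           # N_k = sum_n gamma(z_nk)

    mus_new = (gamma.T @ X) / Nk[:, None]            # weighted means
    Sigmas_new = np.empty((len(w), p, p))
    for k in range(len(w)):
        diff = X - mus_new[k]
        Sigmas_new[k] = (gamma[:, k, None] * diff).T @ diff / Nk[k]
    w_new = Nk / N                                   # updated mixing coefficients

    return w_new, mus_new, Sigmas_new
```

Iterating `em_step` and monitoring the log-likelihood until its change falls below a tolerance reproduces the procedure described above.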
Expectation Maximization (EM)
[Figure: illustration of EM iterations on a Gaussian mixture.]
Bishop, C. M. (2006). Pattern Recognition and Machine Learning. Springer.
EM Algorithm
• Since K-means is faster, it is common to run the K-means algorithm to
find a suitable initialization for a Gaussian mixture model that is
subsequently adapted using EM.
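One possible way to realize this initialization, sketched with scikit-learn's KMeans: the hard K-means assignments supply starting means, covariances and mixing coefficients for EM (the library choice, the function name and the small ridge added to each covariance are assumptions of this sketch).

```python
import numpy as np
from sklearn.cluster import KMeans

def init_from_kmeans(X, K):
    """Initialize GMM parameters (w, mus, Sigmas) from a K-means clustering of X."""
    labels = KMeans(n_clusters=K, n_init=10, random_state=0).fit_predict(X)
    N, p = X.shape
    w = np.array([np.mean(labels == k) for k in range(K)])                 # cluster fractions
    mus = np.array([X[labels == k].mean(axis=0) for k in range(K)])        # cluster means
    # Per-cluster covariances, with a small ridge to keep them invertible.
    Sigmas = np.array([np.cov(X[labels == k], rowvar=False) + 1e-6 * np.eye(p)
                       for k in range(K)])
    return w, mus, Sigmas
```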