Gaussian Mixture Models Explained

From intuition to implementation

Oscar Contreras Carrasco
Jun 3 · 12 min read
In the world of Machine Learning, we can distinguish two main areas: supervised and unsupervised learning. The main difference between the two lies in the nature of the data as well as the approaches used to deal with it. Clustering is an unsupervised learning problem in which we intend to find clusters of points in our dataset that share some common characteristics. Let's suppose we have a dataset that looks like this:
[Figure: scatter plot of the example dataset]

. . .
Definitions
A Gaussian mixture is a function composed of several Gaussians, each identified by k ∈ {1, …, K}, where K is the number of clusters in our dataset. Each Gaussian k in the mixture is described by the following parameters:

• A mean μ that defines its centre.
• A covariance Σ that defines its width. This would be equivalent to the dimensions of an ellipsoid in a multivariate scenario.
• A mixing probability π that defines how big or small the Gaussian function will be.
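For reference, these three parameters combine into the mixture density in the usual way (standard textbook notation, as in [1], rather than a formula reproduced from the figures below):

p(x) = \sum_{k=1}^{K} \pi_k \, \mathcal{N}(x \mid \mu_k, \Sigma_k), \qquad \sum_{k=1}^{K} \pi_k = 1, \quad 0 \le \pi_k \le 1

Each π_k acts as the weight of its Gaussian in the overall density, which is why it controls "how big or small" that component is.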
Let us now illustrate these parameters graphically:

[Figure: three Gaussian densities plotted over the clustered data]

Here, we can see that there are three Gaussian functions, hence K = 3.
. . .
Initial derivations
We are now going to introduce some additional notation. Just a word of warning: math is coming! Don't worry, I'll try to keep the notation as clean as possible so the derivations are easier to follow. First, let's suppose we want to know the probability that a data point x_n comes from Gaussian k. We can express this as:

p(z_nk = 1 | x_n)

Which reads "given a data point x_n, what is the probability it came from Gaussian k?" In this case, z is a latent variable that takes only two possible values: it is one when x came from Gaussian k, and zero otherwise. We never actually get to observe this z variable, but knowing its probability of occurrence will be useful in helping us determine the Gaussian mixture parameters, as we discuss later.
Likewise, we can state the following:

π_k = p(z_k = 1)

Which means that the overall probability of observing a point that comes from Gaussian k is actually equivalent to the mixing coefficient for that Gaussian.
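Combining the two quantities above with Bayes' theorem gives the posterior probability of the latent variable, usually called the responsibility (again written in the standard notation of [1], not copied from the article's figures):

\gamma(z_{nk}) = p(z_k = 1 \mid x_n) = \frac{\pi_k \, \mathcal{N}(x_n \mid \mu_k, \Sigma_k)}{\sum_{j=1}^{K} \pi_j \, \mathcal{N}(x_n \mid \mu_j, \Sigma_j)}

This is the quantity the expectation step of EM evaluates below.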
. . .
Expectation-Maximization algorithm
Well, at this point we have derived some expressions for the probabilities that will be useful in determining the parameters of our model. However, in the previous section we saw that simply evaluating (3) to find such parameters would prove to be very hard. Fortunately, there is an iterative method we can use for this purpose: the Expectation-Maximization, or simply EM, algorithm. It is widely used for optimization problems where the objective function has complexities such as the one we've just encountered in the GMM case.
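To see where that difficulty comes from, it helps to write down the log-likelihood of the dataset under the mixture model (standard form, following [1]):

\ln p(X \mid \pi, \mu, \Sigma) = \sum_{n=1}^{N} \ln \left( \sum_{k=1}^{K} \pi_k \, \mathcal{N}(x_n \mid \mu_k, \Sigma_k) \right)

The inner sum over k sits inside the logarithm, so setting the derivatives with respect to μ, Σ and π to zero does not yield closed-form solutions: each condition still involves the responsibilities, which themselves depend on the parameters. EM breaks this circularity by alternating between the two.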
Let the parameters of our model be

θ = {π, μ, Σ}

Let us now define the steps that the general EM algorithm will follow¹.

Step 1: Initialise θ accordingly. For instance, we can use the results obtained by a previous K-Means run as a good starting point for our algorithm.

Step 2 (Expectation step): Evaluate the responsibilities γ(z_nk) using the current parameter values.
. . .
Implementation in Python
Just as a side note, the full implementation is available as a
Jupyter notebook at https://bit.ly/2MpiZp4
I have used the Iris dataset for this exercise, mainly for simplicity and fast training. From our previous derivations, we stated that the EM algorithm follows an iterative approach to find the parameters of a Gaussian Mixture Model. Our first step was to initialise these parameters; in this case, we can use the output of a K-means run for this purpose. The Python code for this would look like:
import numpy as np
from sklearn.cluster import KMeans

def initialize_clusters(X, n_clusters):
    """Initialise one parameter dictionary per Gaussian, seeded by K-means."""
    clusters = []

    # Run K-means once and use its centroids as the initial means
    kmeans = KMeans(n_clusters=n_clusters).fit(X)
    mu_k = kmeans.cluster_centers_

    for i in range(n_clusters):
        clusters.append({
            'pi_k': 1.0 / n_clusters,                           # uniform mixing weights
            'mu_k': mu_k[i],                                    # K-means centroid
            'cov_k': np.identity(X.shape[1], dtype=np.float64)  # identity covariance
        })

    return clusters
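As a quick sanity check, here is one way to exercise the function on the Iris data mentioned above (this snippet is mine, not part of the linked notebook):

from sklearn.datasets import load_iris

X = load_iris().data                    # 150 samples, 4 features
clusters = initialize_clusters(X, 3)    # Iris has three species, so K = 3

for c in clusters:
    print(c['pi_k'], c['mu_k'])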
Next, we execute the expectation step. Here we calculate the responsibility γ(z_nk) of each Gaussian k for every data point x_n, using the current parameter values.
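A minimal sketch of what this expectation step can look like, reusing the cluster dictionaries built above; the helper name expectation_step and the 'gamma_nk' key are my own choices and not necessarily those of the linked notebook:

import numpy as np
from scipy.stats import multivariate_normal

def expectation_step(X, clusters):
    # Unnormalised responsibilities: pi_k * N(x_n | mu_k, cov_k) for every point
    totals = np.zeros((X.shape[0], 1), dtype=np.float64)
    for cluster in clusters:
        gamma_nk = cluster['pi_k'] * multivariate_normal.pdf(
            X, mean=cluster['mu_k'], cov=cluster['cov_k'])
        cluster['gamma_nk'] = gamma_nk.reshape(-1, 1)
        totals += cluster['gamma_nk']
    # Normalise so that the K responsibilities of each point sum to one
    for cluster in clusters:
        cluster['gamma_nk'] /= totals
    return clusters

After this step, every cluster dictionary carries a column of responsibilities that the maximization step can then use to re-estimate π, μ and Σ.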
. . .
Final remarks
Gaussian Mixture Models are a very powerful tool and are
widely used in diverse tasks that involve data clustering. I
hope you found this post useful! Feel free to reach out with questions or comments. I would also highly encourage you to
try the derivations yourself as well as look further into the
code. I look forward to creating more material like this soon.
Enjoy!
. . .
[1] Bishop, Christopher M. Pattern Recognition and Machine Learning (2006). Springer-Verlag, Berlin, Heidelberg.

[2] Murphy, Kevin P. Machine Learning: A Probabilistic Perspective (2012). MIT Press, Cambridge, MA.