Experiment 2 KMeans Clustering

The document outlines an experiment to implement K-Means clustering for customer segmentation using Python and scikit-learn. It includes software requirements, a dataset description, and a step-by-step procedure for loading data, determining the optimal number of clusters using the Elbow method, applying K-Means, and visualizing the results. The expected output includes an Elbow plot and a scatter plot of customer segments based on income and spending score.

Uploaded by

Anant More

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

2 views3 pages

Experiment 2 KMeans Clustering

Uploaded by

Anant More

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 3

Experiment 2: Customer Segmentation

using K-Means Clustering

Aim:
To implement K-Means clustering algorithm for customer segmentation using Python and
scikit-learn.

Software Requirements:
Python 3.x, Jupyter Notebook, pandas, matplotlib, seaborn, scikit-learn

Dataset:
Sample customer dataset with features like Age, Annual Income, and Spending Score.

Procedure:
1. Import necessary libraries.
2. Load the dataset.
3. Explore and visualize the dataset using scatter plots.
4. Use the Elbow method to determine the optimal number of clusters (k).
5. Apply K-Means clustering algorithm using the determined value of k.
6. Visualize the clusters formed.
7. Interpret the results for business insights.

Program:

import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns
from sklearn.cluster import KMeans

# Load dataset
data = pd.read_csv('Mall_Customers.csv')
X = data[['Annual Income (k$)', 'Spending Score (1-100)']]

# Elbow method to find optimal k

wcss = []
for i in range(1, 11):
kmeans = KMeans(n_clusters=i, init='k-means++', random_state=42)
kmeans.fit(X)
wcss.append(kmeans.inertia_)

plt.plot(range(1, 11), wcss)

plt.title('Elbow Method')
plt.xlabel('Number of clusters')
plt.ylabel('WCSS')
plt.show()

# Apply K-Means
kmeans = KMeans(n_clusters=5, init='k-means++', random_state=42)
y_kmeans = kmeans.fit_predict(X)

# Visualizing clusters
plt.scatter(X.values[y_kmeans == 0, 0], X.values[y_kmeans == 0, 1], s = 100, c = 'red', label =
'Cluster 1')
plt.scatter(X.values[y_kmeans == 1, 0], X.values[y_kmeans == 1, 1], s = 100, c = 'blue', label =
'Cluster 2')
plt.scatter(X.values[y_kmeans == 2, 0], X.values[y_kmeans == 2, 1], s = 100, c = 'green', label
= 'Cluster 3')
plt.scatter(X.values[y_kmeans == 3, 0], X.values[y_kmeans == 3, 1], s = 100, c = 'cyan', label =
'Cluster 4')
plt.scatter(X.values[y_kmeans == 4, 0], X.values[y_kmeans == 4, 1], s = 100, c = 'magenta',
label = 'Cluster 5')
plt.scatter(kmeans.cluster_centers_[:, 0], kmeans.cluster_centers_[:, 1], s = 300, c = 'yellow',
label = 'Centroids')
plt.title('Customer Segments')
plt.xlabel('Annual Income (k$)')
plt.ylabel('Spending Score (1-100)')
plt.legend()
plt.show()

Sample Output:
The output consists of:

 Elbow plot showing the optimal number of clusters (typically 5 for this dataset).
 Scatter plot of customers segmented into clusters.
 Different customer segments visualized based on income and spending score.
Viva Questions:
 What is the purpose of customer segmentation?
 How does the K-Means algorithm work?
 What is the Elbow method?
 What are the limitations of K-Means clustering?

Customer Segmentation Using Machine Learning
100% (1)
Customer Segmentation Using Machine Learning
28 pages
Mall Customer Segmentation Using Machine Learning Techniques
No ratings yet
Mall Customer Segmentation Using Machine Learning Techniques
17 pages
Customer Clustering with K-Means
No ratings yet
Customer Clustering with K-Means
3 pages
Ideophones, Mimetics and Expressives - (2019)
100% (2)
Ideophones, Mimetics and Expressives - (2019)
337 pages
Phase 2
No ratings yet
Phase 2
5 pages
Peter Stockwell-Texture - A Cognitive Aesthetics of Reading-Edinburgh University Press (2005)
100% (1)
Peter Stockwell-Texture - A Cognitive Aesthetics of Reading-Edinburgh University Press (2005)
225 pages
Subject: ML Name: Priyanshu Gandhi Date: 10/4/21 Expt. No.: 9 Roll No.: C008 Title: Clustering Implementation in Python
No ratings yet
Subject: ML Name: Priyanshu Gandhi Date: 10/4/21 Expt. No.: 9 Roll No.: C008 Title: Clustering Implementation in Python
7 pages
Customer Segmentation in Python Chapter4
No ratings yet
Customer Segmentation in Python Chapter4
37 pages
Mall Customer Segmentation Guide
No ratings yet
Mall Customer Segmentation Guide
8 pages
K-Means for Customer Segmentation
No ratings yet
K-Means for Customer Segmentation
13 pages
Kman 07
No ratings yet
Kman 07
9 pages
Exp 8ml
No ratings yet
Exp 8ml
5 pages
ML2 Practical List
No ratings yet
ML2 Practical List
80 pages
Customer Segmentation in Python Chapter3
No ratings yet
Customer Segmentation in Python Chapter3
25 pages
Segmentation Algorithm
No ratings yet
Segmentation Algorithm
2 pages
ML - K-Means
No ratings yet
ML - K-Means
12 pages
Clustering Mall Data Students
No ratings yet
Clustering Mall Data Students
11 pages
PMA Experiment 2
No ratings yet
PMA Experiment 2
6 pages
Customer Categorization by Data Analysis Using Clustering Algorithms of Machine Learning
No ratings yet
Customer Categorization by Data Analysis Using Clustering Algorithms of Machine Learning
4 pages
ML Assignment 4
No ratings yet
ML Assignment 4
6 pages
Customer Segmentation
No ratings yet
Customer Segmentation
15 pages
LP I Assignment A4 Clustering
No ratings yet
LP I Assignment A4 Clustering
13 pages
Experiment 3.1 K-Mean
No ratings yet
Experiment 3.1 K-Mean
8 pages
Clustering Algorithms for Data Analysis
No ratings yet
Clustering Algorithms for Data Analysis
7 pages
K Means Clustering Customer Clustering
No ratings yet
K Means Clustering Customer Clustering
7 pages
Da Exp 10
No ratings yet
Da Exp 10
6 pages
Practical 03
No ratings yet
Practical 03
3 pages
Chp1-3 Design and Implementation of A Web Based Payment Verification and Receipts System School Fees
No ratings yet
Chp1-3 Design and Implementation of A Web Based Payment Verification and Receipts System School Fees
26 pages
Customer Segmentation Using Clustering
No ratings yet
Customer Segmentation Using Clustering
6 pages
Customer Segmentation Using K
No ratings yet
Customer Segmentation Using K
16 pages
Presentation 1
No ratings yet
Presentation 1
47 pages
K Means Clustering
No ratings yet
K Means Clustering
5 pages
Assignment 2 (Mb23042)
No ratings yet
Assignment 2 (Mb23042)
7 pages
Da Exp 10
No ratings yet
Da Exp 10
6 pages
Final Code
No ratings yet
Final Code
3 pages
Customer Spent Analysis Using K-Means Clustering
No ratings yet
Customer Spent Analysis Using K-Means Clustering
1 page
Alice in Wonderland - A Critique Paper
No ratings yet
Alice in Wonderland - A Critique Paper
2 pages
BDA LabReport-9
No ratings yet
BDA LabReport-9
17 pages
DWDM PPT
No ratings yet
DWDM PPT
13 pages
23CC554
No ratings yet
23CC554
10 pages
Experiment 4 1
No ratings yet
Experiment 4 1
4 pages
AAM 7th Prac
No ratings yet
AAM 7th Prac
4 pages
Experiment-3 ML Lab
No ratings yet
Experiment-3 ML Lab
20 pages
Processor Organization: Module-3 Part-2
No ratings yet
Processor Organization: Module-3 Part-2
88 pages
ML0101EN Clus K Means Customer Seg Py v1
100% (1)
ML0101EN Clus K Means Customer Seg Py v1
8 pages
Untitled Document-2-1-13-7-11.4
No ratings yet
Untitled Document-2-1-13-7-11.4
5 pages
23dscp206 Ex11
No ratings yet
23dscp206 Ex11
3 pages
Coa Anant More
No ratings yet
Coa Anant More
71 pages
Lab 11 - HT
No ratings yet
Lab 11 - HT
4 pages
AI Week 11
No ratings yet
AI Week 11
21 pages
Week 2 Eapp Lesson
No ratings yet
Week 2 Eapp Lesson
43 pages
Encoder Decoder Multiplexers and Demultiplexers
No ratings yet
Encoder Decoder Multiplexers and Demultiplexers
39 pages
Mad Summer 2022 Mad Model Answer Paper
No ratings yet
Mad Summer 2022 Mad Model Answer Paper
40 pages
Customer Segmentation Using K Means Clustering
No ratings yet
Customer Segmentation Using K Means Clustering
10 pages
Project Explanation
No ratings yet
Project Explanation
17 pages
DS Prac 8
No ratings yet
DS Prac 8
4 pages
Error DC
No ratings yet
Error DC
25 pages
1746593166-Lecture#41 Customer Segmentation K Means Clustering
No ratings yet
1746593166-Lecture#41 Customer Segmentation K Means Clustering
9 pages
Ilovepdf Merged
No ratings yet
Ilovepdf Merged
3 pages
ICT 204 - Lecture 4 Methods
No ratings yet
ICT 204 - Lecture 4 Methods
31 pages
Bone Suplement Market Segmentation
No ratings yet
Bone Suplement Market Segmentation
20 pages
Low Power MAC Architecture Design
No ratings yet
Low Power MAC Architecture Design
5 pages
Components of GIS (Praveen) AMREEN
No ratings yet
Components of GIS (Praveen) AMREEN
20 pages
LAB 4 - K-Means and Elbow Technique
No ratings yet
LAB 4 - K-Means and Elbow Technique
3 pages
新文件 12
No ratings yet
新文件 12
15 pages
Interpolation and Least Square
No ratings yet
Interpolation and Least Square
18 pages
Aiml Assignment 10
No ratings yet
Aiml Assignment 10
6 pages
ANSWER KEY Yearly Exame Paper Maths Class 9 Session (2024-25)
No ratings yet
ANSWER KEY Yearly Exame Paper Maths Class 9 Session (2024-25)
12 pages
IEE Paper
No ratings yet
IEE Paper
5 pages
Mechatronics Lab Manual Latest - Dummy
No ratings yet
Mechatronics Lab Manual Latest - Dummy
11 pages
Binary Subtraction
No ratings yet
Binary Subtraction
7 pages
Network Sim Tools for IT Pros & Students
No ratings yet
Network Sim Tools for IT Pros & Students
12 pages
Practical Research 2
No ratings yet
Practical Research 2
13 pages
Packet Tracer
No ratings yet
Packet Tracer
11 pages
K Means Clustering
No ratings yet
K Means Clustering
11 pages
Booth Algoritm
No ratings yet
Booth Algoritm
6 pages
Form 2 School Based Computer Science Syllabus
No ratings yet
Form 2 School Based Computer Science Syllabus
5 pages
Software Recovery Testing Guide
No ratings yet
Software Recovery Testing Guide
5 pages
Ba I Khao Sat HSG Anh 8 - V1-2021 39144
No ratings yet
Ba I Khao Sat HSG Anh 8 - V1-2021 39144
6 pages
Teaching Grammar (II) : Unit 2
No ratings yet
Teaching Grammar (II) : Unit 2
25 pages
Structure of Computer COA Notes
No ratings yet
Structure of Computer COA Notes
3 pages
Focus2 2E Unit Test Vocabulary Grammar UoE Unit5 GroupB
100% (1)
Focus2 2E Unit Test Vocabulary Grammar UoE Unit5 GroupB
2 pages
Demonology 32893204
No ratings yet
Demonology 32893204
6 pages
The Living Photograph: Poem Analysis
No ratings yet
The Living Photograph: Poem Analysis
4 pages
Endsem Deep Learning Important
No ratings yet
Endsem Deep Learning Important
2 pages
BCM and O Webcast - Questions - and - Answers PDF
No ratings yet
BCM and O Webcast - Questions - and - Answers PDF
12 pages
Compiler Token Separation Guide
No ratings yet
Compiler Token Separation Guide
5 pages
Code COverage
No ratings yet
Code COverage
2 pages
dn015f NOISE
No ratings yet
dn015f NOISE
2 pages
What Is The Twink-Handler Relationship I Asked A Bunch of Twinks and Their Handlers
No ratings yet
What Is The Twink-Handler Relationship I Asked A Bunch of Twinks and Their Handlers
1 page
Shortcut Keys
No ratings yet
Shortcut Keys
1 page
Telecom Data Specialist Profile
No ratings yet
Telecom Data Specialist Profile
3 pages
Focgb1 GQ 5 2a
No ratings yet
Focgb1 GQ 5 2a
1 page
Experiment-7: Implementation of K-Means Clustering Algorithm
No ratings yet
Experiment-7: Implementation of K-Means Clustering Algorithm
3 pages
Hutchinson Resume
No ratings yet
Hutchinson Resume
2 pages

Experiment 2 KMeans Clustering

Uploaded by

Experiment 2 KMeans Clustering

Uploaded by

Experiment 2: Customer Segmentation

using K-Means Clustering

# Elbow method to find optimal k

plt.plot(range(1, 11), wcss)

You might also like