K - Means - Clustering - Ipynb - Colaboratory

This notebook applies K-Means clustering to a mall-customer dataset to group customers by annual income and spending score. It imports the required libraries and the dataset, uses the elbow method to choose the number of clusters, trains a K-Means model that assigns each customer to a cluster, and visualizes the resulting clusters.

Uploaded by

Hoàng Thị Thu Thảo

K-Means Clustering

Importing the libraries

import numpy as np
import matplotlib.pyplot as plt
import pandas as pd

Importing the dataset

dataset = pd.read_csv('Mall_Customers.csv')
X = dataset.iloc[:, [3, 4]].values
X
[ 60, 42],
[ 60, 52],
[ 60, 47],
[ 60, 50],
[ 61, 42],
[ 61, 49],
[ 62, 41],
[ 62, 48],
[ 62, 59],
[ 62, 55],
[ 62, 56],
[ 62, 42],
[ 63, 50],
[ 63, 46],
[ 63, 43],
[ 63, 48],
[ 63, 52],
[ 63, 54],
[ 64, 42],
[ 64, 46],
[ 65, 48],
[ 65, 50],
[ 65, 43],
[ 65, 59],
[ 67, 43],
[ 67, 57],
[ 67, 56],
[ 67, 40],
[ 69, 58],
[ 69, 91],
[ 70, 29],
[ 70, 77],
[ 71, 35],
[ 71, 95],
[ 71, 11],
[ 71, 75],
[ 71, 9],
[ 71, 75],
[ 72, 34],
[ 72, 71],
[ 73, 5],
[ 73, 88],
[ 73, 7],
[ 73, 73],
[ 74, 10],
[ 74, 72],
[ 75, 5],
[ 75, 93],
[ 76, 40],
[ 76, 87],
[ 77, 12],
[ 77, 97],
[ 77, 36],
[ 77, 74],
[ 78, 22],
[ 78, 90],
[ 78, 17],
[ 78, 88],
[ 78, 20],
[ 8 ]
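The positional selection `iloc[:, [3, 4]]` works only while the CSV keeps its column order. A minimal sketch of selecting the same two features by name follows. The small DataFrame is a made-up stand-in for `Mall_Customers.csv`, and the column names are an assumption taken from the axis labels used in the plotting step, so check them against the real file.

```python
import pandas as pd

# Hypothetical stand-in for Mall_Customers.csv: the rows are invented and the
# column names are assumed from the plot's axis labels, not read from the file.
dataset_demo = pd.DataFrame({
    'CustomerID': [1, 2, 3],
    'Gender': ['Male', 'Female', 'Female'],
    'Age': [30, 41, 25],
    'Annual Income (k$)': [60, 62, 78],
    'Spending Score (1-100)': [42, 59, 90],
})

# Select the two clustering features by name instead of by position.
feature_columns = ['Annual Income (k$)', 'Spending Score (1-100)']
X_by_name = dataset_demo[feature_columns].values

# Positional selection, as in the notebook, for comparison.
X_by_position = dataset_demo.iloc[:, [3, 4]].values

print(X_by_name.shape)
print((X_by_name == X_by_position).all())
```

Selecting by name fails loudly with a `KeyError` if a column is missing or renamed, whereas positional selection would silently pick up whatever happens to sit in columns 3 and 4.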

Using the elbow method to find the optimal number of clusters

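from sklearn.cluster import KMeans

# Fit K-Means for k = 1..10 and record the within-cluster sum of squares (WCSS)
wcss = []
for i in range(1, 11):
    kmeans = KMeans(n_clusters = i, init = 'k-means++', random_state = 42)
    kmeans.fit(X)
    wcss.append(kmeans.inertia_)  # inertia_ holds the WCSS of the fitted model

# Plot WCSS against k; the bend ("elbow") in the curve suggests a suitable k
plt.plot(range(1, 11), wcss)
plt.title('The Elbow Method')
plt.xlabel('Number of clusters')
plt.ylabel('WCSS')
plt.show()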
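The quantity on the y-axis, WCSS, is the sum of squared distances from each point to the centroid of its assigned cluster, and scikit-learn exposes it as `inertia_`. The sketch below recomputes it by hand to make that concrete. It uses synthetic points rather than the mall data, so the names `X_demo` and `model` are illustrative assumptions.

```python
import numpy as np
from sklearn.cluster import KMeans

# Synthetic 2-D points scattered around three centres (not the mall dataset).
rng = np.random.default_rng(0)
centres = np.array([[20.0, 20.0], [60.0, 50.0], [90.0, 80.0]])
X_demo = np.vstack([c + rng.normal(scale=3.0, size=(30, 2)) for c in centres])

model = KMeans(n_clusters=3, init='k-means++', n_init=10, random_state=42)
model.fit(X_demo)

# Look up the centroid assigned to every point, then sum the squared distances.
assigned_centres = model.cluster_centers_[model.labels_]
wcss_manual = ((X_demo - assigned_centres) ** 2).sum()

print(wcss_manual, model.inertia_)
print(np.isclose(wcss_manual, model.inertia_))
```

Because WCSS can only shrink or stay flat as clusters are added, the elbow plot looks for the point where the decrease levels off rather than for a minimum.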

Training the K-Means model on the dataset

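# Train the final model with 5 clusters, the value read off the elbow plot
kmeans = KMeans(n_clusters = 5, init = 'k-means++', random_state = 42)
# fit_predict fits the model and returns a cluster index (0-4) for every row of X
y_kmeans = kmeans.fit_predict(X)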
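Once the model is fitted, it can report how many customers fall in each cluster and assign a cluster to a customer it has not seen. The following is a sketch on synthetic data. The five blob centres, `X_demo`, and `new_customer` are made-up values for illustration and are not taken from the dataset.

```python
import numpy as np
from sklearn.cluster import KMeans

# Five well-separated synthetic blobs of 20 points each (illustrative only).
rng = np.random.default_rng(1)
centres = np.array(
    [[25, 20], [25, 80], [55, 50], [85, 15], [85, 85]], dtype=float
)
X_demo = np.vstack([c + rng.normal(scale=2.0, size=(20, 2)) for c in centres])

kmeans_demo = KMeans(n_clusters=5, init='k-means++', n_init=10, random_state=42)
labels = kmeans_demo.fit_predict(X_demo)

# Number of points assigned to each cluster index 0..4.
cluster_sizes = np.bincount(labels, minlength=5)

# Assign a new, unseen point (income 84 k$, score 86) to its nearest centroid.
new_customer = np.array([[84.0, 86.0]])
new_label = kmeans_demo.predict(new_customer)[0]

print(cluster_sizes)
print(new_label)
```

The cluster indices themselves are arbitrary: a different `random_state` can permute them, so downstream code should not attach meaning to a particular index.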

Visualising the clusters

plt.scatter(X[y_kmeans == 0, 0], X[y_kmeans == 0, 1], s = 100, c = 'red', label = 'Cluster 1')
plt.scatter(X[y_kmeans == 1, 0], X[y_kmeans == 1, 1], s = 100, c = 'blue', label = 'Cluster 2')
plt.scatter(X[y_kmeans == 2, 0], X[y_kmeans == 2, 1], s = 100, c = 'green', label = 'Cluster 3')
plt.scatter(X[y_kmeans == 3, 0], X[y_kmeans == 3, 1], s = 100, c = 'cyan', label = 'Cluster 4')
plt.scatter(X[y_kmeans == 4, 0], X[y_kmeans == 4, 1], s = 100, c = 'magenta', label = 'Cluster 5')
plt.scatter(kmeans.cluster_centers_[:, 0], kmeans.cluster_centers_[:, 1], s = 300, c = 'black', label = 'Centroids')
plt.title('Clusters of customers')
plt.xlabel('Annual Income (k$)')
plt.ylabel('Spending Score (1-100)')
plt.legend()
plt.show()
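The five near-identical `plt.scatter` calls can be written as a loop over the cluster indices, which keeps the plot in step with the number of clusters. This is a sketch of one way to do it, using synthetic data and hypothetical names (`X_demo`, `y_demo`, `colours`). The `Agg` backend line is only there so the code runs outside a notebook.

```python
import matplotlib
matplotlib.use('Agg')  # headless backend; not needed inside Colab or Jupyter
import matplotlib.pyplot as plt
import numpy as np
from sklearn.cluster import KMeans

# Five synthetic blobs standing in for the income / spending-score features.
rng = np.random.default_rng(2)
centres = np.array(
    [[25, 20], [25, 80], [55, 50], [85, 15], [85, 85]], dtype=float
)
X_demo = np.vstack([c + rng.normal(scale=2.0, size=(20, 2)) for c in centres])

kmeans_demo = KMeans(n_clusters=5, init='k-means++', n_init=10, random_state=42)
y_demo = kmeans_demo.fit_predict(X_demo)

colours = ['red', 'blue', 'green', 'cyan', 'magenta']
fig, ax = plt.subplots()

# One scatter call per cluster, driven by the cluster index.
for k, colour in enumerate(colours):
    mask = y_demo == k
    ax.scatter(X_demo[mask, 0], X_demo[mask, 1], s=100, c=colour,
               label=f'Cluster {k + 1}')

# Centroids drawn last so they sit on top of the points.
ax.scatter(kmeans_demo.cluster_centers_[:, 0], kmeans_demo.cluster_centers_[:, 1],
           s=300, c='black', label='Centroids')
ax.set_title('Clusters of customers')
ax.set_xlabel('Annual Income (k$)')
ax.set_ylabel('Spending Score (1-100)')
ax.legend()
```

In a notebook, calling `plt.show()` after this block displays the figure, as in the original cell.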
