Experiment-5
Objective: Write a program to implement the naïve Bayesian classifier for a sample training data
set stored as a .CSV file. Compute the accuracy of the classifier, considering a few test data sets.
Source Code:
import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import LabelEncoder
from sklearn.naive_bayes import GaussianNB
from sklearn.metrics import accuracy_score, confusion_matrix
# Load dataset
data = pd.read_csv("PlayTennis.csv")
# Encoding categorical features
label_encoders = {}
for column in data.columns[:-1]:  # Excluding target column
    le = LabelEncoder()
    data[column] = le.fit_transform(data[column])
    label_encoders[column] = le
target_encoder = LabelEncoder()
data['PlayTennis'] = target_encoder.fit_transform(data['PlayTennis'])
# Splitting dataset into train and test sets
X = data.drop(columns=['PlayTennis'])
y = data['PlayTennis']
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)
# Train Naïve Bayes classifier
classifier = GaussianNB()
classifier.fit(X_train, y_train)
# Predictions
y_pred = classifier.predict(X_test)
# Compute accuracy
accuracy = accuracy_score(y_test, y_pred)
print(f"Accuracy: {accuracy * 100:.2f}%")
# Confusion matrix
conf_matrix = confusion_matrix(y_test, y_pred)
print("Confusion Matrix:")
print(conf_matrix)
# Correct and incorrect classifications
correct = (y_pred == y_test).sum()
incorrect = (y_pred != y_test).sum()
print(f"Correct Classifications: {correct}")
print(f"Incorrect Classifications: {incorrect}")
Explanation:
1. Load the Dataset
The script reads a CSV file (PlayTennis.csv) containing categorical features like Outlook,
Temperature, Humidity, Wind, and the target variable PlayTennis.
2. Encode Categorical Features
Since scikit-learn's Naïve Bayes implementations require numerical input, Label Encoding is used
to convert the categorical features into integers. Each categorical column (except the target) is
encoded with its own LabelEncoder().
The target column (PlayTennis) is encoded separately with its own encoder.
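For example, LabelEncoder() assigns integer codes in alphabetical order of the values, so a hypothetical Outlook column might be encoded as follows (the exact mapping depends on the values present in the file):

from sklearn.preprocessing import LabelEncoder
# Hypothetical Outlook values from a typical PlayTennis dataset
outlook = ["Sunny", "Overcast", "Rain", "Sunny", "Rain"]
le = LabelEncoder()
encoded = le.fit_transform(outlook)
# Codes follow alphabetical order: Overcast -> 0, Rain -> 1, Sunny -> 2
print(dict(zip(le.classes_, le.transform(le.classes_))))
print(encoded)  # [2 0 1 2 1]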
3. Split Dataset into Training & Testing Sets
The dataset is split into 80% training and 20% testing using train_test_split().
X (features) and y (target) are separated before splitting.
4. Train the Naïve Bayes Classifier
A Gaussian Naïve Bayes model (GaussianNB()) is trained on the training data.
For each class, the model estimates the mean and variance of every feature and uses these
Gaussian distributions to compute class-conditional probabilities.
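The fitted parameters can be inspected directly. A small sketch (theta_ holds the per-class feature means and var_ the per-class variances in scikit-learn 1.0+; older versions expose sigma_ instead of var_):

# After classifier.fit(X_train, y_train):
print(classifier.class_prior_)  # P(class), estimated from the training labels
print(classifier.theta_)        # per-class mean of each feature
print(classifier.var_)          # per-class variance of each feature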
5. Make Predictions
The classifier predicts outcomes on the test dataset.
6. Compute Accuracy
The model's accuracy is calculated using accuracy_score(y_test, y_pred).
This gives the proportion of correctly classified instances (correct predictions divided by the total number of predictions), reported here as a percentage.
7. Generate Confusion Matrix
The confusion matrix (confusion_matrix(y_test, y_pred)), whose rows correspond to actual classes and columns to predicted classes, shows the number of:
True Positives (Correct Yes)
True Negatives (Correct No)
False Positives (Incorrect Yes)
False Negatives (Incorrect No)
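For a binary Yes/No target, the four counts can be unpacked directly from the matrix and the accuracy recomputed from them as a sanity check (a small sketch reusing the variables defined above; ravel() applies to the binary case only):

tn, fp, fn, tp = confusion_matrix(y_test, y_pred).ravel()
print(f"TN={tn}, FP={fp}, FN={fn}, TP={tp}")
print(f"Accuracy from matrix: {(tn + tp) / (tn + fp + fn + tp):.2f}")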
8. Count Correct & Incorrect Classifications
The script calculates and prints the number of correctly and incorrectly classified instances.
Experiment-8
Objective: Apply EM algorithm to cluster a set of data stored in a .CSV file. Use the same data
set for clustering using k-Means algorithm. Compare the results of these two algorithms and
comment on the quality of clustering. You can add Java/Python ML library classes/API in the
program.
Source Code:
from sklearn.cluster import KMeans
from sklearn import preprocessing
from sklearn.mixture import GaussianMixture
from sklearn.datasets import load_iris
import sklearn.metrics as sm
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
# Load the dataset
dataset = load_iris()
X = pd.DataFrame(dataset.data)
X.columns = ['Sepal_Length', 'Sepal_width', 'Petal_Length', 'Petal_Width']
y = pd.DataFrame(dataset.target)
y.columns = ['Targets']
# Plotting
plt.figure(figsize=(14, 7))
colormap = np.array(['red', 'lime', 'black'])
# Real Plot
plt.subplot(1, 3, 1)
plt.scatter(X.Petal_Length, X.Petal_Width, c=colormap[y.Targets], s=40)
plt.title('Real')
# KMeans Plot
plt.subplot(1, 3, 2)
model = KMeans(n_clusters=3)
model.fit(X)
# k-Means cluster numbers are arbitrary; np.choose can permute them to match the true labels
# (the identity permutation [0, 1, 2] may need reordering for a given run)
predY = np.choose(model.labels_, [0, 1, 2]).astype(np.int64)
plt.scatter(X.Petal_Length, X.Petal_Width, c=colormap[predY], s=40)
plt.title('KMeans')
# GMM Plot
scaler = preprocessing.StandardScaler()
scaler.fit(X)
xsa = scaler.transform(X)
xs = pd.DataFrame(xsa, columns=X.columns)
gmm = GaussianMixture(n_components=3)
gmm.fit(xs)
y_cluster_gmm = gmm.predict(xs)
plt.subplot(1, 3, 3)
plt.scatter(X.Petal_Length, X.Petal_Width, c=colormap[y_cluster_gmm], s=40)
plt.title('GMM Classification')
plt.show()
Output:
(Figure: three scatter plots of Petal_Length vs. Petal_Width, titled 'Real', 'KMeans', and 'GMM Classification'.)
Explanation:
1. Data Loading:
• The script loads the Iris dataset, which includes features
like Sepal_Length, Sepal_width, Petal_Length, and Petal_Width, and the target
variable Targets (species of the flower).
2. k-Means Clustering:
• The k-Means algorithm is used to cluster the data into 3 clusters (since there are 3
species in the Iris dataset).
• The predicted cluster labels are used to color the data points in the plot.
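Here k = 3 is chosen from prior knowledge of the dataset. Without that knowledge, a common heuristic is the elbow method; a brief sketch that reuses the imports above (fit k-Means for several values of k and look for the bend in the inertia curve):

# Elbow method: within-cluster sum of squares (inertia) versus k
inertias = [KMeans(n_clusters=k, n_init=10).fit(X).inertia_ for k in range(1, 8)]
plt.plot(range(1, 8), inertias, marker='o')
plt.xlabel('k')
plt.ylabel('Inertia')
plt.title('Elbow method')
plt.show()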
3. Gaussian Mixture Model (GMM) Clustering:
• The data is standardized using StandardScaler so that all features contribute
equally to the clustering.
• The GMM is fitted with the Expectation-Maximization (EM) algorithm, modelling the
data as a mixture of 3 Gaussian distributions (components).
• The predicted cluster labels from GMM are used to color the data points in the plot.
4. Visualization:
• Three subplots are created:
• The first subplot shows the real data distribution colored by the true species
labels.
• The second subplot shows the clustering result of k-Means.
• The third subplot shows the clustering result of GMM.
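The objective also asks for a comparison of the two algorithms. One quantitative option, using the sm alias already imported in the script, is the adjusted Rand index, which is invariant to the arbitrary numbering of clusters (a minimal sketch):

# Compare both clusterings against the true species labels
print("k-Means ARI:", sm.adjusted_rand_score(y.Targets, model.labels_))
print("GMM ARI:", sm.adjusted_rand_score(y.Targets, y_cluster_gmm))

On the Iris data, GMM typically scores higher: its per-component covariance model captures the elongated, overlapping versicolor/virginica clusters better than k-Means' implicit assumption of spherical, equally sized clusters.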
Experiment-9
Objective: Build an Artificial Neural Network by implementing the Backpropagation algorithm
and test the same using appropriate datasets.
Source Code:
import pandas as pd
import tensorflow as tf
from google.colab import files
# Upload the dataset
file_upload = files.upload()
# Read the dataset
df = pd.read_csv('Celsius_to_Fahrenheit.csv')
print(df.head())
# Prepare the data
X = df['Celsius']
y = df['Fahrenheit']
from sklearn.model_selection import train_test_split
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=101)
print(X_train.shape, y_train.shape)
print(X_test.shape, y_test.shape)
# Plot the data
import seaborn as sns
import matplotlib.pyplot as plt
sns.scatterplot(x=df['Celsius'], y=df['Fahrenheit'], marker='.', s=20, color='b')
plt.show()
# Initialize the model
model = tf.keras.Sequential()
model.add(tf.keras.layers.Dense(units=1, input_shape=[1]))
model.summary()
# Compile the model
model.compile(optimizer=tf.keras.optimizers.Adam(0.5), loss='mean_squared_error')
# Train the model
epochs_hist = model.fit(X_train, y_train, epochs=500)
# Get the model weights
print(model.get_weights())
# Make a prediction (wrap the input in a NumPy array for model.predict)
import numpy as np
celsius_temp = 100
print(f'Prediction from our perceptron model is: {model.predict(np.array([celsius_temp]))}')
# Calculate the actual value
F = 9/5 * celsius_temp + 32
print(f'Prediction from actual formula is: {F}')
# Plot the loss progression
plt.plot(epochs_hist.history['loss'])
plt.xlabel('Number of epochs')
plt.ylabel('loss')
plt.title('Loss progression during training')
plt.show()
# Plot the regression line on training data
plt.scatter(X_train, y_train, c='b', marker='.')
plt.plot(X_train, model.predict(X_train), c='g')
plt.xlabel('Celsius')
plt.ylabel('Fahrenheit')
plt.title('Regression line on training data')
plt.show()
# Plot the regression line on test data
plt.scatter(X_test, y_test, c='b', marker='.')
plt.plot(X_test, model.predict(X_test), c='r')
plt.xlabel('Celsius')
plt.ylabel('Fahrenheit')
plt.title('Regression line on test data')
plt.show()
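The Keras model above delegates backpropagation to TensorFlow. Since the objective asks for the algorithm itself, here is a minimal hand-rolled sketch of the same single-neuron fit, trained by gradient descent on the mean-squared error (the names w, b, and lr are illustrative, not taken from the script above):

import numpy as np

# Toy data generated from the exact conversion formula
celsius = np.linspace(-40, 100, 50)
fahrenheit = 9 / 5 * celsius + 32

# Single neuron y_hat = w * x + b, trained by gradient descent on the MSE
w, b, lr = 0.0, 0.0, 2e-4  # small learning rate because the inputs are not normalized
for epoch in range(50000):
    y_hat = w * celsius + b                # forward pass
    error = y_hat - fahrenheit
    grad_w = 2 * np.mean(error * celsius)  # dMSE/dw
    grad_b = 2 * np.mean(error)            # dMSE/db
    w -= lr * grad_w                       # backward pass: update parameters
    b -= lr * grad_b

print(f"w = {w:.3f}, b = {b:.3f}")  # should approach 1.8 and 32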
Output:
(Figures: scatter plot of the raw data, training-loss curve, and regression lines over the training and test sets.)
Experiment-10
Objective: For a given set of training data examples stored in a .CSV file, implement
and demonstrate the Candidate-Elimination algorithm to output a description of the set
of all hypotheses consistent with the training examples.
Source Code:
import numpy as np
import pandas as pd
# Loading Data from a CSV File
data = pd.read_csv('trainingdata.csv')
print(data)
# Separating concept features from Target
concepts = np.array(data.iloc[:,0:-1])
print(concepts)
# Isolating target into a separate DataFrame
# copying last column to target array
target = np.array(data.iloc[:,-1])
print(target)
def learn(concepts, target):
    '''
    learn() implements the learning step of the Candidate-Elimination algorithm.
    Arguments:
        concepts - NumPy array of all feature rows
        target - NumPy array of the corresponding output values
    '''
    # Initialise S0 with the first instance from concepts
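The listing is truncated here at a page break. For reference, a self-contained sketch of the standard Candidate-Elimination update follows; it assumes the target column holds the strings "Yes" and "No", and it is not necessarily identical to the continuation of the original listing:

def candidate_elimination(concepts, target):
    # S starts as the first training instance; G starts as the most general hypothesis
    specific_h = concepts[0].copy()
    n = len(specific_h)
    general_h = [['?' for _ in range(n)] for _ in range(n)]
    for i, instance in enumerate(concepts):
        if target[i] == "Yes":        # positive example: generalise S, prune G
            for j in range(n):
                if instance[j] != specific_h[j]:
                    specific_h[j] = '?'
                    general_h[j][j] = '?'
        else:                         # negative example: specialise G against S
            for j in range(n):
                if instance[j] != specific_h[j]:
                    general_h[j][j] = specific_h[j]
                else:
                    general_h[j][j] = '?'
    # Drop the rows of G that stayed fully general
    general_h = [h for h in general_h if h != ['?'] * n]
    return specific_h, general_h

s_final, g_final = candidate_elimination(concepts, target)
print("Final S:", s_final)
print("Final G:", g_final)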