Python Machine Learning Practical Guide

The document describes a practical file submitted by a student for their Machine Learning using Python course. It contains 6 experiments implementing different machine learning algorithms using Python:
1. Read and analyze a dataset from Kaggle using Pandas.
2. Implement linear regression to fit a line to random sample data.
3. Perform binary logistic regression on the Titanic dataset from Kaggle.
4. Apply Naive Bayes to classify flowers using the iris dataset and study the confusion matrix.
5. Use Naive Bayes on another dataset from Kaggle and evaluate accuracy, precision, and recall.
6. Implement support vector machines on the iris dataset to classify flowers.

Uploaded by

khatmalmain

MACHINE LEARNING USING PYTHON

CAP4013L
School of Engineering & Sciences

Department of Computer Sciences and Engineering

Practical File

Submitted By
Student Name: Pankaj Kumar
Enrolment Number: 220160307085
Programme: Master of Computer Application
Department: Computer Science and Engineering
Session/Semester: 2022-2024/Third Semester

Submitted To
Faculty Name: Dr. Apeksha Mittal

INDEX

S.no  Aim of Experiment  Date  Sign

1. Download a dataset from Kaggle (.csv format, at least 1000 rows and 20 columns) and write a program in Python to perform the following operations:
   i) Read the dataset file in Python IDE.
   ii) Display the dataset.
   iii) Display the shape of the dataset.
   iv) Display the datatypes of the attributes of the dataset.
   v) Find out the mean, median and mode of all the numeric columns.
   vi) Describe the entire dataset in terms of count, min, max, standard deviation, variance, etc.
   Date: 19-OCT-2023

2. Write a program in Python to implement Linear Regression.
   Date: 30-OCT-2023

3. Write a program in Python to implement Binary Logistic Regression on a dataset downloaded from Kaggle.
   Date: 03-NOV-2023

4. Write a program in Python to implement Naïve Bayes on the iris dataset. Study the confusion matrix.
   Date: 08-NOV-2023

5. Write a program in Python to implement the Naïve Bayes algorithm on a dataset from Kaggle. Also print the confusion matrix, accuracy, precision and recall.
   Date: 16-NOV-2023

6. Write a program in Python to implement Support Vector Machine on the iris dataset.
   Date: 21-NOV-2023

1. Download a dataset from Kaggle (.csv format, at least 1000 rows and 20 columns) and write a program in Python to perform the following operations:
i) Read the dataset file in Python IDE.
ii) Display the dataset.
iii) Display the shape of the dataset.
iv) Display the datatypes of the attributes of the dataset.
v) Find out the mean, median and mode of all the numeric columns.
vi) Describe the entire dataset in terms of count, min, max, standard deviation, variance, etc.

# Import necessary libraries
import pandas as pd

# i) Read the dataset file in Python IDE
# Replace 'path/to/match.csv' with the actual file path
file_path = 'path/to/match.csv'
df = pd.read_csv(file_path)

# ii) Display the dataset
print("Dataset:")
print(df)

# iii) Display the shape of the dataset
print("\nShape of the dataset:")
print(df.shape)

# iv) Display the datatypes of the attributes of the dataset
print("\nDatatypes of the attributes:")
print(df.dtypes)

# v) Find out the mean, median, and mode of all the numeric columns
# numeric_only=True skips text columns, which raise an error in pandas 2.0 and later
print("\nMean of numeric columns:")
print(df.mean(numeric_only=True))
print("\nMedian of numeric columns:")
print(df.median(numeric_only=True))
print("\nMode of numeric columns:")
print(df.select_dtypes(include='number').mode().iloc[0])

# vi) Describe the entire dataset in terms of count, min, max, standard deviation, variance, etc.
print("\nSummary statistics of the dataset:")
print(df.describe())
# describe() does not report variance, so it is printed separately
print("\nVariance of numeric columns:")
print(df.var(numeric_only=True))
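
The statistics in steps v) and vi) can be checked by hand on a very small table. The sketch below uses a made-up DataFrame (the column names and values are hypothetical and are not taken from the Kaggle file) with one text column, to show how the numeric columns are selected before the statistics are computed.

```python
import pandas as pd

# Hypothetical toy data standing in for the Kaggle CSV
toy = pd.DataFrame({
    "runs": [10, 20, 20, 40],
    "wickets": [1, 3, 3, 5],
    "team": ["A", "B", "A", "C"],  # non-numeric column, excluded below
})

# Keep only the numeric columns
numeric = toy.select_dtypes(include="number")

mean_values = numeric.mean()
median_values = numeric.median()
mode_values = numeric.mode().iloc[0]  # first mode if a column has several
variance_values = numeric.var()       # sample variance (divides by n - 1)

print(mean_values, median_values, mode_values, variance_values, sep="\n\n")
```

Note that pandas computes the sample variance by default, so the result for "wickets" is 8/3 rather than 8/4.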
2. Write a program in python to implement Linear Regression
import numpy as np
import matplotlib.pyplot as plt
# Generate some random data for demonstration purposes
np.random.seed(42)
X = 2 * np.random.rand(100, 1)
y = 4 + 3 * X + np.random.randn(100, 1)
# Visualize the data
plt.scatter(X, y)
plt.xlabel('X')
plt.ylabel('y')
plt.title('Generated Data for Linear Regression')
plt.show()
# Linear Regression implementation using NumPy
# Normal equation: theta = (X^T X)^(-1) X^T y
X_b = np.c_[np.ones((100, 1)), X] # Add bias term (a column of ones) to X
theta_best = np.linalg.inv(X_b.T.dot(X_b)).dot(X_b.T).dot(y)

# Print the calculated parameters

print("Intercept (theta_0):", theta_best[0][0])
print("Slope (theta_1):", theta_best[1][0])

# Make predictions on new data

X_new = np.array([[0], [2]])
X_new_b = np.c_[np.ones((2, 1)), X_new]
y_predict = X_new_b.dot(theta_best)

# Plot the linear regression line

plt.plot(X_new, y_predict, "r-")
plt.scatter(X, y)
plt.xlabel('X')
plt.ylabel('y')
plt.title('Linear Regression Fit')
plt.show()
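
As a cross-check on the normal-equation fit above, the same parameters can be obtained from NumPy's least-squares solver. This is only a sketch: the data is regenerated here with a different random generator and seed (an assumption made so the block runs on its own), so the printed numbers will not match the program above exactly.

```python
import numpy as np

# Regenerate similar synthetic data: y = 4 + 3x + noise
rng = np.random.default_rng(0)
X = 2 * rng.random((100, 1))
y = 4 + 3 * X + rng.standard_normal((100, 1))

# Add the bias column of ones
X_b = np.c_[np.ones((100, 1)), X]

# Normal equation, as in the program above
theta_normal = np.linalg.inv(X_b.T @ X_b) @ X_b.T @ y

# Least-squares solver, which avoids forming the explicit inverse
theta_lstsq, residuals, rank, singular_values = np.linalg.lstsq(X_b, y, rcond=None)

print("Normal equation:", theta_normal.ravel())
print("lstsq:          ", theta_lstsq.ravel())
```

Both approaches minimise the same squared error, so the two parameter vectors should agree to within floating-point precision.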
3. Write a program in Python to implement Binary Logistic Regression on a dataset downloaded from Kaggle
The Titanic dataset from Kaggle is used for this experiment.
# Import necessary libraries
import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LogisticRegression
from sklearn.preprocessing import StandardScaler
from sklearn.metrics import (accuracy_score, confusion_matrix,
                             precision_score, recall_score)

# Load the Titanic dataset (replace 'Titanic.csv' with the actual file path)
df = pd.read_csv('Titanic.csv')

# Preprocess the data (handle missing values, encode categorical variables, etc.)
# For simplicity, keep only the columns used by the model and drop rows with missing values
df = df[['Pclass', 'Sex', 'Age', 'SibSp', 'Parch', 'Fare', 'Survived']].dropna()

# Convert categorical variables to numerical using one-hot encoding

df = pd.get_dummies(df, columns=['Sex'], drop_first=True)

# Separate features and target variable

X = df.drop('Survived', axis=1)
y = df['Survived']

# Split the dataset into training and testing sets

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

# Standardize the features (optional but often recommended)

scaler = StandardScaler()
X_train = scaler.fit_transform(X_train)
X_test = scaler.transform(X_test)

# Create and train the Logistic Regression model

logreg_model = LogisticRegression()
logreg_model.fit(X_train, y_train)

# Make predictions on the test set

y_pred = logreg_model.predict(X_test)

# Evaluate the model

accuracy = accuracy_score(y_test, y_pred)
precision = precision_score(y_test, y_pred)
recall = recall_score(y_test, y_pred)
conf_matrix = confusion_matrix(y_test, y_pred)

# Print the results

print("Accuracy:", accuracy)
print("Precision:", precision)
print("Recall:", recall)
print("Confusion Matrix:")
print(conf_matrix)
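
The program above fits the scaler on the training split only and then reuses it on the test split. One way to keep that discipline automatically is to chain the scaler and the classifier in a scikit-learn pipeline. The sketch below is an illustration on synthetic data from make_classification (an assumption, since the Titanic file is not bundled here), not a rerun of the experiment.

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Synthetic binary classification data standing in for the Titanic features
X_demo, y_demo = make_classification(n_samples=300, n_features=6, random_state=42)

# The scaler is refitted on the training part of every fold,
# so no statistics leak from the held-out part
pipeline = make_pipeline(StandardScaler(), LogisticRegression(max_iter=1000))

# 5-fold cross-validated accuracy
scores = cross_val_score(pipeline, X_demo, y_demo, cv=5, scoring="accuracy")
print("Fold accuracies:", scores)
print("Mean accuracy:", scores.mean())
```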

4. Write a program in Python to implement Naïve Bayes on the iris dataset. Study the confusion matrix
# Import necessary libraries
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import GaussianNB
from sklearn.metrics import confusion_matrix, accuracy_score
from sklearn import datasets

# Load the Iris dataset

iris = datasets.load_iris()
X = iris.data
y = iris.target

# Split the dataset into training and testing sets

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=42)

# Create a Naive Bayes model (Gaussian Naive Bayes for continuous features)
nb_model = GaussianNB()

# Train the model

nb_model.fit(X_train, y_train)

# Make predictions on the test set

y_pred = nb_model.predict(X_test)

# Evaluate the model

accuracy = accuracy_score(y_test, y_pred)
conf_matrix = confusion_matrix(y_test, y_pred)

# Display the results

print("Accuracy:", accuracy)
print("Confusion Matrix:")
print(conf_matrix)
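
To study a multi-class confusion matrix, it helps to derive per-class precision and recall from it. In scikit-learn's convention the rows are the true classes and the columns are the predicted classes. The matrix in the sketch below is hypothetical and chosen only for illustration; it is not the output of the program above.

```python
import numpy as np

# Hypothetical 3-class confusion matrix (rows = true class, columns = predicted class)
cm = np.array([
    [19, 0, 0],
    [0, 12, 1],
    [0, 1, 12],
])

# Correct predictions lie on the diagonal
true_positives = np.diag(cm)

# Recall: correct predictions divided by the number of true samples of each class (row sums)
recall_per_class = true_positives / cm.sum(axis=1)

# Precision: correct predictions divided by the number of predictions of each class (column sums)
precision_per_class = true_positives / cm.sum(axis=0)

# Overall accuracy: diagonal total divided by all samples
overall_accuracy = true_positives.sum() / cm.sum()

print("Recall per class:   ", recall_per_class)
print("Precision per class:", precision_per_class)
print("Accuracy:           ", overall_accuracy)
```

Off-diagonal entries show which pairs of classes are confused with each other; in this made-up matrix only the second and third classes are ever mixed up.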

5. Write a program in Python to implement the Naive Bayes algorithm on a dataset from Kaggle. Also print the confusion matrix, accuracy, precision and recall.

# Import necessary libraries

import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import GaussianNB
from sklearn.metrics import (accuracy_score, confusion_matrix,
                             precision_score, recall_score)

# Load the Titanic dataset (you can download it from Kaggle or use the seaborn
# library to load it)
# For example, using seaborn (note: the seaborn copy uses lowercase column
# names such as 'pclass' and 'survived'):
# import seaborn as sns
# df = sns.load_dataset('titanic')

# Assuming you have a 'Titanic.csv' file

df = pd.read_csv('Titanic.csv')

# Preprocess the data (you may need to handle missing values, encode categorical
# variables, etc.)
# For simplicity, keep only the columns used by the model and drop rows with missing values
df = df[['Pclass', 'Sex', 'Age', 'SibSp', 'Parch', 'Fare', 'Survived']].dropna()

# Convert categorical variables to numerical using one-hot encoding

df = pd.get_dummies(df, columns=['Sex'], drop_first=True)

# Separate features and target variable

X = df.drop('Survived', axis=1)
y = df['Survived']

# Split the dataset into training and testing sets

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)
# Create and train the Naive Bayes model (Gaussian Naive Bayes for numerical features)
naive_bayes = GaussianNB()
naive_bayes.fit(X_train, y_train)

# Make predictions on the test set

y_pred = naive_bayes.predict(X_test)

# Evaluate the model

conf_matrix = confusion_matrix(y_test, y_pred)
accuracy = accuracy_score(y_test, y_pred)
precision = precision_score(y_test, y_pred)
recall = recall_score(y_test, y_pred)

# Print the results

print("Confusion Matrix:")
print(conf_matrix)
print("\nAccuracy:", accuracy)
print("Precision:", precision)
print("Recall:", recall)
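
The three scores printed above all follow from the four cells of a binary confusion matrix. The sketch below works this out on a small hand-made pair of label lists (hypothetical values, not Titanic predictions) and compares the manual formulas with scikit-learn's functions.

```python
from sklearn.metrics import (accuracy_score, confusion_matrix,
                             precision_score, recall_score)

# Hypothetical true labels and predictions (1 = positive class)
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
y_hat = [1, 0, 0, 1, 0, 1, 1, 0]

# For binary labels, ravel() returns the cells in the order tn, fp, fn, tp
tn, fp, fn, tp = confusion_matrix(y_true, y_hat).ravel()

manual_accuracy = (tp + tn) / (tp + tn + fp + fn)
manual_precision = tp / (tp + fp)  # share of predicted positives that are correct
manual_recall = tp / (tp + fn)     # share of actual positives that are found

print("tn, fp, fn, tp:", tn, fp, fn, tp)
print("Accuracy:", manual_accuracy, accuracy_score(y_true, y_hat))
print("Precision:", manual_precision, precision_score(y_true, y_hat))
print("Recall:", manual_recall, recall_score(y_true, y_hat))
```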

6. Write a program in Python to implement Support Vector Machine on the iris dataset.

# Import necessary libraries

import numpy as np
import matplotlib.pyplot as plt
from sklearn import datasets
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC
from sklearn.metrics import accuracy_score, confusion_matrix

# Load the Iris dataset

iris = datasets.load_iris()
X = iris.data
y = iris.target

# Split the dataset into training and testing sets

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

# Create and train the Support Vector Machine model

svm_model = SVC(kernel='linear') # You can try different kernels like 'rbf', 'poly', etc.
svm_model.fit(X_train, y_train)

# Make predictions on the test set

y_pred = svm_model.predict(X_test)

# Evaluate the model

accuracy = accuracy_score(y_test, y_pred)
conf_matrix = confusion_matrix(y_test, y_pred)

# Print the results

print("Accuracy:", accuracy)
print("Confusion Matrix:")
print(conf_matrix)

# Visualization (2D plot for simplicity, considering only the first two features)
# The model above was trained on all four features, so it cannot predict on a
# two-column mesh. A second SVM is fitted on the first two features for plotting.
svm_2d = SVC(kernel='linear')
svm_2d.fit(X[:, :2], y)

plt.figure(figsize=(8, 6))

# Plot the decision boundary

h = .02 # Step size in the mesh
x_min, x_max = X[:, 0].min() - 1, X[:, 0].max() + 1
y_min, y_max = X[:, 1].min() - 1, X[:, 1].max() + 1
xx, yy = np.meshgrid(np.arange(x_min, x_max, h), np.arange(y_min, y_max, h))
Z = svm_2d.predict(np.c_[xx.ravel(), yy.ravel()])
Z = Z.reshape(xx.shape)
plt.contourf(xx, yy, Z, cmap=plt.cm.coolwarm, alpha=0.8)

# Plot the points

scatter = plt.scatter(X[:, 0], X[:, 1], c=y, cmap=plt.cm.coolwarm)
plt.xlabel('Sepal length')
plt.ylabel('Sepal width')
plt.title('Support Vector Machine on Iris Dataset')
plt.legend(*scatter.legend_elements(), title='Classes')

plt.show()
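
The comment in the program above suggests trying other kernels. A minimal sketch of such a comparison is given below, assuming 5-fold cross-validation with default SVC settings as the yardstick; this setup is an illustrative choice and is not part of the original experiment.

```python
from sklearn import datasets
from sklearn.model_selection import cross_val_score
from sklearn.svm import SVC

iris = datasets.load_iris()

# Mean 5-fold cross-validated accuracy for each kernel
kernel_scores = {}
for kernel in ("linear", "rbf", "poly"):
    scores = cross_val_score(SVC(kernel=kernel), iris.data, iris.target, cv=5)
    kernel_scores[kernel] = scores.mean()

for kernel, score in kernel_scores.items():
    print(f"{kernel}: {score:.3f}")
```

Cross-validation averages over several splits, so it gives a steadier comparison than a single train/test split on a dataset as small as iris.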
