0% found this document useful (0 votes)

42 views28 pages

ML (Lab Programs)

The document provides a comprehensive guide on setting up Python and essential libraries such as NumPy, Pandas, and Scikit-learn for machine learning. It includes detailed installation steps, sample programs for data manipulation, visualization, and machine learning tasks, as well as methods for handling missing data and encoding categorical variables. The document serves as a practical resource for beginners and practitioners in data science and machine learning.

Uploaded by

itadeekshu19

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

42 views28 pages

ML (Lab Programs)

Uploaded by

itadeekshu19

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 28

Machine Learning

Lab Programs

1. Install and set-up Python and Pardas. and essential libmates Like
Numpy

● Installation of Python and Set-up Python

Search Python in the Google search bar “https://www.python.org/downloads/

● ”and download. The latest version of Python from the Google.

● Python latest version, Python 3.12.1 64-bit/32-bit and download the

executable ﬁde.
Downloading the Python Installer

● Open the .exe ﬁle, such as Python 3.12.1 and 64, then launch the
python installer.

● Choose the option to install the launcher for all users by checking
the corresponding checkbox,
● verify the python installation in windows

Go to Python integrated Development Environment [IDLE] in windows search

bar, you can see the “IDLE (python3.12.64-bit)”open the IDLE screen itself you
can see the version.

This gives the com formation of successfully installation of Python.

Installation of essential Packages Numpy and Pandas.

a. Install numpy package:

NumPy is an open-source Python library that facilitates eﬃcient
numerical operations on large quantities of data. There are a few functions
that exist in NumPy that we use on pandas DataFrames.
It is deﬁned as a Python package used for performing the various
numerical computations and processing of the multidimensional and single-
dimensional array elements. The calculations using Numpy arrays are faster
than the normal Python array. It is also capable of handling a vast amount of
data
● Steps to install Numpy is
Step1: open Command prompt CMD
Step2: open the python directory
C:\User\Appdata\Local\Programs\Python\Python 3.12\

Step3: install numpy:

By typing the command
Pip install numpy

b. Install pandas package:

Pandas is a very popular library for working with data (its goal is to be
the most powerful and flexible open-source tool, and in our opinion, it has
reached that goal). DataFrames are at the center of pandas. A DataFrame
is structured like a table or spreadsheet. The rows and the columns both
have indexes, and you can perform operations on rows or columns
separately. It can perform five significant steps required for processing and
analysis of data irrespective of the origin of the data, i.e., load,
manipulate, prepare, model, and analyze.

● Steps to install pandas is

Step1: open Command prompt CMD
Step2: open the python directory
C:\User\Appdata\Local\Programs\Python\Python 3.12\

Step3: install pandas:

By typing the command
Pip install pandas

Simple program to show the installed library versions to provide conformation of

successful installing.
import
numpy
import
pandas
print("numpy library version is: ")
print(numpy. version ) #please type two underscore symbols.
print("numpy library is successfully installed")
print(" ")

print("pandas library
version is: ") print(pandas.
version )
print("pandas library is successfully installed")

Program 2 Introduce scikit-learn as a machine learning library.

Scikit-learn is a popular open-source machine learning library in Python that

offers a comprehensive set of tools and algorithms for data analysis,
modeling, and machine learning tasks. It is built on foundational libraries like
NumPy, SciPy, and Matplotlib. Scikit-learn provides a user-friendly and
eﬃcient framework for both beginners and experts in the ﬁeld of data
science.

Some key points to introduce scikit-learn as a machine learning library:

1. Comprehensive Machine Learning Library: Scikit-learn offers a wide

range of machine learning algorithms and tools for various tasks such
as classiﬁcation, regression, clustering, dimensionality reduction, and
more.

2. User-Friendly and Easy to Use: It is designed with a user-friendly

interface and simple syntax, making it accessible for both beginners and
experienced machine learning practitioners.

3. Integration with Scientiﬁc Computing Libraries: Scikit-learn

integrates well with other scientiﬁc computing libraries in Python such
as NumPy, SciPy, and Matplotlib, providing a powerful environment for
machine learning tasks.

4. Extensive Documentation and Community Support: The library

comes with comprehensive documentation, tutorials, and examples to
help users understand and implement machine learning algorithms
effectively. Additionally, there is a vibrant community around scikit-learn
that provides support and contributions.

5. Eﬃcient Implementation of Algorithms: Scikit-learn is built on top of

NumPy, SciPy, and Cython, which allows for eﬃcient implementation of
machine learning algorithms and scalability to large datasets.

6. Support for Model Evaluation and Validation: The library provides

tools for model evaluation, hyperparameter tuning, cross-validation, and
performance metrics, enabling users to assess and improve the quality
of their machine learning models.

7. Flexibility and Customization: Scikit-learn offers ﬂexibility for

customization and parameter tuning, allowing users to adapt algorithms
to their speciﬁc requirements and datasets.

8. Wide Adoption and Industry Usage: Due to its ease of use,

performance, and versatility, scikit-learn is widely adopted in academia,
research, and industry for various machine learning applications.

Overall, scikit-learn is a powerful and versatile machine learning library in

Python that empowers users to build and deploy machine learning models
eﬃciently for a wide range of tasks and applications.

Lab Program 3: Install and set up scikit-learn and other necessary

tools.
PIP is a package manager for Python, which means it allows you to
install and manage libraries and dependencies that are supplemental to the
standard library. (A package contains all the ﬁles you need for a module, and
modules are Python code libraries that you can include in your projects.) PIP3
is also a package manager, designed to replace PIP to solve few problems
caused by it. Latest versions of python 3.x allows the use of pips command for
installing python libraries.

Scikit-learn (Sklearn) Library:

Scikit-learn is the most useful machine learning library. It provides

modules for data analysis and statistical modelling. It provides a wide range
of eﬃcient tools such as classiﬁcation, regression, and clustering and
dimensionality reduction via a consistence interface in Python. This library,
which is largely written in Python, is built upon following essential libraries:
NumPy, Pandas, SciPy and Matplotlib libraries.

Install numpy library

● Steps to install Numpy is
Step1: open Command prompt CMD
Step2: open the python directory
C:\User\Appdata\Local\Programs\Python\Python 3.12\

Step3: install numpy:

By typing the command
Pip install numpy
● Steps to install pandas is
Step1: open Command prompt CMD
Step2: open the python directory
C:\User\Appdata\Local\Programs\Python\Python 3.12\

Step3: install pandas:

By typing the command
Pip install pandas

● Steps to install matplotlib is

Step1: open Command prompt CMD
Step2: open the python directory
C:\User\Appdata\Local\Programs\Python\Python 3.12\

Step3: install pandas:

By typing the command
Pip install matplotlib

● Steps to install scipy is

Step1: open Command prompt CMD
Step2: open the python directory
C:\User\Appdata\Local\Programs\Python\Python 3.12\

Step3: install pandas:

By typing the command
Pip install scipy

● Steps to install scikit-learn(sklearn) is

Step1: open Command prompt CMD
Step2: open the python directory
C:\User\Appdata\Local\Programs\Python\Python 3.12\

Step3: install pandas:

By typing the command
Pip install scikit-learn

Simple program to show the installed library versions to provide conformation of

successful installing.
import numpy
import pandas
import scipy
import matplotlib
import sklearn
print("numpy library version is: ")
print(numpy.__version__) #please type two underscore symbols.
print("numpy library is successfully installed")
print(" ")
print("pandas library version is: ")
print(pandas.__version__)
print("pandas library is successfully installed")
print(" ")
print("scipy library version is: ")
print(scipy.__version__)
print("scipy library is successfully installed")
print(" ")
print("matplotlib library version is: ")
print(matplotlib.__version__)
print("matplotlib library is successfully installed")
print(" ")
print("scikit-learn(sklearn) library version is: ")
print(sklearn.__version__)
print("sklearn library is successfully installed")

Lab Program 4: Write a program to Load and explore the dataset

of .CVS and excel ﬁles using pandas.
import pandas as pd

csv_ﬁle_path='C:\\ML_Projects\\sample_data.csv'
excel_ﬁle_path='C:\\ML_Projects\\sample_data.xlsx'

data_csv=pd.read_csv(csv_ﬁle_path)

print("CSV File data:")

print(data_csv)

data_excel=pd.read_excel(excel_ﬁle_path)

print("\nExcel File data:")

print(data_excel)

print("\n Data Descriptions:")

print("CSV data Decription:")

print(data_csv.describe())

print("\n Excel data Decription:")

print(data_excel.describe())

print("\n Datatypes in CSv ﬁles:")

print(data_csv.dtypes)

print("\n Datatypes in Excel ﬁles:")

print(data_excel.dtypes)
Output

CSV File data:

Name Age Score

0 Manoj 19 95

1 Dilip 20 97

2 Manjula 40 35

3 Rakesh 24 45

4 Kushal 22 80

Excel File data:

Name Course Sem

0 Rajesh BCA 1

1 Ramesh BCA 2

2 Swati BCOM 1

3 Florina BCOM 3

4 Pooja BBA 2

5 Raghu BBA 4

Data Descriptions:

CSV data Decription:

Age Score

count 5.000000 5.000000

mean 25.000000 70.400000

std 8.602325 28.736736

min 19.000000 35.000000

25% 20.000000 45.000000

50% 22.000000 80.000000

75% 24.000000 95.000000

max 40.000000 97.000000

Excel data Decription:

Sem

count 6.000000

mean 2.166667

std 1.169045

min 1.000000

25% 1.250000

50% 2.000000

75% 2.750000

max 4.000000

Datatypes in CSv ﬁles:

Name object

Age int64

Score int64

dtype: object

Datatypes in Excel ﬁles:

Name object

Course object

Sem int64

dtype: object
Lab Program 5: Write a program to visualize the dataset to gain insights using
Matplotlib or Seaborn by plotting scatter plots, bar charts.

import pandas as pd

import matplotlib.pyplot as plt

data= pd.read_csv('C:\\ML_Projects\\study_data.csv')

plt.ﬁgure(ﬁgsize=(14,7))

plt.subplot(1,2,1)

plt.scatter(data['Study Hours'], data['Exam Score'], color='cyan', edgecolor='k', alpha=0.7)

plt.title('Study Hours vs .Exam Scores')

plt.xlabel('Study Hours')

plt.ylabel('Exam Scores')

plt.grid(True)

bins=[0,2,4,6,8,10,12]

labels=['0-2', '2-4', '4-6', '6-8', '8-10', '10-12']

data['Study Hours Range']=pd.cut(data['Study Hours'], bins=bins, labels=labels, right=False)

grouped_data=data.groupby('Study Hours Range')['Exam Score'].mean()

plt.subplot(1,2,2)

grouped_data.plot(kind='bar', color='pink')

plt.title('Average Exam Score by Study Hour Range')

plt.xlabel('Study Hours Range')

plt.ylabel('Average Exam Scores')

plt.xticks(rotation=0)

plt.tight_layout()

plt.show()

output
Lab Program 6: Write a program to Handle missing data, encode
categorical variables, and perform feature scaling.
import pandas as pd

from sklearn.impute import SimpleImputer

from sklearn.preprocessing import OneHotEncoder, StandardScaler

data={

'Age': [25, 30, None, 28, 35],

'Gender': ['Female', 'Male', 'Male', 'Female', 'Male'],

'Income': [50000, 60000, 45000, None, 70000]

}

df= pd.DataFrame(data)

#Handling missing data.

imputer = SimpleImputer(strategy='mean')

df[['Age', 'Income']] = imputer.ﬁt_transform(df[['Age', 'Income']])

#Print data after handling missing values

print("Data after handling missing values:")

print(df)
#Encoding categorical variables

encoder = OneHotEncoder()

encoded_data = encoder.ﬁt_transform(df[['Gender']]).toarray()

#Print data after categorical encoding

encoded_df= pd.DataFrame(encoded_data,
columns=encoder.get_feature_names_out(['Gender']))

print("\nData after categorical encoding:")

print(encoded_df)

scaler = StandardScaler()

scaled_data =scaler.ﬁt_transform(df[['Age', 'Income']])

#Print data after feature scaling

scaled_df = pd.DataFrame(scaled_data, columns=['Scaled Age', 'Scaled Income'])

print("\nData after feature scaling:")

print(scaled_df)

Output
Lab Program 7: Write a program to implement a k-Nearest Neighbours (k-NN) classiﬁer
using scikitlearn and Train the classiﬁer on the dataset and evaluate its performance.

import numpy as np

from sklearn.model_selection import train_test_split

from sklearn.neighbors import KNeighborsClassiﬁer

from sklearn.metrics import accuracy_score

#Dummy student data: exam score 1, exam score 2, pass/fail (features)

X = np.array([[88, 75], [95, 90], [60, 50], [45, 30], [30, 48], [85, 95], [70, 60], [50, 55], [40, 45], [60,
70]])

y= np.array([1, 1, 0, 0, 0, 1, 1, 0, 0, 1]) #Binary classes for demonstration

#Split the data into training and testing sets

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=.2,random_state=42)

#Initialize the K-NN classiﬁer with k=3

knn = KNeighborsClassiﬁer(n_neighbors=3)

#Train the classiﬁer on the training data

knn.ﬁt(X_train,y_train)

# Evaluate the classiﬁer's performance

y_pred=knn.predict(X_test)

accuracy=accuracy_score(y_test, y_pred)

print("Accuracy on the test set: {:.2f}".format(accuracy))

#Take user input for exam scores

exam_score1 = ﬂoat(input("Enter Exam Score 1: "))

exam_score2 = ﬂoat(input("Enter Exam Score 2: "))

# Prepare the user input for prediction

user_input = np.array([[exam_score1, exam_score2]])

# Use the trained k-NN classiﬁer to predict the outcome

predicted_outcome=knn.predict(user_input)

if predicted_outcome [0] == 1:

print("Based on the exam scores provided, the student is predicted to pass.")

else:

print("Based on the exam scores provided, the student is predicted to fail.")

OutPut:

Lab Program 08. Write a program to implement a linear regression model for regression
tasks and Train the model on a dataset.
#Regression Algorithm
import numpy as np
import matplotlib.pyplot as plt

X=np.array([18,17,26,19,27,31,14,29,32,26]) #Experince in months

Y=np.array([16000,11000,23000,23000,23000, 32000,15000, 33000, 32000,
32000]) #Salary
print("X-values are:")
print (X)
print("Y-values are:")
print (Y)
print()
#Find mean values of X and Y data.
mean_x=np.mean (X)
print (f"Mean of X is: {mean_x}")
mean_y=np.mean(Y)
print (f"Mean of Y is: {mean_y}")
print()
variance_x = np.var (X)
print (f"Variance of X is: {variance_x}")
covariance= (np.sum((X- mean_x) * (Y -mean_y)))/(len(X))
print (f"Covariance of X is: {covariance}")
print()
#Find a and b values.
a= covariance / variance_x
print("a =covariance / variance_x so, ")
print (f"a={a}")
b = mean_y-a* mean_x
print("b= mean_y-a *mean_x so, ")
print (f"b= {b}")
print()
#Predict Y- values to the existing X- values.
Y_pred=a* X + b
print (f"Regression Line: Y = {a:.2f} + {b:.2f}X")
print("Y-values obtained are =" , Y_pred)
print("And corresponding X- values are =" , X)
print()
plt.scatter (X, Y, label="Original Data")
plt.plot(X, Y_pred, color="red", label=f"Regression Line: Y = {a:.2f} + {b:.2f}X")
plt.xlabel("Experince")
plt.ylabel("Salary")
plt.legend()
plt.grid(True)
plt.show()

# Getting the Solution that is Y- value, for new data set that is X- value.
new_X = 7.5
new_Y=a* new_X + b
print()
print (f"Predict Y-value using= {a:.2f} + {b:.2f}X for new X- value= {new_X} ")
print (f"Predicted Y-value is =(new_Y:.2f) ")
Lab Program 09. Write a program to implement a decision tree classifier using
scikit-learn and visualize the decision tree and understand its splits.
from sklearn.tree import DecisionTreeClassifier, plot_tree
from matplotlib.pyplot import figure,show
import matplotlib.pyplot as plt

# Deﬁne some features and corresponding classiﬁcations

features = [[140,1],[130,0],[150,0],[170,1],[180,1],[100,0],[172,1]]
classiﬁcations = ["play","don't play","don't play","play","play","don't play","play",]

import numpy as np

features=np.array(features)
classiﬁcations=np.array(classiﬁcations)

# Create a decision tree classiﬁer

clf = DecisionTreeClassiﬁer()

# Train the classiﬁer on the data

clf = clf.ﬁt(features,classiﬁcations)

# Print the Prediction

predictions = clf.predict([[170,1]])

# Creaye a ﬁgure for Plotting the Tree

print("Decision Tree Classifier:")
print("Predict Class Label for New Instance is: [170,1]")
print("Class Label for New Instance is:",predictions[0])
plt.figure(figsize=(5,8))
plot_tree(clf,feature_names=["Temperature","Huminity"],class_names=classificatio
ns,filled=True,rounded=True)
plt.show()

Lab Program 10. Write a program to Implement K-Means clustering

and Visualize clusters.
import matplotlib.pyplot as plt
from sklearn.cluster import KMeans
import numpy as np
import pandas as pd
data=[[1,1],[1.5,1.8], [5,8], [8,8],[1,0.6],[9,11]]
print("Considered data for K-Means clustering is:")
print(data)

print("Considered data as numpy list is:")

data=np.array(data)
print(data)

print("Assumed K-Value is:")

k=3
print(k)

print("K-Means object is given following value:")

Kmeans=kMeans(n_clusters=k, random_state=42, n_init=10)
print(Kmeans)

Kmeans.ﬁt(data)
print("Integer labels provided to each data points are:")
labels=Kmeans.Labels_
print(labels)

print("Calculated centroid points are:")

centroids=Kmeans.cluster_centers_
print(Centroids)
plt.scatter(data[:,0], data[:,1], c=labels, cmap='viridis')
plt.scatter(centroids[:,0], centroids[:,1], s=60, marker='x', c='red')

plt.xlabel("X-axis")
plt.ylabel("Y-axis")
plt.title("K-Means clustering (k=" + str(k) + ")")
plt.grid()
plt.show()

ML LAB MANUAL (Ashwini Y) Compressed
No ratings yet
ML LAB MANUAL (Ashwini Y) Compressed
31 pages
Data Analytics Lab Manual
No ratings yet
Data Analytics Lab Manual
66 pages
Machine Learning Lab Dlihebca6sem
100% (1)
Machine Learning Lab Dlihebca6sem
25 pages
ML Lab 1
No ratings yet
ML Lab 1
24 pages
BCA VI Sem ML Lab Manual
No ratings yet
BCA VI Sem ML Lab Manual
42 pages
Dsa Lab Manual
No ratings yet
Dsa Lab Manual
59 pages
Fds Lab Manual PDF
No ratings yet
Fds Lab Manual PDF
80 pages
ML With Python Lab (MCA)
No ratings yet
ML With Python Lab (MCA)
36 pages
FDS Lab Manual For CSE 1
No ratings yet
FDS Lab Manual For CSE 1
86 pages
Data Science
No ratings yet
Data Science
17 pages
CS3361 - Data Science Laboratory
No ratings yet
CS3361 - Data Science Laboratory
31 pages
DS Lab Manual
No ratings yet
DS Lab Manual
113 pages
Data Science Lab Manual
No ratings yet
Data Science Lab Manual
56 pages
Scikit Learn User Guide 0.12
100% (1)
Scikit Learn User Guide 0.12
1,049 pages
Machine Learning - Python Libraries
No ratings yet
Machine Learning - Python Libraries
12 pages
ML LabManual
No ratings yet
ML LabManual
16 pages
ML Lab
No ratings yet
ML Lab
4 pages
ML - Lab - Programs - J
No ratings yet
ML - Lab - Programs - J
18 pages
Numpy, Scipy, Matplot
No ratings yet
Numpy, Scipy, Matplot
5 pages
Pai 6
No ratings yet
Pai 6
17 pages
FODS Lab Manual - Organized
No ratings yet
FODS Lab Manual - Organized
93 pages
Introduction To Python and ML Libraries
No ratings yet
Introduction To Python and ML Libraries
11 pages
Unit 4
No ratings yet
Unit 4
105 pages
Machine Learning Lab Programs
No ratings yet
Machine Learning Lab Programs
6 pages
Cse-Fds Lab Manual
No ratings yet
Cse-Fds Lab Manual
74 pages
Chapter1 2challenges
No ratings yet
Chapter1 2challenges
10 pages
ML Lab Manual VI Sem
No ratings yet
ML Lab Manual VI Sem
35 pages
Centurion University of Technology and Managament: Topic: Numpy, Pandas, Matplotlib
No ratings yet
Centurion University of Technology and Managament: Topic: Numpy, Pandas, Matplotlib
12 pages
Sec-D ML Practical File PDF
No ratings yet
Sec-D ML Practical File PDF
19 pages
Sams Teach Yourself Java 2 in 24 Hours 3rd Edition Rogers Cadenhead - Download The Ebook Today To Explore Every Detail
100% (19)
Sams Teach Yourself Java 2 in 24 Hours 3rd Edition Rogers Cadenhead - Download The Ebook Today To Explore Every Detail
70 pages
ML Lab - Abbs
No ratings yet
ML Lab - Abbs
23 pages
Data Sets
No ratings yet
Data Sets
36 pages
Unit 1-1
No ratings yet
Unit 1-1
10 pages
CS3361 Data Science Lab Manual
No ratings yet
CS3361 Data Science Lab Manual
65 pages
ML Lab (2MCA)
No ratings yet
ML Lab (2MCA)
52 pages
Lab Manual ML R22
No ratings yet
Lab Manual ML R22
27 pages
CS3362 Data Science Laboratory Alok Kumar
No ratings yet
CS3362 Data Science Laboratory Alok Kumar
50 pages
ML Lab Manual
No ratings yet
ML Lab Manual
20 pages
Algorithms With JULIA
100% (1)
Algorithms With JULIA
447 pages
Pre - Class Python Library Installation Instructions
No ratings yet
Pre - Class Python Library Installation Instructions
9 pages
ML Lab Manual (Vim)
No ratings yet
ML Lab Manual (Vim)
13 pages
Fdsa Manual
No ratings yet
Fdsa Manual
53 pages
ML Pgms - 24mar2025
No ratings yet
ML Pgms - 24mar2025
23 pages
Exp1 Ref Doc Installation
No ratings yet
Exp1 Ref Doc Installation
6 pages
Ex - No-1 Installation and Exploration
No ratings yet
Ex - No-1 Installation and Exploration
3 pages
FDS Ex No 1
No ratings yet
FDS Ex No 1
6 pages
Mrdn-Mi 5
No ratings yet
Mrdn-Mi 5
23 pages
Data Science Vs Data Analytics
No ratings yet
Data Science Vs Data Analytics
5 pages
ML-Lab Manual - NEP - DSS
No ratings yet
ML-Lab Manual - NEP - DSS
23 pages
Libraries
No ratings yet
Libraries
3 pages
1.1-1.4 - Introduction To Python
No ratings yet
1.1-1.4 - Introduction To Python
50 pages
Programming For Data Science
No ratings yet
Programming For Data Science
48 pages
ML Exp 1
No ratings yet
ML Exp 1
6 pages
Lecture # 2
No ratings yet
Lecture # 2
21 pages
Machine Learning Lab Set1
No ratings yet
Machine Learning Lab Set1
5 pages
Statistics Machine Learning Python Draft
No ratings yet
Statistics Machine Learning Python Draft
319 pages
Data Science Lab Manual
No ratings yet
Data Science Lab Manual
18 pages
23CS302 - Dslab - Experiment 1
No ratings yet
23CS302 - Dslab - Experiment 1
5 pages
Fds PDF
No ratings yet
Fds PDF
4 pages
AI/ML Python Modules
No ratings yet
AI/ML Python Modules
17 pages
MLk65opyk45o4v 22i5vi2 It9359ci5ji3tjui3wmdlakmlmakmkmfiejrieuighegiurhgiurguir
No ratings yet
MLk65opyk45o4v 22i5vi2 It9359ci5ji3tjui3wmdlakmlmakmkmfiejrieuighegiurhgiurguir
23 pages
Administracion Informix
No ratings yet
Administracion Informix
185 pages
Chapter 5 Review of C++
No ratings yet
Chapter 5 Review of C++
47 pages
Problem Set Ee8205 PDF
No ratings yet
Problem Set Ee8205 PDF
4 pages
11 - Modular Programming
No ratings yet
11 - Modular Programming
23 pages
Object Oriented Programming - CS3391 - Question Bank and Important 2 Marks Questions With Answer
No ratings yet
Object Oriented Programming - CS3391 - Question Bank and Important 2 Marks Questions With Answer
44 pages
Construction Company
100% (1)
Construction Company
23 pages
CSE Final Exam Solutions
No ratings yet
CSE Final Exam Solutions
11 pages
Web Tech Set 2
No ratings yet
Web Tech Set 2
17 pages
Image Processing (RCS082) Unit V Huffman Coding
No ratings yet
Image Processing (RCS082) Unit V Huffman Coding
12 pages
All BZ
No ratings yet
All BZ
8 pages
SBWP
No ratings yet
SBWP
5 pages
It Lab Mysql
No ratings yet
It Lab Mysql
9 pages
Lab Manual 1ala
100% (1)
Lab Manual 1ala
24 pages
Bisma Ali - Assignment
No ratings yet
Bisma Ali - Assignment
5 pages
Lizard Stream Cipher
No ratings yet
Lizard Stream Cipher
12 pages
Computer Science 3B 2020 Exam SSSA - ACSSE - CSC3B - 2020 - EXAM - SSSA
No ratings yet
Computer Science 3B 2020 Exam SSSA - ACSSE - CSC3B - 2020 - EXAM - SSSA
8 pages
MAD Experiment 25
No ratings yet
MAD Experiment 25
3 pages
1CP2 02 AdSAMS2 QU
No ratings yet
1CP2 02 AdSAMS2 QU
9 pages
Computer Science 2018 Mark Scheme
No ratings yet
Computer Science 2018 Mark Scheme
8 pages
Osy-Questiuon Bank
No ratings yet
Osy-Questiuon Bank
9 pages
Data Structutre PDF
No ratings yet
Data Structutre PDF
19 pages
Csc721 Programming Technique II
No ratings yet
Csc721 Programming Technique II
96 pages
Prctical BC
No ratings yet
Prctical BC
2 pages
Digital System Design-Module07-Behavioral Modeling (Cont'd)
No ratings yet
Digital System Design-Module07-Behavioral Modeling (Cont'd)
40 pages
Arrays Answers Python
No ratings yet
Arrays Answers Python
9 pages
Flow Chart 2
No ratings yet
Flow Chart 2
16 pages

ML (Lab Programs)

Uploaded by

ML (Lab Programs)

Uploaded by

Machine Learning

● Installation of Python and Set-up Python

Search Python in the Google search bar “https://www.python.org/downloads/

● Python latest version, Python 3.12.1 64-bit/32-bit and download the

Go to Python integrated Development Environment [IDLE] in windows search

This gives the com formation of successfully installation of Python.

a. Install numpy package:

Step3: install numpy:

b. Install pandas package:

● Steps to install pandas is

Step3: install pandas:

Simple program to show the installed library versions to provide conformation of

Program 2 Introduce scikit-learn as a machine learning library.

Scikit-learn is a popular open-source machine learning library in Python that

Some key points to introduce scikit-learn as a machine learning library:

1. Comprehensive Machine Learning Library: Scikit-learn offers a wide

2. User-Friendly and Easy to Use: It is designed with a user-friendly

3. Integration with Scientiﬁc Computing Libraries: Scikit-learn

4. Extensive Documentation and Community Support: The library

5. Eﬃcient Implementation of Algorithms: Scikit-learn is built on top of

6. Support for Model Evaluation and Validation: The library provides

7. Flexibility and Customization: Scikit-learn offers ﬂexibility for

8. Wide Adoption and Industry Usage: Due to its ease of use,

Overall, scikit-learn is a powerful and versatile machine learning library in

Lab Program 3: Install and set up scikit-learn and other necessary

Scikit-learn (Sklearn) Library:

Scikit-learn is the most useful machine learning library. It provides

Install numpy library

Step3: install numpy:

Step3: install pandas:

● Steps to install matplotlib is

Step3: install pandas:

● Steps to install scipy is

Step3: install pandas:

● Steps to install scikit-learn(sklearn) is

Step3: install pandas:

Simple program to show the installed library versions to provide conformation of

Lab Program 4: Write a program to Load and explore the dataset

print("CSV File data:")

print("\nExcel File data:")

print("\n Data Descriptions:")

print("CSV data Decription:")

print("\n Excel data Decription:")

print("\n Datatypes in CSv ﬁles:")

print("\n Datatypes in Excel ﬁles:")

CSV File data:

Name Age Score

Excel File data:

Name Course Sem

CSV data Decription:

count 5.000000 5.000000

mean 25.000000 70.400000

std 8.602325 28.736736

min 19.000000 35.000000

50% 22.000000 80.000000

75% 24.000000 95.000000

max 40.000000 97.000000

Excel data Decription:

Datatypes in CSv ﬁles:

Datatypes in Excel ﬁles:

import pandas as pd

import matplotlib.pyplot as plt

plt.scatter(data['Study Hours'], data['Exam Score'], color='cyan', edgecolor='k', alpha=0.7)

plt.title('Study Hours vs .Exam Scores')

labels=['0-2', '2-4', '4-6', '6-8', '8-10', '10-12']

data['Study Hours Range']=pd.cut(data['Study Hours'], bins=bins, labels=labels, right=False)

grouped_data=data.groupby('Study Hours Range')['Exam Score'].mean()

plt.title('Average Exam Score by Study Hour Range')

plt.xlabel('Study Hours Range')

plt.ylabel('Average Exam Scores')

from sklearn.impute import SimpleImputer

from sklearn.preprocessing import OneHotEncoder, StandardScaler

'Age': [25, 30, None, 28, 35],

'Gender': ['Female', 'Male', 'Male', 'Female', 'Male'],

'Income': [50000, 60000, 45000, None, 70000]

#Handling missing data.

Machine Learning

● Installation of Python and Set-up Python

Search Python in the Google search bar “https://www.python.org/downloads/

This gives the com formation of successfully installation of Python.

a. Install numpy package:

Step3: install numpy:

b. Install pandas package:

● Steps to install pandas is

Step3: install pandas:

Program 2 Introduce scikit-learn as a machine learning library.

Some key points to introduce scikit-learn as a machine learning library:

Scikit-learn (Sklearn) Library:

Install numpy library

Step3: install numpy:

Step3: install pandas:

● Steps to install matplotlib is

Step3: install pandas:

● Steps to install scipy is

Step3: install pandas:

● Steps to install scikit-learn(sklearn) is

Step3: install pandas:

print("CSV File data:")

print("\nExcel File data:")

print("\n Data Descriptions:")

print("CSV data Decription:")

print("\n Excel data Decription:")

print("\n Datatypes in CSv ﬁles:")

print("\n Datatypes in Excel ﬁles:")

CSV File data:

Name Age Score

Excel File data:

Name Course Sem

CSV data Decription:

count 5.000000 5.000000

mean 25.000000 70.400000

std 8.602325 28.736736

min 19.000000 35.000000

50% 22.000000 80.000000

75% 24.000000 95.000000

max 40.000000 97.000000

Excel data Decription:

Datatypes in CSv ﬁles:

Datatypes in Excel ﬁles:

import pandas as pd

import matplotlib.pyplot as plt

plt.scatter(data['Study Hours'], data['Exam Score'], color='cyan', edgecolor='k', alpha=0.7)

plt.title('Study Hours vs .Exam Scores')

labels=['0-2', '2-4', '4-6', '6-8', '8-10', '10-12']

data['Study Hours Range']=pd.cut(data['Study Hours'], bins=bins, labels=labels, right=False)

grouped_data=data.groupby('Study Hours Range')['Exam Score'].mean()

plt.title('Average Exam Score by Study Hour Range')

plt.ylabel('Average Exam Scores')

from sklearn.impute import SimpleImputer

from sklearn.preprocessing import OneHotEncoder, StandardScaler

'Age': [25, 30, None, 28, 35],

'Gender': ['Female', 'Male', 'Male', 'Female', 'Male'],

'Income': [50000, 60000, 45000, None, 70000]

#Handling missing data.

df[['Age', 'Income']] = imputer.ﬁt_transform(df[['Age', 'Income']])

#Print data after handling missing values

print("Data after handling missing values:")

#Print data after categorical encoding

print("\nData after categorical encoding:")

scaled_data =scaler.ﬁt_transform(df[['Age', 'Income']])

#Print data after feature scaling

scaled_df = pd.DataFrame(scaled_data, columns=['Scaled Age', 'Scaled Income'])

print("\nData after feature scaling:")

import numpy as np

from sklearn.model_selection import train_test_split

from sklearn.neighbors import KNeighborsClassiﬁer

from sklearn.metrics import accuracy_score

#Dummy student data: exam score 1, exam score 2, pass/fail (features)

y= np.array([1, 1, 0, 0, 0, 1, 1, 0, 0, 1]) #Binary classes for demonstration

#Split the data into training and testing sets

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=.2,random_state=42)

#Initialize the K-NN classiﬁer with k=3

#Train the classiﬁer on the training data

# Evaluate the classiﬁer's performance

print("Accuracy on the test set: {:.2f}".format(accuracy))

#Take user input for exam scores