In [22]: import pandas as pd
import numpy as np
import seaborn as sns
from sklearn.datasets import load_wine
from sklearn.preprocessing import StandardScaler
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LogisticRegression
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier
from sklearn.metrics import classification_report
wine_data = load_wine()
In [3]: wine_data.data
Out[3]: array([[1.423e+01, 1.710e+00, 2.430e+00, ..., 1.040e+00, 3.920e+00,
                1.065e+03],
               [1.320e+01, 1.780e+00, 2.140e+00, ..., 1.050e+00, 3.400e+00,
                1.050e+03],
               [1.316e+01, 2.360e+00, 2.670e+00, ..., 1.030e+00, 3.170e+00,
                1.185e+03],
               ...,
               [1.327e+01, 4.280e+00, 2.260e+00, ..., 5.900e-01, 1.560e+00,
                8.350e+02],
               [1.317e+01, 2.590e+00, 2.370e+00, ..., 6.000e-01, 1.620e+00,
                8.400e+02],
               [1.413e+01, 4.100e+00, 2.740e+00, ..., 6.100e-01, 1.600e+00,
                5.600e+02]])
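load_wine() returns a Bunch object whose .data attribute is the raw NumPy array shown above. As a side note, scikit-learn 0.23 and later can hand back a DataFrame directly, which would skip the manual conversion in the next cell; a minimal sketch:

# Alternative: ask for a DataFrame up front (requires scikit-learn >= 0.23)
wine_bunch = load_wine(as_frame=True)
wine_df_alt = wine_bunch.frame  # the 13 features plus a "target" column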
In [4]: # Convert the data to a pandas DataFrame
wine_df = pd.DataFrame(wine_data.data, columns=wine_data.feature_names)
In [5]: # Add the target label
wine_df["target"] = wine_data.target
In [6]: # Take a preview
wine_df.head()
Out[6]:
   alcohol  malic_acid   ash  alcalinity_of_ash  magnesium  total_phenols  flavanoids  ...
0    14.23        1.71  2.43               15.6      127.0           2.80        3.06  ...
1    13.20        1.78  2.14               11.2      100.0           2.65        2.76  ...
2    13.16        2.36  2.67               18.6      101.0           2.80        3.24  ...
3    14.37        1.95  2.50               16.8      113.0           3.85        3.49  ...
4    13.24        2.59  2.87               21.0      118.0           2.80        2.69  ...
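Before going further, it is also worth checking how the three cultivars (classes 0, 1 and 2) are distributed, since a strong imbalance would affect the modeling later on; a one-liner does it:

# Count how many samples fall into each class
print(wine_df["target"].value_counts())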
In [7]: wine_df.info()
<class 'pandas.core.frame.DataFrame'>
RangeIndex: 178 entries, 0 to 177
Data columns (total 14 columns):
 #   Column                        Non-Null Count  Dtype
---  ------                        --------------  -----
 0   alcohol                       178 non-null    float64
 1   malic_acid                    178 non-null    float64
 2   ash                           178 non-null    float64
 3   alcalinity_of_ash             178 non-null    float64
 4   magnesium                     178 non-null    float64
 5   total_phenols                 178 non-null    float64
 6   flavanoids                    178 non-null    float64
 7   nonflavanoid_phenols          178 non-null    float64
 8   proanthocyanins               178 non-null    float64
 9   color_intensity               178 non-null    float64
 10  hue                           178 non-null    float64
 11  od280/od315_of_diluted_wines  178 non-null    float64
 12  proline                       178 non-null    float64
 13  target                        178 non-null    int32
dtypes: float64(13), int32(1)
memory usage: 18.9 KB
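The info() output shows 178 non-null entries in every column, so there are no missing values to deal with. A quick check makes this explicit:

# Total number of missing values across the whole DataFrame (expected: 0)
print(wine_df.isna().sum().sum())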
In [8]: wine_df.describe()
Out[8]:
          alcohol  malic_acid         ash  alcalinity_of_ash   magnesium  total_phenols  flavanoids  ...
count  178.000000  178.000000  178.000000         178.000000  178.000000     178.000000  178.000000  ...
mean    13.000618    2.336348    2.366517          19.494944   99.741573       2.295112    2.029270  ...
std      0.811827    1.117146    0.274344           3.339564   14.282484       0.625851    0.998859  ...
min     11.030000    0.740000    1.360000          10.600000   70.000000       0.980000    0.340000  ...
25%     12.362500    1.602500    2.210000          17.200000   88.000000       1.742500    1.205000  ...
50%     13.050000    1.865000    2.360000          19.500000   98.000000       2.355000    2.135000  ...
75%     13.677500    3.082500    2.557500          21.500000  107.000000       2.800000    2.875000  ...
max     14.830000    5.800000    3.230000          30.000000  162.000000       3.880000    5.080000  ...
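Note how different the feature scales are: magnesium lives in the hundreds while ash stays below 4. This disparity is what motivates the standardization step further down; printing the per-feature ranges makes it obvious:

# Per-feature range, to illustrate the scale differences
print(wine_df[wine_data.feature_names].max() - wine_df[wine_data.feature_names].min())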
In [9]: wine_df.tail()
Out[9]:
     alcohol  malic_acid   ash  alcalinity_of_ash  magnesium  total_phenols  flavanoids  ...
173    13.71        5.65  2.45               20.5       95.0           1.68        0.61  ...
174    13.40        3.91  2.48               23.0      102.0           1.80        0.75  ...
175    13.27        4.28  2.26               20.0      120.0           1.59        0.69  ...
176    13.17        2.59  2.37               20.0      120.0           1.65        0.68  ...
177    14.13        4.10  2.74               24.5       96.0           2.05        0.76  ...
In [11]: # Split the data into features and label
X = wine_df[wine_data.feature_names].copy()
y = wine_df["target"].copy()
In [12]: # Instantiate the scaler and fit it on the features
scaler = StandardScaler()
scaler.fit(X)
Out[12]: StandardScaler()
In [13]: # Transform the features
# (pass the DataFrame itself: the scaler was fit on a DataFrame, and newer
# scikit-learn versions warn when it is then asked to transform a bare array)
X_scaled = scaler.transform(X)
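The fit and transform steps can also be collapsed into one call, which is the more common idiom when no separate fitting data is involved:

# Equivalent one-step idiom
X_scaled = scaler.fit_transform(X)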
In [14]: # View the first instance
print(X_scaled[0])
[ 1.51861254 -0.5622498 0.23205254 -1.16959318 1.91390522 0.80899739
1.03481896 -0.65956311 1.22488398 0.25171685 0.36217728 1.84791957
1.01300893]
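As a sanity check, every standardized feature should now have mean ≈ 0 and standard deviation ≈ 1 over the full dataset:

# Verify the scaling worked as expected
print(X_scaled.mean(axis=0).round(6))  # all ~0
print(X_scaled.std(axis=0).round(6))   # all ~1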
In [16]: # Split the data into train and test sets (70/30)
X_train_scaled, X_test_scaled, y_train, y_test = train_test_split(X_scaled, y, train_size=0.7)
In [17]: # Check the splits are correct
print(f"Train size: {round(len(X_train_scaled) / len(X) * 100)}%")
print(f"Test size: {round(len(X_test_scaled) / len(X) * 100)}%")
Train size: 70%
Test size: 30%
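Without a fixed seed, this split changes on every run. For a reproducible, class-balanced split one would typically also pass random_state (the value 0 below is arbitrary) and stratify:

# Reproducible, stratified variant of the split above (seed chosen for illustration)
X_train_scaled, X_test_scaled, y_train, y_test = train_test_split(
    X_scaled, y, train_size=0.7, random_state=0, stratify=y
)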
In [19]: # Instantiating the models
logistic_regression = LogisticRegression()
svm = SVC()
tree = DecisionTreeClassifier()
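All three models are created with their default hyperparameters. For reference, these are some of the knobs one might tune later; the values below are purely illustrative, not recommendations:

# Illustrative non-default settings (example values only)
logistic_regression = LogisticRegression(max_iter=1000, C=1.0)
svm = SVC(kernel="rbf", C=1.0, gamma="scale")
tree = DecisionTreeClassifier(max_depth=5, random_state=0)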
In [20]: # Training the models
logistic_regression.fit(X_train_scaled, y_train)
svm.fit(X_train_scaled, y_train)
tree.fit(X_train_scaled, y_train)
Out[20]: DecisionTreeClassifier()
In [21]: # Making predictions with each model
log_reg_preds = logistic_regression.predict(X_test_scaled)
svm_preds = svm.predict(X_test_scaled)
tree_preds = tree.predict(X_test_scaled)
In [23]: # Store model predictions in a dictionary
# this makes it easier to iterate through each model
# and print the results.
model_preds = {
"Logistic Regression": log_reg_preds,
"Support Vector Machine": svm_preds,
"Decision Tree": tree_preds
}
In [24]: for model, preds in model_preds.items():
    print(f"{model} Results:\n{classification_report(y_test, preds)}", sep="\n\n")
Logistic Regression Results:
              precision    recall  f1-score   support

           0       1.00      1.00      1.00        17
           1       1.00      0.92      0.96        25
           2       0.86      1.00      0.92        12

    accuracy                           0.96        54
   macro avg       0.95      0.97      0.96        54
weighted avg       0.97      0.96      0.96        54

Support Vector Machine Results:
              precision    recall  f1-score   support

           0       1.00      1.00      1.00        17
           1       1.00      1.00      1.00        25
           2       1.00      1.00      1.00        12

    accuracy                           1.00        54
   macro avg       1.00      1.00      1.00        54
weighted avg       1.00      1.00      1.00        54

Decision Tree Results:
              precision    recall  f1-score   support

           0       0.94      0.94      0.94        17
           1       0.92      0.92      0.92        25
           2       0.92      0.92      0.92        12

    accuracy                           0.93        54
   macro avg       0.93      0.93      0.93        54
weighted avg       0.93      0.93      0.93        54
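classification_report gives a per-class breakdown; for a more compact comparison between models, one could also print a single accuracy score and a confusion matrix each, for example:

from sklearn.metrics import accuracy_score, confusion_matrix

# Compact per-model summary
for model, preds in model_preds.items():
    print(f"{model}: accuracy = {accuracy_score(y_test, preds):.2f}")
    print(confusion_matrix(y_test, preds))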