DECISION TREE
# Import necessary libraries
import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier, plot_tree
from sklearn.metrics import confusion_matrix, accuracy_score, classification_report
import matplotlib.pyplot as plt
# Load the dataset
data = {
'Outlook': ['Sunny', 'Sunny', 'Overcast', 'Rainy', 'Rainy', 'Rainy', 'Overcast', 'Sunny', 'Sunny',
'Rainy', 'Sunny', 'Overcast', 'Overcast', 'Rainy'],
'Temperature': ['Hot', 'Hot', 'Hot', 'Mild', 'Cool', 'Cool', 'Cool', 'Mild', 'Cool', 'Mild', 'Mild',
'Mild', 'Hot', 'Mild'],
'Humidity': ['High', 'High', 'High', 'High', 'Normal', 'Normal', 'Normal', 'High', 'Normal',
'Normal', 'Normal', 'High', 'Normal', 'High'],
'Wind': ['Weak', 'Strong', 'Weak', 'Weak', 'Weak', 'Strong', 'Strong', 'Weak', 'Weak', 'Weak',
'Strong', 'Strong', 'Weak', 'Strong'],
'PlayTennis': ['No', 'No', 'Yes', 'Yes', 'Yes', 'No', 'Yes', 'No', 'Yes', 'Yes', 'Yes', 'Yes', 'Yes', 'No']
}
# Convert the dictionary to a DataFrame
df = pd.DataFrame(data)
# Convert categorical variables to numerical using one-hot encoding
df = pd.get_dummies(df, columns=['Outlook', 'Temperature', 'Humidity', 'Wind'])
# Separate features and target variable
X = df.drop('PlayTennis', axis=1)
y = df['PlayTennis']
# Split the data into training and testing sets
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)
# Initialize Decision Tree classifier
decision_tree = DecisionTreeClassifier()
# Train the model
decision_tree.fit(X_train, y_train)
# Make predictions on the testing set
y_pred = decision_tree.predict(X_test)
# Calculate accuracy
accuracy = accuracy_score(y_test, y_pred)
print("Accuracy:", accuracy)
# Print confusion matrix
print("\nConfusion Matrix:")
print(confusion_matrix(y_test, y_pred))
# Print classification report
print("\nClassification Report:")
print(classification_report(y_test, y_pred))
# Convert the feature-name Index to a plain list for plot_tree
feature_names = X.columns.tolist()
# Plot the decision tree
plt.figure(figsize=(12, 8))
plot_tree(decision_tree, feature_names=feature_names, class_names=['No', 'Yes'], filled=True)
plt.show()
Output
Accuracy: 1.0
Confusion Matrix:
[[1 0]
 [0 2]]

Classification Report:
              precision    recall  f1-score   support

          No       1.00      1.00      1.00         1
         Yes       1.00      1.00      1.00         2

    accuracy                           1.00         3
   macro avg       1.00      1.00      1.00         3
weighted avg       1.00      1.00      1.00         3
Step-by-Step Explanation
1. Import Necessary Libraries:
import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier, plot_tree
from sklearn.metrics import confusion_matrix, accuracy_score, classification_report
import matplotlib.pyplot as plt
Explanation:
pandas: Library for data manipulation and analysis.
train_test_split: Function to split the dataset into training and testing sets.
DecisionTreeClassifier: Class for decision tree classification model.
plot_tree: Function to visualize the decision tree.
confusion_matrix, accuracy_score, classification_report: Functions to evaluate the model's
performance.
matplotlib.pyplot: Library for plotting graphs.
2. Load the Dataset:
data = {
'Outlook': ['Sunny', 'Sunny', 'Overcast', 'Rainy', 'Rainy', 'Rainy', 'Overcast', 'Sunny', 'Sunny',
'Rainy', 'Sunny', 'Overcast', 'Overcast', 'Rainy'],
'Temperature': ['Hot', 'Hot', 'Hot', 'Mild', 'Cool', 'Cool', 'Cool', 'Mild', 'Cool', 'Mild', 'Mild',
'Mild', 'Hot', 'Mild'],
'Humidity': ['High', 'High', 'High', 'High', 'Normal', 'Normal', 'Normal', 'High', 'Normal',
'Normal', 'Normal', 'High', 'Normal', 'High'],
'Wind': ['Weak', 'Strong', 'Weak', 'Weak', 'Weak', 'Strong', 'Strong', 'Weak', 'Weak', 'Weak',
'Strong', 'Strong', 'Weak', 'Strong'],
'PlayTennis': ['No', 'No', 'Yes', 'Yes', 'Yes', 'No', 'Yes', 'No', 'Yes', 'Yes', 'Yes', 'Yes', 'Yes', 'No']
}
df = pd.DataFrame(data)
Explanation:
We define a dictionary containing the "Play Tennis" dataset.
Then we convert this dictionary to a pandas DataFrame.
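A quick inspection step (not part of the original listing) can confirm the DataFrame was built as expected before encoding:
# Optional check: inspect the raw DataFrame before encoding
print(df.shape)                          # expected: (14, 5) -- 14 examples, 4 features + target
print(df.head())                         # first few rows of the Play Tennis data
print(df['PlayTennis'].value_counts())   # class balance: 9 Yes vs 5 No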
3. Data Preprocessing:
df = pd.get_dummies(df, columns=['Outlook', 'Temperature', 'Humidity', 'Wind'])
X = df.drop('PlayTennis', axis=1)
y = df['PlayTennis']
Explanation:
We use one-hot encoding to convert categorical variables into numerical format.
X contains the features, and y contains the target variable.
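To see exactly what one-hot encoding produces, an optional check like the following (not in the original code) prints the resulting feature columns; the names and order in the comment are what pd.get_dummies typically generates for this data:
# Optional check: list the columns created by one-hot encoding
print(X.columns.tolist())
# Roughly: ['Outlook_Overcast', 'Outlook_Rainy', 'Outlook_Sunny',
#           'Temperature_Cool', 'Temperature_Hot', 'Temperature_Mild',
#           'Humidity_High', 'Humidity_Normal', 'Wind_Strong', 'Wind_Weak']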
4. Split Data into Training and Testing Sets:
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)
Explanation:
We split the dataset into training and testing sets using the train_test_split function.
We use 80% of the data for training and 20% for testing.
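Because the dataset has only 14 rows, a 20% test split leaves just 3 test examples. An optional check (not in the original code) makes the split sizes explicit:
# Optional check: confirm the split sizes (14 rows -> 11 train / 3 test)
print(X_train.shape, X_test.shape)   # expected: (11, 10) (3, 10)
print(y_test.tolist())               # the three held-out labels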
5. Initialize and Train Decision Tree Model:
decision_tree = DecisionTreeClassifier()
decision_tree.fit(X_train, y_train)
Explanation:
We initialize a DecisionTreeClassifier object.
Then we train the model using the training data.
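DecisionTreeClassifier() with no arguments uses the Gini criterion and grows the tree until the leaves are pure. If a reproducible or more constrained tree is wanted, constructor arguments such as those below can be passed; the variable name tuned_tree and the values shown are illustrative only, not part of the original exercise:
# Illustrative only: a constrained, reproducible tree (example values)
tuned_tree = DecisionTreeClassifier(
    criterion='entropy',   # split on information gain instead of Gini impurity
    max_depth=3,           # cap the depth to reduce overfitting
    min_samples_leaf=1,    # minimum number of samples required at a leaf
    random_state=42        # make any tie-breaking reproducible
)
tuned_tree.fit(X_train, y_train)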
6. Make Predictions and Evaluate Model:
y_pred = decision_tree.predict(X_test)
accuracy = accuracy_score(y_test, y_pred)
conf_matrix = confusion_matrix(y_test, y_pred)
class_report = classification_report(y_test, y_pred)
Explanation:
We make predictions on the testing data using the predict method.
Then we calculate accuracy using accuracy_score.
We also compute the confusion matrix and classification report.
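With only 3 test examples, a single accuracy figure is very noisy. As an optional sanity check (not part of the original listing), cross-validation over all 14 rows gives a more stable estimate; this sketch uses scikit-learn's standard cross_val_score helper:
from sklearn.model_selection import cross_val_score

# Optional: 5-fold cross-validation on the full dataset for a steadier estimate
cv_scores = cross_val_score(DecisionTreeClassifier(random_state=42), X, y, cv=5)
print("CV accuracy per fold:", cv_scores)
print("Mean CV accuracy:", cv_scores.mean())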
7. Print Model Evaluation Metrics:
print("Accuracy:", accuracy)
print("\nConfusion Matrix:")
print(conf_matrix)
print("\nClassification Report:")
print(class_report)
Explanation:
We print the accuracy, confusion matrix, and classification report to evaluate the
model's performance.
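If a labelled plot of the confusion matrix is preferred over the raw array, scikit-learn 1.0+ provides ConfusionMatrixDisplay; this optional snippet is not part of the original listing:
from sklearn.metrics import ConfusionMatrixDisplay

# Optional: plot the confusion matrix with class labels instead of printing a raw array
ConfusionMatrixDisplay.from_predictions(y_test, y_pred)
plt.show()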
8. Plot the Decision Tree:
plt.figure(figsize=(12, 8))
plot_tree(decision_tree, feature_names=X.columns, class_names=['No', 'Yes'], filled=True)
plt.show()
Explanation:
Finally, we plot the decision tree using the plot_tree function to visualize the model's
decision-making process.
We specify feature names and class names for better interpretation of the tree.
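When a plotting window is inconvenient (for example, on a remote server), the same learned rules can be printed as plain text with export_text; this is an optional alternative to plot_tree, not part of the original code:
from sklearn.tree import export_text

# Optional: print the learned decision rules as indented text
print(export_text(decision_tree, feature_names=X.columns.tolist()))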
**************************