Experiment 8
Develop a program to demonstrate the working of the decision tree algorithm. Use the Breast Cancer dataset to build the decision tree, and apply this knowledge to classify a new sample.
Introduction to Decision Trees
What is a Decision Tree?
A Decision Tree is a supervised machine learning algorithm used for classification and
regression tasks. It models decisions using a tree-like structure where:
Nodes represent decision points based on feature values.
Edges represent possible outcomes (branches).
Leaves represent the final decision or classification.
Decision trees work by recursively splitting data into subsets based on the most significant
feature, ensuring maximum information gain at each step.
Working of the Decision Tree Algorithm
1. Selecting the Best Feature for Splitting
At each step, the algorithm selects the feature that best separates the data. Common
methods for choosing the best feature include:
Gini Impurity
Gini = 1 - ∑ pᵢ²
Measures how often a randomly chosen element would be incorrectly classified.
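To make the formula concrete, here is a minimal sketch (a hypothetical helper, not part of the experiment's code) that computes Gini impurity from a list of class labels:

import numpy as np

def gini_impurity(labels):
    # Gini = 1 - sum of squared class proportions
    _, counts = np.unique(labels, return_counts=True)
    p = counts / counts.sum()
    return 1 - np.sum(p ** 2)

print(gini_impurity([0, 0, 1, 1]))  # 0.5 -- maximally mixed two-class node
print(gini_impurity([0, 0, 0, 0]))  # 0.0 -- pure node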
Entropy (Information Gain)
Entropy = -∑ p(x) log₂ p(x)
Measures the uncertainty in a dataset and selects splits that maximize information gain.
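A matching sketch for entropy (again a hypothetical helper; log base 2 gives the result in bits):

import numpy as np

def entropy(labels):
    # Entropy = -sum(p * log2(p)) over the class proportions p
    _, counts = np.unique(labels, return_counts=True)
    p = counts / counts.sum()
    return -np.sum(p * np.log2(p))

print(entropy([0, 0, 1, 1]))  # 1.0 bit -- maximum uncertainty for two balanced classes
print(entropy([0, 0, 0, 1]))  # ~0.81 bits -- less uncertain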
Chi-Square Test
Evaluates the statistical significance of the feature split.
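As an illustration, scipy's chi2_contingency can test whether a candidate split separates the classes better than chance (the counts below are made up, not taken from the breast cancer data):

from scipy.stats import chi2_contingency

# Contingency table for a candidate split: rows = branches, columns = class counts
table = [[30, 10],   # left branch:  30 benign, 10 malignant
         [5,  25]]   # right branch:  5 benign, 25 malignant
chi2, p_value, dof, expected = chi2_contingency(table)
print(f"chi2 = {chi2:.2f}, p = {p_value:.4f}")  # a small p-value suggests a significant split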
2. Splitting the Data
The dataset is divided into subsets based on the selected feature. The process continues recursively until a stopping condition is met:
The node is pure (all samples belong to one class).
The tree reaches a predefined maximum depth, or other limits apply (e.g., minimum samples per split).
3. Making Predictions
For a new sample, traverse the tree from the root to a leaf node; the leaf node contains the predicted class label.
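The traversal can be sketched with a hand-built tree of nested dictionaries (an illustrative toy structure, not sklearn's internal representation):

# A hypothetical tree: internal nodes hold a feature index and threshold,
# leaves hold a class label.
tree = {
    'feature': 0, 'threshold': 15.0,
    'left':  {'leaf': 'Benign'},
    'right': {'feature': 1, 'threshold': 20.0,
              'left': {'leaf': 'Benign'}, 'right': {'leaf': 'Malignant'}},
}

def predict(node, sample):
    # Walk from the root: go left if the feature value is <= threshold, else right.
    while 'leaf' not in node:
        branch = 'left' if sample[node['feature']] <= node['threshold'] else 'right'
        node = node[branch]
    return node['leaf']

print(predict(tree, [17.5, 22.0]))  # Malignant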
Advantages of Decision Trees
✔ Easy to interpret – Mimics human decision-making.
✔ Handles both numerical & categorical data.
✔ Requires little data preprocessing – No need for feature scaling.
✔ Can tolerate missing values – Some implementations (e.g., C4.5) handle them natively.
Challenges of Decision Trees
❌ Overfitting – Deep trees may memorize noise instead of patterns.
❌ Bias towards dominant features – Features with more categories can lead to
biased splits.
❌ Instability – Small data variations can lead to different trees.
Optimizing Decision Trees
1. Pruning
Pre-Pruning: Stop the tree early using conditions (e.g., min samples per split).
Post-Pruning: Remove unnecessary branches after the tree is built.
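A sketch of both styles with scikit-learn, using its built-in copy of the breast cancer dataset; the ccp_alpha value here is picked arbitrarily for illustration and would normally be tuned by cross-validation:

from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = load_breast_cancer(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

# Pre-pruning: constrain growth up front
pre = DecisionTreeClassifier(min_samples_split=20, min_samples_leaf=10, random_state=42)
pre.fit(X_train, y_train)

# Post-pruning: compute the cost-complexity pruning path, then refit with a chosen alpha
path = DecisionTreeClassifier(random_state=42).cost_complexity_pruning_path(X_train, y_train)
post = DecisionTreeClassifier(ccp_alpha=path.ccp_alphas[-2], random_state=42)
post.fit(X_train, y_train)

print("pre-pruned test accuracy: ", pre.score(X_test, y_test))
print("post-pruned test accuracy:", post.score(X_test, y_test))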
2. Setting Tree Depth
Limiting maximum depth prevents overfitting.
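For example, sweeping max_depth with cross-validation shows how accuracy changes as the tree is allowed to grow (a sketch on scikit-learn's built-in breast cancer data):

from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import cross_val_score
from sklearn.tree import DecisionTreeClassifier

X, y = load_breast_cancer(return_X_y=True)
for depth in [2, 4, 8, None]:  # None = grow without a depth limit
    clf = DecisionTreeClassifier(max_depth=depth, random_state=42)
    print(f"max_depth={depth}: mean CV accuracy = {cross_val_score(clf, X, y, cv=5).mean():.3f}")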
3. Using Ensemble Methods
Random Forest: Combines multiple trees for better generalization.
Gradient Boosting: Sequentially improves predictions.
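Both ensembles are available in scikit-learn and can be compared directly against a single tree (again a sketch on the built-in dataset):

from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import GradientBoostingClassifier, RandomForestClassifier
from sklearn.model_selection import cross_val_score
from sklearn.tree import DecisionTreeClassifier

X, y = load_breast_cancer(return_X_y=True)
models = {
    "Single tree": DecisionTreeClassifier(random_state=42),
    "Random Forest": RandomForestClassifier(n_estimators=100, random_state=42),
    "Gradient Boosting": GradientBoostingClassifier(random_state=42),
}
for name, clf in models.items():
    print(f"{name}: mean CV accuracy = {cross_val_score(clf, X, y, cv=5).mean():.3f}")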
Applications of Decision Trees
Medical Diagnosis – Classifying diseases based on symptoms.
Fraud Detection – Identifying fraudulent transactions.
Customer Segmentation – Categorizing users based on behavior.
# Importing necessary libraries
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier, plot_tree
from sklearn.metrics import accuracy_score, classification_report, confusion_matrix
from sklearn.tree import export_graphviz
from IPython.display import Image
import pydotplus
import warnings
warnings.filterwarnings('ignore')
# Load the dataset (placeholder path; point this at your local copy of the
# Breast Cancer Wisconsin CSV)
data = pd.read_csv("breast_cancer.csv")
data.head()              # preview the first rows
data.shape               # (number of rows, number of columns)
data.info()              # column names, types, and non-null counts
data.diagnosis.unique()  # target classes: 'M' (malignant), 'B' (benign)
data.isnull().sum()      # count missing values per column
# Drop the non-predictive 'id' column (also drop 'Unnamed: 32' if your CSV includes it)
df = data.drop(['id'], axis=1)
df['diagnosis'] = df['diagnosis'].map({'M': 1, 'B': 0})  # Malignant: 1, Benign: 0
# Model Building
X = df.drop('diagnosis', axis=1)  # feature matrix: all 30 measurement columns
y = df['diagnosis']               # target vector
# Split the dataset into training and testing sets (80% train, 20% test)
X_train, X_test, y_train, y_test = train_test_split(X,y,test_size=0.2, random_state=42)
# Fit the decision tree model (criterion can be 'gini' or 'entropy')
model = DecisionTreeClassifier(criterion='entropy')
model.fit(X_train, y_train)
model
y_pred = model.predict(X_test)  # predict labels for the test set
y_pred
# Evaluate the model
accuracy = accuracy_score(y_test, y_pred) * 100
classification_rep = classification_report(y_test, y_pred)
# Print the results
print("Accuracy:", accuracy)
print("Classification Report:\n", classification_rep)
# Classify a hypothetical new sample (30 feature values in the same order as X's columns);
# wrapping it in a DataFrame keeps feature names consistent with training
new = pd.DataFrame([[12.5, 19.2, 80.0, 500.0, 0.085, 0.1, 0.05, 0.02, 0.17, 0.06,
                     0.4, 1.0, 2.5, 40.0, 0.006, 0.02, 0.03, 0.01, 0.02, 0.003,
                     16.0, 25.0, 105.0, 900.0, 0.13, 0.25, 0.28, 0.12, 0.29, 0.08]],
                   columns=X.columns)
y_pred = model.predict(new)
# Output the prediction (0 = Benign, 1 = Malignant)
if y_pred[0] == 0:
    print("Prediction: Benign")
else:
    print("Prediction: Malignant")
# Visualize the Decision Tree (optional)
plt.figure(figsize=(12, 8))
plot_tree(model, filled=True, feature_names=X.columns, class_names=['Benign', 'Malignant'])
plt.show()
# Export the tree to DOT format
dot_data = export_graphviz(model, out_file=None,
feature_names=X_train.columns,
rounded=True, proportion=False,
precision=2, filled=True)
# Convert DOT data to a graph
graph = pydotplus.graph_from_dot_data(dot_data)
# Display the graph
Image(graph.create_png())