What is Evaluation?
•As we know, we have two kinds of datasets:
 • Training Dataset
 • Testing Dataset
•Evaluation is the process of understanding the reliability of an AI model by feeding the test dataset into the model and comparing its outputs with the actual answers.
•It is not recommended to use the data we used to build the model to evaluate it, because the model will simply remember the whole training set and will therefore always predict the correct label for any point in it. This is known as overfitting.
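To make the split concrete, here is a minimal sketch of holding out a test set before evaluation. It assumes the scikit-learn library is available; the feature values and labels are hypothetical placeholders.

```python
# A minimal sketch of splitting data before evaluation, assuming
# scikit-learn is installed; X and y below are hypothetical.
from sklearn.model_selection import train_test_split

X = [[0.2], [0.5], [0.9], [0.1], [0.7], [0.4]]  # hypothetical features
y = [0, 1, 1, 0, 1, 0]                          # hypothetical labels (1 = fire)

# Hold out 25% of the data for testing; the model never sees these
# points during training, so evaluation reflects real predictive ability.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=42
)
```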
POSSIBLE REASONS FOR AN AI MODEL NOT BEING EFFICIENT
• Lack of Training Data
• Unauthenticated or Wrong Data
• Inefficient Coding / Wrong Algorithms
• Insufficient Testing
• Not Easy to Use
• Low Accuracy
Consider a scenario where we have an AI model that predicts the possibility of fires in a forest. The main aim of this model is to predict whether a forest fire has broken out or not. To understand whether the model is working properly, we need to check whether the predictions it makes are correct. So there are two conditions:
1. Prediction
2. Reality
Type 1 Error (False Positive): the model predicts a fire when in reality there is none.
Type 2 Error (False Negative): the model predicts no fire when in reality a fire has broken out.
Confusion Matrix
It is a comparison between prediction and reality. It helps us understand the prediction results. It is not an evaluation metric itself, but a record that helps in evaluation.
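As a small sketch, the four confusion matrix counts can be derived by comparing prediction with reality pair by pair; the two lists below are hypothetical (1 = fire, 0 = no fire).

```python
# Hypothetical reality and prediction labels (1 = fire, 0 = no fire).
reality    = [1, 0, 0, 1, 0, 1, 0, 0]
prediction = [1, 0, 1, 0, 0, 1, 0, 0]

pairs = list(zip(reality, prediction))
tp = sum(1 for r, p in pairs if r == 1 and p == 1)  # fire predicted, fire in reality
tn = sum(1 for r, p in pairs if r == 0 and p == 0)  # no fire predicted, none in reality
fp = sum(1 for r, p in pairs if r == 0 and p == 1)  # Type 1 error (False Positive)
fn = sum(1 for r, p in pairs if r == 1 and p == 0)  # Type 2 error (False Negative)

print(f"TP={tp}, TN={tn}, FP={fp}, FN={fn}")  # TP=2, TN=4, FP=1, FN=1
```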
Evaluation Methods
The commonly used evaluation methods are as follows:
1.Accuracy
2.Precision
3.Recall
4.F1 Score
Accuracy
•The percentage of correct predictions out of all the observations is called accuracy.
•If the prediction matches with reality, then it is said to be correct.
•There are two conditions where the prediction matches with reality:
• True Positive
• True Negative
•So the formula for accuracy is:

Accuracy = (TP + TN) / (TP + TN + FP + FN) × 100%
This can return a high accuracy for an AI model even when it simply predicts "no fire" every time, because the actual cases where the fire broke out are not taken into account. Therefore there is a need to look at another parameter that takes such cases into account as well.
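A short sketch of the accuracy formula, with hypothetical counts showing how a model that always predicts "no fire" can still score high accuracy:

```python
def accuracy(tp, tn, fp, fn):
    """Correct predictions (TP + TN) out of all cases, as a percentage."""
    return (tp + tn) / (tp + tn + fp + fn) * 100

# Hypothetical imbalanced case: in 100 observations there were only 2 real
# fires, and the model predicted "no fire" every single time.
print(accuracy(tp=0, tn=98, fp=0, fn=2))  # 98.0 - yet every fire was missed
```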
Precision
•The percentage of true positive cases out of all the cases where the prediction is positive.
•It takes into account the True Positives and False Positives.
•If precision is high, it means there are more True Positive cases.
•If precision is low, it means there are more False Positive cases.
•So the formula for precision is:

Precision = TP / (TP + FP) × 100%
Let us consider a model with 100% precision, which means that whenever the machine says there is a fire, there actually is a fire (True Positive). In the same model, there can still be a rare exceptional case where there was an actual fire but the system could not detect it. This is a False Negative condition, and the precision value is not affected by it because precision does not take FN into account. Is precision then a good parameter for model performance?
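A minimal sketch of the precision formula, using hypothetical counts:

```python
def precision(tp, fp):
    """True Positives out of all positive predictions (TP + FP), as a percentage."""
    return tp / (tp + fp) * 100

# Hypothetical counts: 2 correct fire alarms, 1 false alarm.
print(precision(tp=2, fp=1))  # ~66.67 - one of every three alarms was false
```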
Recall
In the recall method, the fraction of positive cases that are correctly identified is taken into consideration. It takes into account the cases where, in reality, there was a fire, whether the machine detected it correctly or not. That is, it considers True Positives and False Negatives:

Recall = TP / (TP + FN) × 100%
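A matching sketch of the recall formula, again with hypothetical counts:

```python
def recall(tp, fn):
    """True Positives out of all actual fire cases (TP + FN), as a percentage."""
    return tp / (tp + fn) * 100

# Hypothetical counts: 2 fires detected, 1 fire missed.
print(recall(tp=2, fn=1))  # ~66.67 - one of every three real fires was missed
```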
Which Metric is Important?
Choosing between Precision and Recall depends on which type of error is costlier in a given situation, so we often need a single measure that balances the two.
F1 Score
F1 score can be defined as the measure of balance between precision and recall:

F1 Score = 2 × (Precision × Recall) / (Precision + Recall)

When can we get a perfect F1 score? An ideal situation would be when both Precision and Recall have a value of 1 (that is, 100%); in that case the F1 score would also be an ideal 1 (100%), known as the perfect value for the F1 score. As the values of both Precision and Recall range from 0 to 1, the F1 score also ranges from 0 to 1.
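A small sketch of the F1 formula, showing the perfect score and how the harmonic mean is pulled toward the weaker of the two metrics; the input values are hypothetical:

```python
def f1_score(precision, recall):
    """Harmonic mean of precision and recall, both on a 0-1 scale."""
    return 2 * precision * recall / (precision + recall)

print(f1_score(precision=1.0, recall=1.0))  # 1.0 - the perfect F1 score
print(f1_score(precision=0.8, recall=0.4))  # ~0.53 - dragged toward the weaker value
```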
EVALUATION METRIC | CONSIDERATION
Accuracy          | Correct predictions (TP + TN)
Precision         | Positive predictions (TP + FP)
Recall            | True reality cases (TP + FN)
F1 Score          | Precision & Recall
Why should we avoid using the training data for evaluation?
What should be the value of F1 score if the model needs to have 100% accuracy?
Calculate Accuracy, Precision, Recall and F1 Score.
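For the last exercise, here is a worked sketch using hypothetical counts (TP = 60, TN = 25, FP = 5, FN = 10); the same steps apply to any confusion matrix:

```python
# Hypothetical confusion matrix counts for the exercise.
tp, tn, fp, fn = 60, 25, 5, 10

accuracy  = (tp + tn) / (tp + tn + fp + fn)                # 85 / 100 = 0.85
precision = tp / (tp + fp)                                 # 60 / 65  ~ 0.923
recall    = tp / (tp + fn)                                 # 60 / 70  ~ 0.857
f1        = 2 * precision * recall / (precision + recall)  # ~ 0.889

print(f"Accuracy={accuracy:.2%}, Precision={precision:.2%}, "
      f"Recall={recall:.2%}, F1={f1:.2%}")
```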