Evaluation Chapter Notes | Artificial Intelligence for Class 10
What is Evaluation?
Evaluation is a process that critically examines a program. It involves collecting and analyzing
information about a program’s activities, characteristics, and outcomes. Its purpose is to make
judgments about a program, to improve its effectiveness, and/or to inform programming decisions.
Let me explain this to you:
Evaluation is basically a check on the performance of your AI model. It relies on two things:
"Prediction" and "Reality". Evaluation is done as follows:
• First, take some testing data whose correct outcome is already known to be 100% true.
• Then, feed that testing data to the AI model while keeping the correct outcome with yourself;
this correct outcome is termed the "Reality."
• When you get the predicted outcome from the AI model, called the "Prediction," compare it with the
actual outcome, that is, the "Reality."
You can do this to:
• Improve the efficiency and performance of your AI model.
• Identify and correct mistakes.
Prediction and Reality
• Try not to use the dataset that was used during data acquisition or as training data when
evaluating the model (a short sketch of keeping the data separate follows this list).
• This is because the model will simply remember the whole training set and will therefore always
predict the correct label for any point in the training set. This is known as overfitting.
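As a rough illustration of keeping testing data separate, here is a minimal Python sketch; it assumes scikit-learn is installed, and the toy data, the DecisionTreeClassifier, and the variable names are invented for illustration, not taken from these notes.

```python
# A minimal sketch of keeping training and testing data separate.
# X (features) and y (true labels, i.e. "Reality") are illustrative placeholders.
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X = [[0], [1], [2], [3], [4], [5], [6], [7]]   # toy feature values
y = [0, 0, 0, 1, 1, 1, 1, 0]                   # toy true labels ("Reality")

# Hold back some data that the model never sees during training.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=42
)

model = DecisionTreeClassifier()
model.fit(X_train, y_train)            # train only on the training split

predictions = model.predict(X_test)    # "Prediction" on unseen data
print("Prediction:", list(predictions))
print("Reality:   ", y_test)
```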
Evaluation Terminologies
There are various terminologies that come up when we evaluate our model. Let's explore them with the
example of a football scenario.
The Scenario
• Imagine you have developed an AI-based prediction model designed to identify a football (soccer
ball). The objective of the model is to predict whether the given/shown figure is a football. To
understand the efficiency of this model, we need to check if the predictions it makes are correct
or not. Thus, there exist two conditions that we need to consider: Prediction and Reality.
◦ Prediction: The output given by the machine.
◦ Reality: The actual scenario about the figure shown when the prediction has been made.
• Now, let's look at various combinations that we can have with these two conditions:
◦ True Positive (TP): The model predicts the figure as a football, and it is indeed a football.
◦ True Negative (TN): The model predicts the figure as not a football, and it is indeed not a
football.
◦ False Positive (FP): The model predicts the figure as a football, but it is not a football.
◦ False Negative (FN): The model predicts the figure as not a football, but it is indeed a
football.
By analyzing these combinations, we can evaluate the performance and efficiency of the AI model. The
goal is to maximize the number of True Positives and True Negatives while minimizing the number of
False Positives and False Negatives.
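Here is a small Python sketch of how these four counts could be tallied by comparing Prediction with Reality; the two lists below are made-up examples.

```python
# Counting the four outcome types by comparing Prediction with Reality.
# "yes" means the figure is predicted/actually a football; the lists are made up.
reality    = ["yes", "no", "yes", "no", "yes", "no", "no", "yes"]
prediction = ["yes", "no", "no",  "no", "yes", "yes", "no", "yes"]

tp = tn = fp = fn = 0
for pred, real in zip(prediction, reality):
    if pred == "yes" and real == "yes":
        tp += 1   # True Positive: predicted football, and it is a football
    elif pred == "no" and real == "no":
        tn += 1   # True Negative: predicted not a football, and it is not
    elif pred == "yes" and real == "no":
        fp += 1   # False Positive: predicted football, but it is not
    else:
        fn += 1   # False Negative: predicted not a football, but it is
print("TP:", tp, "TN:", tn, "FP:", fp, "FN:", fn)
```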
Try yourself: What is the term used to describe when the model predicts the figure as a football, and it
is indeed a football?
A. True Positive (TP)
B. True Negative (TN)
C. False Positive (FP)
D. False Negative (FN)
Confusion Matrix
The comparison between the results of Prediction and Reality is known as the Confusion Matrix.
The confusion matrix helps us interpret the prediction results. It is not an evaluation metric itself but
serves as a record to aid in evaluation. Let’s review the four conditions related to the football example
once more.
Confusion Matrix table:

                     Reality: Yes            Reality: No
Prediction: Yes      True Positive (TP)      False Positive (FP)
Prediction: No       False Negative (FN)     True Negative (TN)
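As one possible way to build this record in code, the sketch below uses scikit-learn's confusion_matrix on the same kind of made-up Prediction/Reality lists (the library and the lists are assumptions for illustration).

```python
# Building the confusion matrix from made-up Prediction/Reality lists.
from sklearn.metrics import confusion_matrix

reality    = ["yes", "no", "yes", "no", "yes", "no", "no", "yes"]
prediction = ["yes", "no", "no",  "no", "yes", "yes", "no", "yes"]

# labels=["yes", "no"] puts the positive class ("yes") first, so:
# matrix[0][0] = TP, matrix[0][1] = FN, matrix[1][0] = FP, matrix[1][1] = TN
matrix = confusion_matrix(reality, prediction, labels=["yes", "no"])
print(matrix)
```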
Parameters to Evaluate a Model
Now let us go through all the possible combinations of "Prediction" and "Reality" and see how we can
use these conditions to evaluate the model.
Accuracy
Definition: Accuracy is the percentage of “correct predictions out of all observations.” A prediction is
considered correct if it aligns with the reality.
In this context, there are two scenarios where the Prediction matches the Reality: True Positive (TP)
and True Negative (TN).
Accuracy = (TP + TN) / (TP + TN + FP + FN) × 100%
Here, total observations cover all the possible cases of prediction that can be True Positive (TP), True
Negative (TN), False Positive (FP), and False Negative (FN).
Example: Let’s revisit the Football example.
Assume the model always predicts that there is no football. In reality, there is a 2% chance of
encountering a football. In this scenario, the model will be correct 98% of the time when it predicts no
football. However, it will be incorrect in the 2% of cases where a football is actually present, as it
incorrectly predicts no football.
Here, out of every 100 cases: TP = 0, TN = 98, FP = 0, FN = 2, so
Accuracy = (0 + 98) / (0 + 98 + 0 + 2) × 100% = 98%
Conclusion: The model reaches 98% accuracy even though it never detects a single football. High
accuracy alone is therefore not enough to judge a model, which is why we also look at the parameters
below.
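A quick sketch of this calculation in Python, using counts per 100 cases taken from the 2% / 98% split described above:

```python
# Reproducing the "always predicts no football" example out of 100 cases.
# Counts follow the 2% football / 98% no-football split described above.
tp, tn, fp, fn = 0, 98, 0, 2

accuracy = (tp + tn) / (tp + tn + fp + fn) * 100
print(f"Accuracy: {accuracy}%")   # 98.0% even though no football is ever detected
```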
Precision
Definition: The percentage of "true positive cases" out of all cases where the prediction is positive. This
metric considers both True Positives and False Positives. It measures how well the model identifies
positive cases among all cases it predicts as positive.
In other words, it evaluates the proportion of correctly identified positive instances compared to all
instances the model predicted as positive.
Precision = TP / (TP + FP) × 100%
In the football example, if the model always predicts the presence of a football, regardless of reality, all
positive predictions are evaluated, including:
• True Positive (Prediction = Yes and Reality = Yes)
• False Positive (Prediction = Yes and Reality = No)
Just like the story of the boy who falsely cried out about wolves and was ignored when real wolves
arrived, if the precision is low (indicating more false positives), it could lead to complacency. Players
might start ignoring the predictions, thinking they're mostly false, and thus fail to check for the ball when
it’s actually there.
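A small sketch of the precision calculation for such an "always predicts a football" model, borrowing the 2% real-football rate from the accuracy example (counts per 100 cases, purely for illustration):

```python
# Precision for a model that always predicts "football", assuming the same
# 2% real-football rate as in the accuracy example (counts per 100 cases).
tp, fp = 2, 98   # every case is predicted positive: 2 real footballs, 98 false alarms

precision = tp / (tp + fp) * 100
print(f"Precision: {precision}%")   # 2.0% - almost every alarm is false, like the boy who cried wolf
```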
Recall
Definition: Recall, also known as Sensitivity or True Positive Rate, is the fraction of actual positive
cases that are correctly identified by the model.
In the football example, recall focuses on the true cases where a football was actually present,
examining how well the model detected it. It takes into account:
• True Positives (TP): Cases where the model correctly identified the presence of a football.
• False Negatives (FN): Cases where a football was present, but the model failed to detect it.
Recall = TP / (TP + FN)
In both Precision and Recall, the numerator is the same: True Positives. However, the denominators
differ: Precision includes False Positives, while Recall includes False Negatives.
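A matching sketch for recall, reusing the "always predicts no football" model from the accuracy example (again counts per 100 cases):

```python
# Recall for the "always predicts no football" model from the accuracy example:
# out of 100 cases there are 2 real footballs (TP + FN = 2) and none are detected.
tp, fn = 0, 2

recall = tp / (tp + fn) * 100
print(f"Recall: {recall}%")   # 0.0% - every real football is missed, despite 98% accuracy
```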
F1 Score
Definition: The F1 Score measures the balance between precision and recall. It is used when there is
no clear preference for one metric over the other, providing a way to seek a balance between them.
F1 Score = 2 × (Precision × Recall) / (Precision + Recall)
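A small sketch of the F1 calculation in Python; the precision and recall values are illustrative fractions, not figures from these notes:

```python
# F1 Score = 2 * (Precision * Recall) / (Precision + Recall), the harmonic mean.
# Precision and recall are expressed here as fractions between 0 and 1.
precision = 0.8
recall = 0.4

f1 = 2 * (precision * recall) / (precision + recall)
print(f"F1 Score: {f1:.2f}")   # 0.53 - pulled towards the lower of the two values
```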
Try yourself: Which metric measures the balance between precision and recall?
A. Accuracy
B. Precision
C. Recall
D. F1 Score
Which Metric is Important?
Choosing between Precision and Recall depends on the specific context and the costs associated with
False Positives and False Negatives:
• Forest Fire Detection: Here, a False Negative (failing to detect a fire when there is one) is
critical because it could lead to devastating consequences, like the forest burning down.
Therefore, Recall (which emphasizes detecting all positive cases) is crucial in this scenario.
• Viral Outbreak Prediction: A False Negative here (not identifying an outbreak when it occurs)
can lead to widespread infection and severe public health issues. Hence, Recall is again more
important.
• Mining: If a model predicts the presence of treasure (a False Positive) but there's none, it could
result in unnecessary and costly digging. In this case, Precision (which focuses on avoiding false
alarms) is more valuable.
• Spam Email Classification: If a model incorrectly labels a legitimate email as spam (a False
Positive), it could lead to missing important messages. Therefore, Precision is critical in this
scenario as well.
Cases of High FN Cost:
• Forest Fire Detection
• Viral Outbreak Prediction
Cases of High FP Cost:
• Spam Email Classification
• Mining
Both the parameters are important
To sum up, if you want to assess your model’s performance comprehensively, both Precision and Recall
are crucial metrics.
• High Precision might come at the cost of Low Recall, and vice versa.
• The F1 Score is a metric that balances both Precision and Recall, providing a single score to
evaluate model performance.
• An ideal scenario would be where both Precision and Recall are 100%, leading to an F1 Score of
1 (or 100%).
Both Precision and Recall range from 0 to 1, and so does the F1 Score, with 1 representing perfect
performance.
Let us explore the variations we can have in the F1 Score:

Precision   Recall   F1 Score
Low         Low      Low
Low         High     Low
High        Low      Low
High        High     High
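A short sketch that computes the F1 Score for a few illustrative precision/recall combinations, showing the pattern in the table above:

```python
# F1 Score for different Precision/Recall combinations (values are illustrative).
def f1_score(precision, recall):
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)

combinations = [(0.1, 0.1), (0.1, 0.9), (0.9, 0.1), (0.9, 0.9)]
for p, r in combinations:
    print(f"Precision={p}, Recall={r} -> F1={f1_score(p, r):.2f}")
# Only when both precision and recall are high does the F1 Score come out high.
```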