Model Parameters
Dr. Nabeela Kausar
Introduction
• Model optimization is one of the toughest challenges in the
implementation of machine learning solutions.
• Entire branches of machine learning and deep learning theory have
been dedicated to the optimization of models.
Model Parameters in Machine Learning
In a machine learning model, there are two types of parameters:
Model Parameters
Model Hyperparameters
Model Parameters
• Model Parameters: These are the parameters in the model that must
be determined using the training data set. These are the fitted
parameters.
• Hyperparameters: These are adjustable parameters that must be
tuned in order to obtain a model with optimal performance.
Model Parameters
A model parameter is a configuration variable that is internal to the
model and whose value can be estimated from data.
• They are required by the model when making predictions.
• Their values define the skill of the model on your problem.
• They are estimated or learned from data.
• They are often not set manually by the practitioner.
• They are often saved as part of the learned model.
• In classical machine learning literature, we may think of the model as
the hypothesis and the parameters as the tailoring of the hypothesis
to a specific set of data.
What is a Model Hyperparameter?
• A model hyperparameter is a configuration that is external to the
model and whose value cannot be estimated from data.
• They are often used in processes to help estimate model parameters.
• They are often specified by the practitioner.
• They can often be set using heuristics.
• They are often tuned for a given predictive modeling problem, as the sketch below illustrates.
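As a concrete illustration, here is a minimal sketch in Python using scikit-learn's LogisticRegression (the library, model, and synthetic dataset are my choice, not the slides'): the regularization strength C is a hyperparameter chosen by the practitioner, while the fitted coefficients are model parameters estimated from the data.

from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression

X, y = make_classification(n_samples=200, n_features=5, random_state=0)

# C (inverse regularization strength) and max_iter are hyperparameters:
# set by the practitioner before training, not learned from the data.
model = LogisticRegression(C=1.0, max_iter=200)
model.fit(X, y)

# coef_ and intercept_ are model parameters: estimated from the
# training data and saved as part of the learned model.
print("learned parameters:", model.coef_, model.intercept_)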
Hyperparameter Optimization Methods
• Hyperparameters can have a direct impact on the training of machine
learning algorithms. Thus, to achieve maximal performance, it is
important to understand how to optimize them. Here are some
common strategies for optimizing hyperparameters:
Manual Hyperparameter Tuning
• Traditionally, hyperparameters were tuned manually by trial and error.
This is still commonly done, and experienced engineers can “guess”
parameter values that will deliver very high accuracy for ML models.
However, there is a continual search for better, faster, and more
automatic methods to optimize hyperparameters.
Grid Search
• Grid Search trains and scores the model on every combination of the candidate hyperparameter values. Suppose you defined the grid as:
a1 = [0, 1, 2, 3, 4, 5]
a2 = [10, 20, 30, 40, 50, 60]
a3 = [100, 105, 110, 115, 120, 125]
• Grid search would then evaluate all 6 × 6 × 6 = 216 combinations and keep the best-scoring one, as sketched below.
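Here is a minimal sketch of exhaustive grid search over this grid; evaluate_model is a hypothetical stand-in for training the model with the given hyperparameters and returning a validation score.

from itertools import product

a1 = [0, 1, 2, 3, 4, 5]
a2 = [10, 20, 30, 40, 50, 60]
a3 = [100, 105, 110, 115, 120, 125]

def evaluate_model(x1, x2, x3):
    # Toy stand-in: in practice, train the model with these
    # hyperparameter values and return its validation score.
    return -((x1 - 3) ** 2 + (x2 - 40) ** 2 + (x3 - 110) ** 2)

best_score, best_params = float("-inf"), None
for params in product(a1, a2, a3):  # all 6 * 6 * 6 = 216 combinations
    score = evaluate_model(*params)
    if score > best_score:
        best_score, best_params = score, params
print(best_params)  # (3, 40, 110) for the toy score above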
Random Search
• Often some of the hyperparameters matter much more than others.
Performing random search rather than grid search allows a much
more precise discovery of good values for the important ones.
• Random Search sets up a grid of hyperparameter values and selects random combinations to train and score the model, as sketched below.
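A minimal sketch of random search over the same grid, reusing the lists a1, a2, a3 and the toy evaluate_model from the grid search sketch above:

import random

random.seed(0)
best_score, best_params = float("-inf"), None
for _ in range(20):  # 20 random trials instead of all 216 combinations
    params = (random.choice(a1), random.choice(a2), random.choice(a3))
    score = evaluate_model(*params)
    if score > best_score:
        best_score, best_params = score, params
print(best_score, best_params)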
Grid Search vs Random Search
Evolutionary Optimization
• Evolutionary (genetic) algorithms treat hyperparameter configurations as a population: the best-scoring configurations survive each generation and are mutated or recombined to produce the next, as sketched below.
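A toy sketch of this idea (my own illustration, again reusing a1, a2, a3 and the toy evaluate_model from the grid search sketch):

import random

random.seed(1)
grids = (a1, a2, a3)

def mutate(params):
    # Variation: resample one hyperparameter from its grid.
    i = random.randrange(3)
    child = list(params)
    child[i] = random.choice(grids[i])
    return tuple(child)

# Start from a random population of 8 configurations.
population = [tuple(random.choice(g) for g in grids) for _ in range(8)]
for _ in range(10):  # 10 generations
    # Selection: keep the top half of the population by score.
    survivors = sorted(population, key=lambda p: evaluate_model(*p),
                       reverse=True)[:4]
    population = survivors + [mutate(p) for p in survivors]
print(max(population, key=lambda p: evaluate_model(*p)))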
Bias and Variance
• A supervised machine learning model aims to train itself on the input variables (X) in such a way that the predicted values (Y) are as close to the actual values as possible. This difference between the actual and predicted values is the error, and it is used to evaluate the model. The error for any supervised machine learning algorithm comprises three parts:
• Bias error
• Variance error
• The noise
• In supervised machine learning, an algorithm is trained on the training data to build a model that makes correct predictions on unseen data that was not available during training.
• A machine learning model is nothing but a mathematical function that describes the relationship between the predictors (features, in machine learning terminology) and the target variable.
• While the noise is the irreducible error that we cannot eliminate, the other two, i.e. bias and variance, are reducible errors that we can attempt to minimize as much as possible.
Bias
• In the simplest terms, Bias is the difference between the Predicted
Value and the Expected Value. To explain further, the model makes
certain assumptions when it trains on the data provided. When it is
introduced to the testing/validation data, these assumptions may not
always be correct.
What is Bias
• Bias is the difference between the average prediction of our model
and the correct value which we are trying to predict. Model with high
bias pays very little attention to the training data and oversimplifies
the model. It always leads to high error on training and test data.
What is variance?
• In contrast to bias, variance arises when the model takes into account the fluctuations in the data, i.e. the noise as well. So, what happens when our model has high variance?
• The model treats the noise as something to learn from. That is, the model learns too much from the training data, so much so that when confronted with new (testing) data, it is unable to predict accurately.
• Mathematically, the variance error in the model is:
• Variance[f̂(x)] = E[f̂(x)²] − (E[f̂(x)])²
Overfitting
• Because a model with high variance learns too much from the training data, this failure mode is called overfitting.
• To summarise,
• A model with a high bias error underfits data and makes very
simplistic assumptions on it
• A model with a high variance error overfits the data and learns too
much from it
• A good model is where both Bias and Variance errors are balanced
Bias-Variance Tradeoff
• Increasing a model's complexity typically lowers bias but raises variance, and decreasing it does the opposite; the goal is to choose the complexity at which the total error, Bias² + Variance + Irreducible Error, is smallest, as the toy example below shows.
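A toy illustration of the tradeoff (my own sketch, not from the slides): fitting noisy samples of a sine curve with polynomials of increasing degree. Degree 1 underfits (high bias, high error on both sets), degree 15 overfits (low training error, high test error), and degree 4 balances the two.

import numpy as np

rng = np.random.default_rng(0)
x_train = np.sort(rng.uniform(0, 1, 20))
y_train = np.sin(2 * np.pi * x_train) + rng.normal(0, 0.2, 20)
x_test = np.linspace(0, 1, 100)
y_test = np.sin(2 * np.pi * x_test)

for degree in (1, 4, 15):
    # Fit a polynomial of the given degree to the noisy training data.
    coeffs = np.polyfit(x_train, y_train, degree)
    train_mse = np.mean((np.polyval(coeffs, x_train) - y_train) ** 2)
    test_mse = np.mean((np.polyval(coeffs, x_test) - y_test) ** 2)
    print(f"degree {degree:2d}: train MSE {train_mse:.3f}, "
          f"test MSE {test_mse:.3f}")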