
Deep Learning

Lecture 05+06: Transfer Learning & Pretrained Models

Presenter: Dr Shahbaz Khan


Overview

Today: Transfer Learning
• ImageNet
• AlexNet
• VGG16
• ResNet
• Transfer learning

Next Week:
• Localization
• Detection
Problem with our developed model

Large Data Requirement
• Deep learning models need large datasets to generalize well.
• Collecting enough labeled data is expensive and time-consuming.
• Models trained on small datasets often lead to overfitting.

Long Training Time
• Training deep networks from scratch requires extensive computational resources.
• Models with millions of parameters need days or weeks to converge, even with powerful hardware.
Problem with our developed model

Difficulty in Optimization
• Custom models often suffer from vanishing/exploding gradients in deep layers.
• Finding the right combination of hyperparameters (learning rate, batch size, etc.) is complex.
• Improper initialization can cause poor performance, leading to unstable gradients.

Limited Feature Extraction
• Models built from scratch learn features from randomly initialized weights.
• Initial layers take time to learn low-level features like edges and textures.
• Higher layers need to learn high-level abstract features, which requires deep architectures and more training data.
ImageNet History

• ImageNet is one of the most influential datasets in the history of computer vision and deep learning.
• It has driven remarkable advancements in the field, particularly in the development of state-of-the-art models.
ImageNet Large Scale Visual Recognition Challenge (ILSVRC)

• The first ImageNet Large Scale Visual Recognition Challenge (ILSVRC) was held in 2010, and it became an annual competition.
• Early entries used traditional machine learning techniques such as support vector machines (SVMs) and handcrafted features (e.g., SIFT, HOG), achieving around a 28.2% error rate.

• AlexNet: In 2012, a model called AlexNet by Alex Krizhevsky, Ilya Sutskever, and Geoffrey Hinton changed the course of the challenge and computer vision research.
AlexNet
VGG16
VGG19
Residual Networks (ResNet) and Skip Connections
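The slides show the skip-connection idea as a diagram. As a rough Keras sketch (not the exact block used in ResNet50; filter counts and input shape are illustrative assumptions), a residual block adds the block's input back onto its output:

```python
# Minimal sketch of a residual block with a skip connection (illustrative only).
import tensorflow as tf
from tensorflow.keras import layers

def residual_block(x, filters=64):
    """Two conv layers whose output is added back to the input (the skip connection)."""
    shortcut = x                                            # identity path
    y = layers.Conv2D(filters, 3, padding="same", activation="relu")(x)
    y = layers.Conv2D(filters, 3, padding="same")(y)
    y = layers.Add()([shortcut, y])                         # output = F(x) + x
    return layers.Activation("relu")(y)

# Example usage: stack a couple of blocks on a small feature map (shape assumed).
inputs = tf.keras.Input(shape=(32, 32, 64))
x = residual_block(inputs)
x = residual_block(x)
model = tf.keras.Model(inputs, x)
model.summary()
```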
ResNet50
https://keras.io/api/applications/
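As a quick illustration of the Keras Applications API linked above, the sketch below loads ResNet50 with ImageNet weights and runs it on a dummy image; the random input simply stands in for a real photo.

```python
# Sketch: loading a pretrained ResNet50 from keras.applications and running inference.
import numpy as np
from tensorflow.keras.applications.resnet50 import ResNet50, preprocess_input, decode_predictions

# Download the ImageNet-pretrained weights (full model, including the 1000-class head).
model = ResNet50(weights="imagenet")

# A dummy 224x224 RGB image is used here as a placeholder for a real input image.
img = np.random.randint(0, 255, size=(1, 224, 224, 3)).astype("float32")
preds = model.predict(preprocess_input(img))

# decode_predictions maps the 1000 output probabilities to human-readable labels.
print(decode_predictions(preds, top=3)[0])
```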
So, can these models help us?

Yes, of course, and this is called transfer learning.
Transfer Learning

A technique where a model trained on one task is reused or fine-tuned for a different, but related, task.

• Pretrained Model: Use a model trained on a large dataset (e.g., ImageNet for images, BERT for NLP).
• Knowledge Transfer: The pretrained model learns general features and representations (like edges and textures in images, or semantics in text).
• New Task: Adapt the model for a new, often smaller, dataset (e.g., medical image classification).
Feature Extraction

(Diagram: the pretrained base layers are frozen; new layers are added on top.)

This is extremely useful when:
• The task of interest has less data.
• But a related task has abundant data.

This is how it works:
• Train a neural network model (base model) on the related task.
• Replace the last few layers of the base model with new layers.
• Freeze the base model's layers and train only the new layers on the task of interest.
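A minimal sketch of this feature-extraction recipe in Keras, using a VGG16 base; the input size, 10-class head, and optimizer settings are assumptions for illustration, not part of the slides.

```python
# Sketch: feature extraction with a frozen pretrained base and a new classification head.
import tensorflow as tf
from tensorflow.keras import layers
from tensorflow.keras.applications import VGG16

# 1. Load the base model trained on the related task (ImageNet), without its original classifier.
base = VGG16(weights="imagenet", include_top=False, input_shape=(224, 224, 3))

# 2. Freeze the base so its pretrained weights are not updated during training.
base.trainable = False

# 3. Replace the last few layers with new ones for the task of interest (10 classes assumed).
model = tf.keras.Sequential([
    base,
    layers.GlobalAveragePooling2D(),
    layers.Dense(256, activation="relu"),
    layers.Dense(10, activation="softmax"),
])

model.compile(optimizer="adam",
              loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])

model.summary()  # only the new head's weights are trainable
# model.fit(train_ds, validation_data=val_ds, epochs=5)  # train_ds/val_ds are hypothetical datasets
```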
Fine Tuning

(Diagram: the early pretrained layers stay frozen; the later layers are unfrozen and retrained.)

Fine-tuning involves updating the weights of some or all layers of the pretrained model to adapt it to the new task.
Unfreeze some layers of the pretrained model and retrain them for the new task.
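Continuing the feature-extraction sketch above, fine-tuning might look like the following in Keras; how many layers to unfreeze and the learning rate are judgment calls, shown here only as assumed values.

```python
# Sketch: fine-tuning - unfreeze the top of the pretrained base and retrain with a small learning rate.
# Assumes `base` and `model` were built as in the feature-extraction sketch above.
import tensorflow as tf

base.trainable = True

# Keep the early layers (generic edge/texture detectors) frozen; unfreeze only the last few.
for layer in base.layers[:-4]:       # "-4" is an arbitrary choice for illustration
    layer.trainable = False

# Recompile with a much smaller learning rate so the pretrained weights are only nudged.
model.compile(optimizer=tf.keras.optimizers.Adam(learning_rate=1e-5),
              loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])

# model.fit(train_ds, validation_data=val_ds, epochs=3)  # hypothetical datasets
```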
Solving the Custom Model Problem

Faster Training
• Instead of training from scratch, we reuse the weights of pretrained models.
• Training time is reduced to hours or minutes instead of days because only a few layers need fine-tuning.

Improved Accuracy
• Transfer learning typically results in higher accuracy since the model starts with useful features instead of random initialization.
• Pretrained models already capture low-level and mid-level features like edges, textures, and shapes, so fine-tuning focuses on high-level task-specific features.
Thank You
Any Questions?
