Deep Learning: Introduction
Pr. Tarik Fissaa
DATA – INE2
Academic Year: 2022/2023
What is Deep Learning ?
Why Deep Learning and Why Now ?
Why Deep Learning ?
Hand-engineered features are time-consuming, brittle, and not scalable in practice.
Can we learn the underlying features directly from data?
Why Now?
Neural networks date back decades, so why the resurgence?
1. Big Data: larger datasets; easier collection & storage
2. Hardware: graphics processing units (GPUs); massively parallelizable
3. Software: improved techniques; new models; toolboxes
The Perceptron
The structural building block for deep learning
The Perceptron: Forward Propagation
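The forward pass of a perceptron is a weighted sum of the inputs plus a bias, passed through a non-linear activation: ŷ = g(w0 + Σi xi wi). A minimal NumPy sketch (the weights, bias, and inputs are illustrative values, not from the slides):

import numpy as np

def perceptron_forward(x, w, b):
    # Weighted sum of inputs plus bias, then a non-linear activation (sigmoid here)
    z = np.dot(w, x) + b
    return 1.0 / (1.0 + np.exp(-z))

x = np.array([2.0, -1.0])   # two input features (illustrative)
w = np.array([3.0, -2.0])   # weights (illustrative)
b = 1.0                     # bias
print(perceptron_forward(x, w, b))   # a single output in (0, 1)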
Common Activation Functions
Importance of Activation Functions
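Activation functions introduce non-linearity into the network; without them, any stack of layers collapses into a single linear transformation. A small sketch of three common choices (sigmoid, tanh, ReLU):

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))   # squashes values to (0, 1)

def tanh(z):
    return np.tanh(z)                  # squashes values to (-1, 1)

def relu(z):
    return np.maximum(0.0, z)          # zero for negative inputs, identity otherwise

z = np.linspace(-3, 3, 7)
print(sigmoid(z), tanh(z), relu(z), sep="\n")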
The Perceptron: Example
Building neural networks with Perceptrons
The Perceptron: Simplified
Multi Output Perceptron
Because all inputs are densely connected to all outputs, these layers are called Dense layers.
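A multi-output perceptron (Dense layer) is several perceptrons sharing the same inputs, so the forward pass is a matrix-vector product. A minimal sketch with illustrative shapes:

import numpy as np

def dense(x, W, b):
    # Every input is connected to every output: z = W x + b
    return W @ x + b

x = np.random.randn(3)       # 3 input features
W = np.random.randn(2, 3)    # 2 outputs, each with its own row of weights
b = np.random.randn(2)
print(dense(x, W, b))        # 2 outputs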
Single Layer Neural Network
Deep Neural Network
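A single layer network inserts one hidden Dense layer between inputs and outputs; a deep network stacks several, with a non-linearity between them. A minimal sketch (layer sizes are illustrative, ReLU assumed as the hidden activation):

import numpy as np

def relu(z):
    return np.maximum(0.0, z)

def dense(x, W, b):
    return W @ x + b

rng = np.random.default_rng(0)
sizes = [2, 4, 4, 1]   # input -> two hidden layers -> output (illustrative)
params = [(rng.standard_normal((m, n)), rng.standard_normal(m))
          for n, m in zip(sizes[:-1], sizes[1:])]

def forward(x, params):
    # Each Dense layer is followed by a non-linearity, except the output layer
    for W, b in params[:-1]:
        x = relu(dense(x, W, b))
    W, b = params[-1]
    return dense(x, W, b)   # raw output (pass through sigmoid for a probability)

print(forward(np.array([4.0, 5.0]), params))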
Applying Neural Networks
Example
Will I pass this class?
Let's start with a simple two-feature model:
𝑥1 = number of lectures you attend
𝑥2 = hours spent on the final project
Example problem: Will I pass this class?
[Scatter plot of past data: 𝑥1 = number of lectures you attend (horizontal axis), 𝑥2 = hours spent on the final project (vertical axis)]
Quantifying Loss
The loss of our network measures the cost incurred from incorrect predictions
Empirical Loss
The empirical loss measures the total loss over our entire dataset
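Concretely, the empirical loss averages the per-example loss over the n training examples: J(W) = (1/n) Σi L(f(x(i); W), y(i)). A minimal sketch, with the per-example loss left as a pluggable function:

import numpy as np

def empirical_loss(predictions, targets, loss_fn):
    # Average the per-example loss over the whole dataset
    return np.mean([loss_fn(p, t) for p, t in zip(predictions, targets)])

squared_error = lambda p, t: (p - t) ** 2
print(empirical_loss([0.1, 0.8, 0.6], [0, 1, 1], squared_error))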
Binary Cross Entropy Loss
Cross entropy loss can be used with models that output a probability between 0 and 1.
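For a true label y ∈ {0, 1} and a predicted probability ŷ, the binary cross entropy is −[y log ŷ + (1 − y) log(1 − ŷ)], averaged over the dataset. A minimal NumPy sketch:

import numpy as np

def binary_cross_entropy(y_true, y_pred, eps=1e-12):
    # Clip predictions away from 0 and 1 so the logs stay finite
    y_pred = np.clip(y_pred, eps, 1.0 - eps)
    return -np.mean(y_true * np.log(y_pred) + (1 - y_true) * np.log(1 - y_pred))

y_true = np.array([1, 0, 1, 1])
y_pred = np.array([0.9, 0.2, 0.7, 0.4])
print(binary_cross_entropy(y_true, y_pred))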
Mean Squared Error Loss (MSE)
Mean squared error loss can be used with regression models that output continuous real numbers.
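The mean squared error averages the squared difference between predictions and targets: J = (1/n) Σi (y(i) − ŷ(i))². A minimal sketch:

import numpy as np

def mean_squared_error(y_true, y_pred):
    return np.mean((y_true - y_pred) ** 2)

print(mean_squared_error(np.array([3.0, -0.5, 2.0]), np.array([2.5, 0.0, 2.0])))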
Training Neural Networks
Loss Optimization
We want to find the network weights that achieve the lowest loss.
Gradient Descent Algorithm
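Gradient descent initializes the weights randomly, then repeatedly steps opposite the gradient, W ← W − η ∇J(W), until convergence. A minimal sketch on a toy quadratic loss (the loss, learning rate η, and step count are illustrative):

import numpy as np

def gradient_descent(grad_fn, w0, lr=0.1, steps=100):
    w = np.array(w0, dtype=float)
    for _ in range(steps):
        w -= lr * grad_fn(w)   # step opposite the gradient
    return w

# Toy loss J(w) = ||w - 3||^2, whose gradient is 2 (w - 3); minimum at w = 3
grad = lambda w: 2.0 * (w - 3.0)
print(gradient_descent(grad, w0=[0.0, 10.0]))   # converges near [3, 3]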
Computing Gradients: Backpropagation
How does a small change in one weight (e.g. 𝑤2) affect the final loss 𝐽(𝑊)?
Repeat this for every weight in the network using gradients from later layers.
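Backpropagation answers this with the chain rule: the effect of a weight on the loss is the product of the local derivatives along the path from that weight to the output. A tiny worked sketch on a two-weight linear chain x → z1 = w1·x → ŷ = w2·z1 with squared-error loss (all values illustrative):

# Forward pass through a tiny linear chain: z1 = w1 * x, y_hat = w2 * z1
x, y = 2.0, 1.0
w1, w2 = 0.5, -1.0
z1 = w1 * x
y_hat = w2 * z1
J = (y_hat - y) ** 2

# Backward pass: apply the chain rule from the loss back to each weight
dJ_dyhat = 2.0 * (y_hat - y)
dJ_dw2 = dJ_dyhat * z1     # dJ/dw2 = dJ/dy_hat * dy_hat/dw2
dJ_dz1 = dJ_dyhat * w2     # propagate through y_hat = w2 * z1
dJ_dw1 = dJ_dz1 * x        # dJ/dw1 = dJ/dz1 * dz1/dw1
print(dJ_dw2, dJ_dw1)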
Neural Networks in Practice:
Optimization
Training Neural Networks is difficult
"Visualizing the loss landscape", Hao Li, Dec 2017
Loss functions can be difficult to optimize
Setting the Learning Rate
How do we deal with this?
Idea 1:
Try lots of different learning rates and see what works "just right"
Idea 2:
Do something smarter!
Design an adaptive learning rate that adapts to the landscape
Adaptive Learning Rates
• Learning rates are no longer fixed
• Can be made larger or smaller depending on:
• How large the gradient is
• How fast learning is happening
• Size of particular weights
• Etc.
Gradient Descent Algorithms
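Popular variants (SGD with momentum, Adagrad, Adadelta, RMSProp, Adam, ...) differ mainly in how they adapt the step size per weight. As one hedged illustration, the core update of RMSProp divides each step by a running average of squared gradients:

import numpy as np

def rmsprop_step(w, grad, cache, lr=0.05, decay=0.9, eps=1e-8):
    # Keep a running average of squared gradients and scale the step by it,
    # so weights with consistently large gradients take smaller steps
    cache = decay * cache + (1 - decay) * grad ** 2
    w = w - lr * grad / (np.sqrt(cache) + eps)
    return w, cache

w, cache = np.array([0.0, 10.0]), np.zeros(2)
grad = lambda w: 2.0 * (w - 3.0)   # same toy quadratic loss as before
for _ in range(400):
    w, cache = rmsprop_step(w, grad(w), cache)
print(w)                            # moves close to [3, 3]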
Neural Networks in Practice:
Mini-batches
Mini-batches while training
More accurate estimation of the gradient
Smoother convergence
Allows for larger learning rates
Mini-batches lead to faster training
Can parallelize computation + achieve significant speed increase on GPUs
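At each step, the gradient is averaged over a small batch of examples instead of a single example (too noisy) or the full dataset (too expensive). A minimal sketch on a toy linear-regression problem (data, batch size, and learning rate are illustrative):

import numpy as np

rng = np.random.default_rng(0)
X = rng.standard_normal((1000, 2))
true_w = np.array([2.0, -1.0])
y = X @ true_w + 0.1 * rng.standard_normal(1000)

w, lr, batch_size = np.zeros(2), 0.1, 32
for epoch in range(20):
    perm = rng.permutation(len(X))                   # shuffle once per epoch
    for i in range(0, len(X), batch_size):
        idx = perm[i:i + batch_size]
        Xb, yb = X[idx], y[idx]
        grad = 2 * Xb.T @ (Xb @ w - yb) / len(idx)   # MSE gradient on the batch
        w -= lr * grad
print(w)                                             # close to [2, -1]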
Neural Networks in Practice:
Overfitting
Regularization
What is it?
Technique that constrains our optimization problem to discourage complex models
Why?
Improve generalization of our model on unseen data
Regularization 1: Dropout
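During training, dropout sets each activation in a layer to zero with some probability p (often around 0.5), so the network cannot rely on any single unit; at test time all units are kept. A sketch of the common "inverted dropout" variant, which rescales the kept activations so their expected value is unchanged:

import numpy as np

def dropout(activations, p=0.5, training=True, rng=np.random.default_rng()):
    if not training:
        return activations                        # use all units at test time
    mask = rng.random(activations.shape) >= p     # keep each unit with probability 1 - p
    return activations * mask / (1.0 - p)         # rescale so the expected activation is unchanged

a = np.ones(10)
print(dropout(a, p=0.5))   # roughly half the units zeroed, the rest scaled to 2.0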
Regularization 2: Early Stopping
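Early stopping monitors the loss on a held-out validation set and halts training once it stops improving, since training past that point tends to overfit. A minimal sketch of the bookkeeping, run here on a simulated validation-loss curve:

import numpy as np

def early_stopping(val_losses, patience=3):
    # Return the epoch with the best validation loss, stopping once
    # `patience` epochs pass without any improvement
    best, best_epoch, waited = np.inf, 0, 0
    for epoch, loss in enumerate(val_losses):
        if loss < best:
            best, best_epoch, waited = loss, epoch, 0
        else:
            waited += 1
            if waited >= patience:
                break
    return best_epoch

# Simulated validation loss: improves, then rises as the model overfits
val_losses = [1.0, 0.7, 0.5, 0.45, 0.44, 0.46, 0.50, 0.55, 0.61]
print(early_stopping(val_losses))   # epoch 4, where validation loss was lowest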
Summary: Core Foundations