What is deep learning?
A family of techniques for learning compositional vector representations
of complex data.
Review: linear predictors
[diagram: inputs x1, x2, x3 feed through weights w into the output f_θ(x)]
Output:
f_θ(x) = w · x
Parameters: θ = w
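As a minimal sketch of this predictor (assuming NumPy; the weights and input are illustrative):

import numpy as np

w = np.array([0.5, -1.0, 2.0])   # parameters θ = w (illustrative values)
x = np.array([1.0, 2.0, 3.0])    # input features x1, x2, x3

def f(x, w):
    # linear predictor: f_θ(x) = w · x
    return np.dot(w, x)

print(f(x, w))  # 0.5*1 - 1*2 + 2*3 = 4.5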
Review: neural networks
[diagram: inputs x1, x2, x3 feed through weights V into hidden units h1, h2, then through weights w into the output f_θ(x)]
Intermediate hidden units:
h_j(x) = σ(v_j · x), where σ(z) = (1 + e^(−z))^(−1)
Output:
f_θ(x) = w · h(x)
Parameters: θ = (V, w)
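A minimal sketch of this two-layer network (assuming NumPy; the values of V, w, and x are illustrative):

import numpy as np

def sigma(z):
    # logistic function: σ(z) = (1 + e^(−z))^(−1)
    return 1.0 / (1.0 + np.exp(-z))

def f(x, V, w):
    # hidden units h_j(x) = σ(v_j · x); output f_θ(x) = w · h(x)
    h = sigma(V @ x)   # each row of V is one v_j
    return np.dot(w, h)

V = np.array([[1.0, -1.0,  0.5],   # v_1
              [0.2,  0.3, -0.4]])  # v_2
w = np.array([1.0, -2.0])
x = np.array([1.0, 0.0, 2.0])
print(f(x, V, w))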
Deep neural networks
1-layer neural network: score = w^⊤ x
2-layer neural network: score = w^⊤ σ(V x)
3-layer neural network: score = w^⊤ σ(U σ(V x))
...
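The same pattern extends to any depth; a generic sketch, reusing sigma from the previous sketch (the layer list is illustrative):

def score(x, layers, w):
    # k-layer network: score = w^⊤ σ(W_{k-1} σ(… σ(W_1 x)))
    h = x
    for W in layers:   # e.g. layers = [V] for 2 layers, [V, U] for 3 layers
        h = sigma(W @ h)
    return np.dot(w, h)

With layers = [], this reduces to the 1-layer score w^⊤ x.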
Depth
[diagram: input x passes through successive hidden layers h, h′, h″, h‴ to the output f_θ(x)]
Intuitions:
• Hierarchical feature representations
• Can simulate a bounded computation logic circuit (the original motivation from McCulloch/Pitts, 1943)
• Learn this computation (and potentially more, because networks are real-valued)
• Formal theory/understanding is still incomplete
• Some hypotheses are emerging: double descent, the lottery ticket hypothesis
What's learned?
[figure from Honglak Lee]
Review: optimization
Regression:
Loss(x, y, θ) = (f_θ(x) − y)²
Key idea: minimize training loss
TrainLoss(θ) = (1 / |D_train|) Σ_{(x,y) ∈ D_train} Loss(x, y, θ)
min_{θ ∈ ℝ^d} TrainLoss(θ)
Algorithm: stochastic gradient descent
For t = 1, …, T:
  For (x, y) ∈ D_train:
    θ ← θ − η_t ∇_θ Loss(x, y, θ)
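A minimal sketch of SGD for the regression loss above, using a linear predictor f_θ(x) = θ · x (the toy dataset, step size, and number of epochs are illustrative):

import numpy as np

def sgd(Dtrain, d, T=100, eta=0.1):
    # minimize TrainLoss(θ) with Loss(x, y, θ) = (f_θ(x) − y)^2;
    # for linear f, the gradient is ∇_θ Loss(x, y, θ) = 2 (θ·x − y) x
    theta = np.zeros(d)
    for t in range(T):
        for x, y in Dtrain:
            theta = theta - eta * 2 * (np.dot(theta, x) - y) * x
    return theta

# toy dataset generated by y = 2*x1 − x2
Dtrain = [(np.array([1.0, 0.0]), 2.0),
          (np.array([0.0, 1.0]), -1.0)]
print(sgd(Dtrain, d=2))  # approaches [2, -1]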
Training
• Non-convex optimization
• No theoretical guarantees that it works
• Before the 2000s, it was empirically very difficult to get working
What's different today
• Computation (time/memory)
• Information (data)
How to make it work
• More hidden units (over-parameterization)
• Adaptive step sizes (AdaGrad, Adam)
• Dropout to guard against overfitting (see the sketch below)
• Careful initialization (pre-training)
• Batch normalization
Model and optimization are tightly coupled
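As one concrete example from the list above, a minimal sketch of (inverted) dropout at training time, assuming NumPy; the drop probability p is illustrative:

import numpy as np

def dropout(h, p=0.5, train=True):
    # zero each hidden unit with probability p during training, and scale
    # the survivors by 1/(1−p) so expected activations are unchanged
    if not train:
        return h
    mask = np.random.rand(*h.shape) >= p
    return h * mask / (1.0 - p)

h = np.ones(8)
print(dropout(h))  # about half the units zeroed; survivors become 2.0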
Summary
• Deep networks learn hierarchical representations of data
• Train via SGD, use backpropagation to compute gradients
• Non-convex optimization, but works empirically given enough compute and data
Motivation
[diagram: image input x fed through a dense weight matrix W]
• Observation: images are not arbitrary vectors
• Goal: leverage the spatial structure of images (translation equivariance)
Idea: Convolutions
Prior knowledge
[figure from Andrej Karpathy]
• Local connectivity: each hidden unit operates on a local image patch (3 instead of 7 connections per hidden unit)
• Parameter sharing: the processing of each image patch is the same (3 parameters instead of 3 · 5); see the sketch below
• Intuition: try to match a pattern in the image
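A minimal sketch of these two ideas as a 1D convolution (assuming NumPy; the filter values are illustrative):

import numpy as np

def conv1d(x, f):
    # local connectivity: each hidden unit sees only len(f) inputs;
    # parameter sharing: every hidden unit reuses the same filter f
    k = len(f)
    return np.array([np.dot(f, x[i:i + k]) for i in range(len(x) - k + 1)])

x = np.arange(7.0)               # 7 inputs
f = np.array([1.0, 0.0, -1.0])   # 3 shared parameters
print(conv1d(x, f))              # 5 hidden units, 3 connections each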
Convolutional layers
• Instead of mapping a vector to a vector, a convolutional layer maps a volume to a volume
[Andrej Karpathy’s demo]
Max-pooling
[figure from Andrej Karpathy]
• Intuition: test whether a pattern exists anywhere in a neighborhood
• Reduces computation and helps prevent overfitting; see the sketch below
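A minimal sketch of 1D max-pooling (assuming NumPy; the window size and activations are illustrative):

import numpy as np

def max_pool1d(h, k=2):
    # keep only the strongest response in each window of k neighbors:
    # "does the pattern occur anywhere in this neighborhood?"
    n = len(h) - len(h) % k          # drop any leftover tail
    return h[:n].reshape(-1, k).max(axis=1)

h = np.array([0.1, 0.9, 0.3, 0.2, 0.8, 0.4])
print(max_pool1d(h))  # [0.9 0.3 0.8]; half as many values downstream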
Example of function evaluation
[Andrej Karpathy’s demo]
AlexNet
[Krizhevsky et al., 2012]
• Non-linearity: use ReLU (max(z, 0)) instead of the logistic function (see the sketch below)
• Data augmentation: translations, horizontal reflections, varied intensity; dropout (to guard against overfitting)
• Computation: parallelized across two GPUs (6 days)
• Results on ImageNet: 16.4% error (the next best was 25.8%)
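A minimal sketch of the ReLU non-linearity, for comparison with the logistic function used earlier (assuming NumPy; the inputs are illustrative):

import numpy as np

def relu(z):
    # ReLU: max(z, 0); unlike the logistic function, it does not
    # saturate for large positive inputs
    return np.maximum(z, 0.0)

print(relu(np.array([-2.0, -0.5, 0.0, 3.0])))  # [0. 0. 0. 3.]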
Residual networks
[He et al., 2015]
x ↦ σ(W x) + x
• Key idea: make it easy to learn the identity (a good inductive bias); see the sketch below
• Enables training 152-layer networks
• Results on ImageNet: 3.6% error
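A minimal sketch of one residual block, using ReLU for the non-linearity σ (assuming NumPy; the shapes are illustrative):

import numpy as np

def residual_block(x, W):
    # x ↦ σ(W x) + x: the skip connection means that when W is near zero
    # the block computes approximately the identity
    return np.maximum(W @ x, 0.0) + x

x = np.ones(4)
W = np.zeros((4, 4))
print(residual_block(x, W))  # exactly x: [1. 1. 1. 1.]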
Summary
• Key idea 1: locality of connections captures spatial structure
• Key idea 2: filters share parameters, capturing translational equivariance
• Depth matters
• Applications to images, text, Go, drug design, etc.