Backpropagation
DA322M: Deep Learning by Dr. P. W. Patil, MFSDSAI
McCulloch-Pitts Neuron and Perceptron
McCulloch-Pitts neuron:
$$y = 1 \;\text{ if } \sum_{i=0}^{n} x_i \ge 0, \qquad y = 0 \;\text{ if } \sum_{i=0}^{n} x_i < 0$$
Perceptron:
$$y = 1 \;\text{ if } \sum_{i=0}^{n} w_i\, x_i \ge 0, \qquad y = 0 \;\text{ if } \sum_{i=0}^{n} w_i\, x_i < 0$$
➢ A perceptron separates the input space into two halves
➢ In other words, a single perceptron can only be used to implement linearly separable functions
➢ The weights (including threshold) can be learned and the inputs can be real valued
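To make the decision rule concrete, here is a minimal sketch (not from the slides; the weights and the AND example are illustrative) of a perceptron with the threshold absorbed as w0 and x0 = 1:

```python
import numpy as np

def perceptron_output(w, x):
    """Perceptron decision rule: fire (1) if the weighted sum is >= 0, else 0.
    w includes the bias/threshold term w0; x is expected to start with x0 = 1."""
    return 1 if np.dot(w, x) >= 0 else 0

# Example: a perceptron implementing the boolean AND of two inputs
# (weights chosen by hand; w0 = -1.5 plays the role of the negative threshold).
w = np.array([-1.5, 1.0, 1.0])      # [w0, w1, w2]
for x1, x2 in [(0, 0), (0, 1), (1, 0), (1, 1)]:
    x = np.array([1, x1, x2])       # x0 = 1 absorbs the threshold
    print(x1, x2, "->", perceptron_output(w, x))
```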
Credit: Mitesh Khapra, IITM
Perceptron Learning
➢ Consider two vectors w and x
w = [w0,w1,w2,...,wn]
x = [1,x1,x2,...,xn]
➢ We can rewrite the perceptron rule as: $y = 1$ if $\mathbf{w}^{T}\mathbf{x} \ge 0$, and $y = 0$ if $\mathbf{w}^{T}\mathbf{x} < 0$
➢ Consider some points (vectors) which lie in the positive half space of this line (i.e., wᵀx > 0).
➢ What will be the angle between any such vector and w?
▪ Obviously, less than 90°
➢ Consider some points (vectors) which lie in the negative half space of this line (i.e., wᵀx < 0).
➢ What will be the angle between any such vector and w?
▪ Obviously, greater than 90°
➢ The algorithm has converged when every input is classified correctly, i.e., no more corrections are needed
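A minimal sketch of the perceptron learning algorithm described above, assuming linearly separable data (function and variable names are illustrative):

```python
import numpy as np

def train_perceptron(P, N, n_features, max_epochs=1000):
    """P: list of positive inputs (label 1), N: list of negative inputs (label 0).
    Each input already has x0 = 1 prepended. Returns the learned weight vector w."""
    w = np.zeros(n_features + 1)
    for _ in range(max_epochs):
        converged = True
        for x in P:                      # want w.x >= 0 for positive points
            if np.dot(w, x) < 0:
                w = w + x                # correction: add the misclassified point
                converged = False
        for x in N:                      # want w.x < 0 for negative points
            if np.dot(w, x) >= 0:
                w = w - x                # correction: subtract the misclassified point
                converged = False
        if converged:                    # all points classified correctly
            break
    return w

# Toy usage: learn boolean OR (positive if x1 or x2 is 1)
P = [np.array([1, 0, 1]), np.array([1, 1, 0]), np.array([1, 1, 1])]
N = [np.array([1, 0, 0])]
print(train_perceptron(P, N, n_features=2))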
Proof of Perceptron Learning Convergence
➢ So far we made corrections whenever 𝑤ᵀ𝑝𝑖 < 0 for a positive point (and analogously for negative points)
➢ cos 𝛽 thus grows proportional to √𝑘
➢ Thus, there can only be a finite number of corrections (k) to w, and the algorithm will converge!
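A hedged sketch of the standard convergence argument, under the usual assumptions (normalized inputs, w initialized to 0, a separating w* with unit norm and margin δ):

```latex
% Sketch of the standard argument (not verbatim from the slides): assume the inputs
% are normalized, w is initialized to 0, and there exists a separating w* with
% unit norm and margin \delta = \min_i |w^{*T} p_i| > 0. After the k-th correction:
\begin{align*}
  w^{*T} w_{k} &\ge w^{*T} w_{k-1} + \delta
      &&\Rightarrow\; w^{*T} w_{k} \ge k\delta \\
  \lVert w_{k}\rVert^{2} &\le \lVert w_{k-1}\rVert^{2} + 1
      &&\Rightarrow\; \lVert w_{k}\rVert \le \sqrt{k} \\
  \cos\beta \;=\; \frac{w^{*T} w_{k}}{\lVert w_{k}\rVert}
      &\ge \frac{k\delta}{\sqrt{k}} \;=\; \sqrt{k}\,\delta
\end{align*}
% Since \cos\beta \le 1, we need \sqrt{k}\,\delta \le 1, i.e. k \le 1/\delta^{2}:
% the number of corrections is finite, so the algorithm converges.
```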
Multilayered Network of Perceptrons
Sigmoid Neuron
➢ Perceptron (step function) output: not smooth, not continuous at the threshold, not differentiable
➢ Sigmoid output: smooth, continuous, differentiable
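A small sketch (illustrative, not from the slides) contrasting the perceptron's step output with the sigmoid and its everywhere-defined derivative:

```python
import numpy as np

def step(z):
    """Perceptron output: a hard threshold, not differentiable at z = 0."""
    return np.where(z >= 0, 1.0, 0.0)

def sigmoid(z):
    """Sigmoid output: smooth, continuous, differentiable everywhere."""
    return 1.0 / (1.0 + np.exp(-z))

def sigmoid_grad(z):
    """Derivative of the sigmoid: sigma(z) * (1 - sigma(z))."""
    s = sigmoid(z)
    return s * (1.0 - s)

z = np.array([-2.0, -0.1, 0.0, 0.1, 2.0])
print("step:   ", step(z))
print("sigmoid:", np.round(sigmoid(z), 3))
print("grad:   ", np.round(sigmoid_grad(z), 3))
```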
Sigmoid Neuron
➢ Typical supervised machine learning setup:
▪ Data: $\{(x_i, y_i)\}_{i=1}^{n}$
▪ Model: Approximation of the relation between x and y, for example
$$\hat{y} = \frac{1}{1 + e^{-\mathbf{w}^{T}\mathbf{x}}}$$
▪ Parameters: In all the above cases, w is a parameter which needs to be learned from the data
▪ Learning algorithm: An algorithm for learning the parameters w of the model (for example,
perceptron learning algorithm, gradient descent, etc.)
▪ Objective/Loss/Error function: To guide the learning algorithm
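To make the setup concrete, a minimal sketch (variable names and toy data are illustrative, not from the slides) of the sigmoid model and a squared error loss over the data:

```python
import numpy as np

def model(w, x):
    """Sigmoid model: y_hat = 1 / (1 + exp(-w^T x))."""
    return 1.0 / (1.0 + np.exp(-np.dot(w, x)))

def loss(w, X, y):
    """Squared error over the data {(x_i, y_i)}: mean of (y_hat_i - y_i)^2."""
    y_hat = np.array([model(w, x) for x in X])
    return np.mean((y_hat - y) ** 2)

# Toy data: two features per example, targets in [0, 1]
X = np.array([[0.5, 2.5], [2.3, 0.9], [1.0, 1.0]])
y = np.array([0.2, 0.9, 0.5])
w = np.array([0.1, -0.1])            # the parameter vector to be learned
print("loss at initial w:", loss(w, X, y))
```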
Typical Supervised Learning: Guesswork
➢ Training the Network: Finding w* and b* manually
▪ With some guesswork, we are able to find the optimal values of w and b.
Error Surface for Guesswork
➢ Geometric interpretation of our “guess work” algorithm in terms of this error surface
Gradient Descent: A More Efficient and Principled Way
➢ Thus, $\mathcal{L}(\theta + \eta u) - \mathcal{L}(\theta) \approx \eta\, u^{T}\nabla_{\theta}\mathcal{L}(\theta) = \eta\, k \cos\beta$ (where $k = \lVert u\rVert\,\lVert\nabla_{\theta}\mathcal{L}(\theta)\rVert$), which is most negative when $\cos\beta = -1$, i.e., $\beta = 180°$
➢ The direction u that we intend to move in should be at 180° w.r.t. the gradient
➢ In other words, move in a direction opposite to the gradient
Gradient Descent
➢ Algorithm: initialize w and b, then repeatedly take a step of size η in the direction opposite to the gradient, until convergence (see the sketch below)
➢ For two training points, the per-point gradients ∇w and ∇b are summed before each update
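A minimal sketch of this loop for a sigmoid neuron with squared error loss on a two-point toy dataset (the data values, initialization, and learning rate η are illustrative choices, not taken from the slides):

```python
import numpy as np

def f(w, b, x):
    """Sigmoid neuron: 1 / (1 + exp(-(w*x + b)))."""
    return 1.0 / (1.0 + np.exp(-(w * x + b)))

def grad_w(w, b, x, y):
    """d/dw of (1/2)(f - y)^2 for one point."""
    fx = f(w, b, x)
    return (fx - y) * fx * (1 - fx) * x

def grad_b(w, b, x, y):
    """d/db of (1/2)(f - y)^2 for one point."""
    fx = f(w, b, x)
    return (fx - y) * fx * (1 - fx)

X, Y = [0.5, 2.5], [0.2, 0.9]        # two training points
w, b, eta = -2.0, -2.0, 1.0          # initial guess and learning rate
for epoch in range(1000):
    dw = sum(grad_w(w, b, x, y) for x, y in zip(X, Y))
    db = sum(grad_b(w, b, x, y) for x, y in zip(X, Y))
    w -= eta * dw                    # move opposite to the gradient
    b -= eta * db
print(w, b)
```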
Feed-forward Neural Network
➢ The input layer can be called the 0th layer and the output layer can be called the Lth layer
➢ 𝑊𝑖 ∈ ℝ𝑛×𝑛 and 𝑏𝑖 ∈ ℝ𝑛 are the weights and bias between layers 𝑖 − 1 and 𝑖 (0 < 𝑖 < 𝐿)
➢ 𝑊𝐿 ∈ ℝ𝑘×𝑛 and 𝑏𝐿 ∈ ℝ𝑘 are the weights and bias between the last hidden layer and the output layer (L = 3 in this case)
➢ The pre-activation at layer 𝑖 is given by,
𝑎𝑖 = 𝑏𝑖 + 𝑊𝑖 ℎ𝑖−1
➢ The activation at layer 𝑖 is given by,
ℎ𝑖 = 𝑔(𝑎𝑖 )
▪ where g is called the activation function (for example, logistic, tanh, linear, etc.)
➢ The activation at the output layer is given by,
𝑓(𝑥) = ℎ𝐿 = 𝑂(𝑎𝐿)
▪ where O is the output activation function (for example, softmax, linear, etc.)
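A minimal sketch of this forward pass, assuming L = 3 layers, hidden width n, output width k, a logistic g, and a linear output activation O (all illustrative choices):

```python
import numpy as np

def g(a):
    """Hidden activation function (logistic, as one possible choice)."""
    return 1.0 / (1.0 + np.exp(-a))

def forward(x, Ws, bs, output_fn=lambda a: a):
    """Compute a_i = W_i h_{i-1} + b_i and h_i = g(a_i) layer by layer;
    the last layer uses the output activation O instead of g."""
    h = x
    for i, (W, b) in enumerate(zip(Ws, bs), start=1):
        a = W @ h + b                      # pre-activation at layer i
        h = output_fn(a) if i == len(Ws) else g(a)
    return h                               # f(x) = h_L = O(a_L)

n, k = 4, 3                                # hidden width and number of outputs
rng = np.random.default_rng(0)
Ws = [rng.normal(size=(n, n)), rng.normal(size=(n, n)), rng.normal(size=(k, n))]
bs = [rng.normal(size=n), rng.normal(size=n), rng.normal(size=k)]
x = rng.normal(size=n)
print(forward(x, Ws, bs))                  # linear output: O(a_L) = a_L
```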
➢ Now 𝛻𝜃 looks much nastier
➢ 𝛻𝜃 is thus composed of:
𝛻𝑊1 , 𝛻𝑊2 , … , 𝛻𝑊𝐿−1 ∈ ℝ𝑛×𝑛 , 𝛻𝑊𝐿 ∈ ℝ𝑘×𝑛
𝛻𝑏1 , 𝛻𝑏2 , … , 𝛻𝑏𝐿−1 ∈ ℝ𝑛 , 𝛻𝑏𝐿 ∈ ℝ𝑘
➢ We need to answer two questions:
▪ How to choose the loss function ℒ(𝜃)?
▪ How to compute 𝛻𝜃 which is composed of
𝛻𝑊1 , 𝛻𝑊2 , … , 𝛻𝑊𝐿−1 ∈ ℝ𝑛×𝑛 , 𝛻𝑊𝐿 ∈ ℝ𝑘×𝑛
𝛻𝑏1 , 𝛻𝑏2 , … , 𝛻𝑏𝐿−1 ∈ ℝ𝑛 , 𝛻𝑏𝐿 ∈ ℝ𝑘
Output Functions and Loss Functions
Loss function 𝓛(𝜽)?
➢ The choice of loss function depends on the problem at hand.
➢ We will illustrate this with the help of two examples
Loss function 𝓛(𝜽)?
➢ Consider our movie example again, but this time we are interested in predicting ratings
➢ Here, 𝑦𝑗 ∈ ℝ³
➢ The loss function should capture how much ŷ𝑗 deviates from 𝑦𝑗
➢ If 𝑦𝑗 ∈ ℝ𝑘 then the squared error loss can capture this deviation
$$\mathcal{L}(\theta) = \frac{1}{N}\sum_{i=1}^{N}\sum_{j=1}^{k}\left(\hat{y}_{ij} - y_{ij}\right)^{2}$$
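A minimal sketch of computing this loss for N examples with k outputs each (the toy numbers are made up for illustration):

```python
import numpy as np

def squared_error(y_hat, y):
    """(1/N) * sum_i sum_j (y_hat_ij - y_ij)^2 for N examples with k outputs each."""
    return np.sum((y_hat - y) ** 2) / y.shape[0]

# Toy example: N = 2 movies, k = 3 predicted ratings each
y     = np.array([[0.8, 0.1, 0.5],
                  [0.2, 0.9, 0.7]])
y_hat = np.array([[0.7, 0.2, 0.4],
                  [0.1, 0.8, 0.9]])
print(squared_error(y_hat, y))
```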
Loss function 𝓛(𝜽)?
➢ A related question: What should the output function 'O' be if 𝑦𝑗 ∈ ℝ?
➢ More specifically, can it be the logistic function?
▪ No, because it restricts ŷ𝑗 to a value between 0 and 1, but we want 𝑦𝑗 ∈ ℝ
➢ So, in such cases it makes sense to have 'O' as a linear function
➢ ŷ𝑗 is then no longer bounded between 0 and 1
Entropy
➢ Expectation: $\mathbb{E}[V] = \sum_{i=1}^{n} p_i\, V(i)$
➢ Entropy: $H(p) = -\sum_{i=1}^{n} p_i \log p_i$
➢ Cross entropy: $H(p, q) = -\sum_{i=1}^{n} p_i \log q_i$
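A small numeric illustration of the three quantities (the distributions p, q and the values V are arbitrary):

```python
import numpy as np

p = np.array([0.7, 0.2, 0.1])          # true distribution
q = np.array([0.5, 0.3, 0.2])          # predicted distribution
v = np.array([10.0, 0.0, -5.0])        # values of a random variable

expectation   = np.sum(p * v)              # E[V] = sum_i p_i * V(i)
entropy       = -np.sum(p * np.log(p))     # H(p) = -sum_i p_i log p_i
cross_entropy = -np.sum(p * np.log(q))     # H(p, q) = -sum_i p_i log q_i

print(expectation, entropy, cross_entropy)
# Cross entropy is minimized (and equals the entropy) when q matches p.
```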
Loss function 𝓛(𝜽)?
➢ Now let us consider another problem for which a different loss function would be appropriate
➢ Suppose we want to classify an image into 1 of k classes
➢ Here, again we could use the squared error loss to capture the deviation
➢ But, can you think of a better function?
Loss function 𝓛(𝜽)?
➢ Notice that y is a probability distribution
➢ Therefore, we should also ensure that ŷ is a probability distribution
➢ What choice of the output activation 'O' will ensure this?
$$a_L = W_L h_{L-1} + b_L$$
$$\hat{y}_j = O(a_L)_j = \frac{e^{a_{L,j}}}{\sum_{i=1}^{k} e^{a_{L,i}}}$$
➢ $O(a_L)_j$ is the $j$th element of ŷ and $a_{L,j}$ is the $j$th element of the vector $a_L$
➢ This function is called the softmax function
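A minimal sketch of the softmax output activation; the max-subtraction is a standard numerical-stability trick not mentioned on the slides and does not change the result:

```python
import numpy as np

def softmax(a_L):
    """O(a_L)_j = exp(a_L,j) / sum_i exp(a_L,i); the output sums to 1."""
    z = a_L - np.max(a_L)          # subtract the max for numerical stability
    e = np.exp(z)
    return e / np.sum(e)

a_L = np.array([2.0, 1.0, 0.1, -1.0])   # pre-activation at the output layer
y_hat = softmax(a_L)
print(y_hat, y_hat.sum())                # a valid probability distribution
```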
Loss function 𝓛(𝜽)?
➢ Now that we have ensured that both y and ŷ are probability distributions, can you think of a function which captures the difference between them?
➢ Cross-entropy:
$$\mathcal{L}(\theta) = -\sum_{c=1}^{k} y_c \log \hat{y}_c$$
➢ So, for a classification problem (where you have to choose 1 of k classes), we use the following objective function; since y is one-hot with true class $l$, only the term for $c = l$ survives:
$$\min_{\theta}\; \mathcal{L}(\theta) = -\sum_{c=1}^{k} y_c \log \hat{y}_c = -\log \hat{y}_l$$
or equivalently
$$\max_{\theta}\; -\mathcal{L}(\theta) = \log \hat{y}_l$$
➢ $\log \hat{y}_l$ is called the log-likelihood of the data
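A minimal sketch of the cross-entropy loss for a one-hot label, showing that it reduces to the negative log-likelihood of the true class:

```python
import numpy as np

def cross_entropy(y, y_hat):
    """L(theta) = -sum_c y_c log(y_hat_c); for one-hot y this equals -log(y_hat_l)."""
    return -np.sum(y * np.log(y_hat))

y     = np.array([0.0, 1.0, 0.0, 0.0])        # true class l = 1 (one-hot)
y_hat = np.array([0.1, 0.7, 0.1, 0.1])        # softmax output of the network
print(cross_entropy(y, y_hat))                # equals -log(0.7)
print(-np.log(y_hat[1]))
```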
Loss function 𝓛(𝜽)?
➢ Is ŷ𝑙 a function of 𝜃 = [𝑊1, 𝑊2, …, 𝑊𝐿, 𝑏1, 𝑏2, …, 𝑏𝐿]?
➢ Yes, it is indeed a function of 𝜃:
$$\hat{y}_l = \left[O\!\left(W_3\, g\!\left(W_2\, g\!\left(W_1 x_i + b_1\right) + b_2\right) + b_3\right)\right]_l$$
➢ What does ŷ𝑙 encode?
➢ It is the probability that 𝑥 belongs to the 𝑙th class (we want to bring it as close to 1 as possible).
Loss function 𝓛(𝜽)?

                     Real Values       Probabilities
Output Activation    Linear            Softmax
Loss Function        Squared Error     Cross Entropy

➢ Of course, there could be other loss functions depending on the problem at hand, but the two loss functions that we just saw are encountered very often
➢ For the rest of this lecture, we will focus on the case where the output activation is a softmax function and the loss function is cross entropy
Any questions….