Introduction to Neural Networks
Lecture 10-11: Data Science
Outline
• Introduction to Neural Networks
• Mathematical Model of a Neural Network
• Differentiation and its Application to Training Neural Networks
Artificial Neural Network
• An Artificial Neural Network (ANN) is a mathematical model that loosely simulates the structure and functionality of the biological nervous system to map inputs to outputs.
Block Diagram of Biological Nervous System
Stimulus → Receptors → Neural Network (or Brain) → Effectors → Response
Typical Human Brain
[Figure: biological neuron with the cell body labeled]
Human Brain Neuron vs Artificial Neuron
Artificial Neuron
[Figure: the k-th artificial neuron: inputs $x_1, \dots, x_n$ are weighted by $W_{k1}, \dots, W_{kn}$ and summed with the bias $b_k$ to give $V_k$]
$V_k = W_{k1} x_1 + W_{k2} x_2 + W_{k3} x_3 + \dots + W_{kn} x_n + b_k$
Artificial Neuron
[Figure: the k-th artificial neuron: the weighted sum $V_k$ is passed through an activation function to produce the output $y_k$]
$V_k = \sum_{j=1}^{n} W_{kj} x_j + b_k$
$y_k = f(V_k)$
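A minimal sketch of these two formulas in Python (NumPy assumed; the function name and sample numbers are illustrative, not from the slides):

```python
import numpy as np

def neuron_output(x, w, b, f=lambda v: v):
    """Forward pass of a single artificial neuron.

    x : inputs (x1..xn), w : weights (Wk1..Wkn), b : bias bk,
    f : activation function (identity by default).
    """
    v = np.dot(w, x) + b      # Vk = sum_j Wkj * xj + bk
    return f(v)               # yk = f(Vk)

# Example with three inputs
x = np.array([1.0, 2.0, 3.0])
w = np.array([0.5, -0.2, 0.1])
b = 0.4
print(neuron_output(x, w, b))   # 0.8
```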
Single Neuron Model
[Figure: single-input neuron with weight $W_{k1}$ and bias $b_k$]
The output $y_k$ is linearly dependent on the input parameters.
$V_k = W_{k1} x_1 + b_k$
$y_k = f(V_k) = W_{k1} x_1 + b_k$
Single Neuron Model
• Application
– For data-fitting applications where we have to fit a straight line $y = mx + c$ to a large data set, where m = slope of the straight line, c = intercept, x = height, and y = weight (a fitting sketch follows the figure below).
[Figure: scatter plot of weight (y-axis) against height (x-axis) with a fitted straight line $y = mx + c$]
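A minimal fitting sketch in Python, assuming NumPy; the height/weight numbers are made up purely for illustration, and `np.polyfit` is used here as one standard way to obtain the least-squares slope and intercept:

```python
import numpy as np

# Illustrative (made-up) height/weight pairs, as in the scatter plot above.
height = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0])
weight = np.array([18.0, 29.0, 43.0, 55.0, 66.0, 80.0])

# Least-squares fit of y = m*x + c (degree-1 polynomial).
m, c = np.polyfit(height, weight, 1)
print(f"slope m = {m:.2f}, intercept c = {c:.2f}")

# Predicted weight for a new height.
print(m * 3.5 + c)
```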
Single Neuron Model
Error Calculation
• The error $E_i = (\text{actual value} - \text{predicted value}) = (T_i - y_i)$
• To make it positive, we square it: $E_i = (T_i - y_i)^2$ [error for the i-th input instance]
[Figure: weight vs. height scatter plot with the fitted straight line]
Linear Neural Network
• Error Calculation
– The error is calculated so that the slope (m) and intercept (c) can be adjusted for a better fit in the next iteration.
[Figure: weight vs. height scatter plot with the fitted straight line]
Linear Neural Network
$y_k = W_{k1} x_1 + b_k$ corresponds to the straight line $y = m x + c$.
[Figure: single-input neuron with weight $W_{k1}$ and bias $b_k$; the output is linearly dependent on the input]
$V_k = W_{k1} x_1 + b_k$
$y_k = f(V_k) = W_{k1} x_1 + b_k$
Plotting Error
[Figure: the error plotted against the weight $W_{k1}$]
Differentiation…
$y = f(x), \qquad \frac{dy}{dx} = \frac{df}{dx} = y' = f'$
[Figure: curve $y = f(x)$ with a secant line through $(x_1, y_1)$ and $(x_2, y_2)$ making angle $\theta$ with the x-axis]
How much does $y$ change as $x$ changes?
$\frac{\Delta y}{\Delta x} = \frac{y_2 - y_1}{x_2 - x_1} = \frac{p}{b} = \tan(\theta)$
$\frac{dy}{dx} = \lim_{\Delta x \to 0} \frac{\Delta y}{\Delta x}$
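A minimal numerical sketch of this limit in Python (the function and step size are illustrative):

```python
def numerical_derivative(f, x, dx=1e-6):
    """Approximate dy/dx = lim_{dx->0} (f(x+dx) - f(x)) / dx."""
    return (f(x + dx) - f(x)) / dx

# Example: the derivative of f(x) = x^2 at x = 3 is 6.
print(numerical_derivative(lambda x: x**2, 3.0))   # ~6.000001
```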
Differentiation…
$y = f(x), \qquad \frac{dy}{dx} = \lim_{\Delta x \to 0} \frac{\Delta y}{\Delta x}$
As $\Delta x \to 0$ we obtain the tangent at $x$.
[Figure: tangent line to $y = f(x)$ at $x = x_1$, making angle $\theta$ with the x-axis]
$\frac{dy}{dx} = \tan(\theta)$ = slope of the tangent to the x-axis at $x = x_1$
Differentiation…
$y = f(x)$
[Figure: rising tangent at $x = x_1$]
For $0^\circ < \theta < 90^\circ$, $\tan(\theta)$ is positive; at $\theta = 90^\circ$, $\tan(90^\circ)$ is undefined.
Differentiation…
$y = f(x)$
[Figure: falling tangent at $x = x_1$]
For $\theta > 90^\circ$, $\tan(\theta)$ is negative.
Differentiation…
$y = f(x)$
[Figure: horizontal tangent at $x = x_1$]
For $\theta = 0^\circ$, $\tan(\theta) = 0$.
Differentiation…
$y = f(x)$
For $\theta = 0^\circ$, $\tan(\theta) = 0$.
[Figure: curve with a maximum and a minimum, both with horizontal tangents]
Note: at a minimum and at a maximum the slope is 0, i.e. $\tan(\theta) = 0$ and $\frac{dy}{dx} = 0$.
Differentiation…
Distinguishing between a Minimum and a Maximum
Let $f(x) = x^2 - 3x + 2$.
Set $\frac{df}{dx} = 0$: $2x - 3 = 0$, so $x = 1.5$, and $f(1.5) = -0.25$.
Take a point near 1.5, say $x = 1$: $f(1) = 1 - 3 + 2 = 0$, which is larger than $f(1.5)$.
So $x = 1.5$ cannot be a maximum; it is a minimum. (Equivalently, the second derivative $f''(x) = 2 > 0$ confirms a minimum.)
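The same check as a small Python sketch (the nearby points are chosen purely for illustration):

```python
# A quick numeric check that x = 1.5 is a minimum of f(x) = x^2 - 3x + 2:
# nearby points on both sides give larger function values.
def f(x):
    return x**2 - 3*x + 2

print(f(1.5))             # -0.25
print(f(1.4) > f(1.5))    # True
print(f(1.6) > f(1.5))    # True, so x = 1.5 is a minimum
```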
Error Function with a Minimum and No Maximum
[Figure: error function $y = f(x)$ with a single minimum and no maximum]
Error Function with a Maximum and No Minimum
[Figure: error function $y = f(x)$ with a single maximum and no minimum]
Error Function without a Maximum or a Minimum
[Figure: monotonic error function $y = f(x)$ with neither a maximum nor a minimum]
Error Function with multiple Maxima and Minima
[Figure: error function $y = f(x)$ with multiple maxima and minima; the global minimum and a local minimum are marked]
TRAINING A SINGLE-NEURON MODEL
[Figure: the k-th neuron: inputs $x_{i1}, \dots, x_{id}$ weighted by $W_{k1}, \dots, W_{kd}$, summed with the bias $b_k$, and passed through an activation function to give $y'_k$]
$V_k = \sum_{j=1}^{d} W_{kj} x_{ij} + b_k, \qquad y'_k = f(V_k), \qquad L = \sum_{i=1}^{n} \left( y_i - f(w^T x_i + b) \right)^2$
• Step-1: Define the loss function
• Step-2: Define the optimization problem
TRAINING A SINGLE-NEURON MODEL
[Figure: single neuron with inputs $x_{i1}, \dots, x_{in}$, weights $W_1, \dots, W_n$, bias $b_k$, and an activation function producing $y'_k$]
$V_k = \sum_{j=1}^{d} W_j x_{ij} + b_k, \qquad y'_k = f(V_k), \qquad L = (y - y')^2$
• Step-3: Solve the optimization problem (a minimal sketch follows)
– Randomly initialize the weights
– Feed the inputs forward and compute the loss function
– Update the weights using the gradients of the loss, e.g. $\frac{\partial L}{\partial b} = -2(y - y')$ and $\frac{\partial L}{\partial w} = -2(y - y')\,x$
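A minimal sketch of Steps 1-3 in Python for a single-input neuron with an identity activation; NumPy is assumed, and the data, learning rate, and epoch count are illustrative choices, not values from the slides:

```python
import numpy as np

# Illustrative data: fit y = w*x + b (single input, identity activation).
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([3.1, 5.0, 7.2, 8.9, 11.1])

w, b = np.random.randn(), np.random.randn()   # Step 3a: random initialization
lr = 0.01                                     # learning rate (illustrative)

for epoch in range(1000):
    y_pred = w * x + b                        # Step 3b: feed forward
    # Gradients of the squared-error loss, averaged over the data:
    # dL/dw = -2*(y - y')*x, dL/db = -2*(y - y')
    grad_w = np.mean(-2 * (y - y_pred) * x)
    grad_b = np.mean(-2 * (y - y_pred))
    w -= lr * grad_w                          # Step 3c: update the weights
    b -= lr * grad_b

print(w, b)   # close to the slope and intercept of the data
```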
TYPES OF NEURAL NETWORK
WHY MULTILAYER NEURAL NETWORK?
• Biological Inspiration
• Universal Approximators: Can approximate any nonlinear
function to any desired level of accuracy.
• Results in Powerful Models
TRAINING MULTILAYER NEURAL NETWORK
Sample labeled data → Randomly initialize the weights → Forward it through the network, get predictions → Back-propagate the errors → Update the network weights
• Back-Propagation: Chain Rule + Memoization
– In Stochastic Gradient Descent (SGD) you take one point (input vector) per update
– In Mini-Batch SGD, you take a small set of points (input vectors)
– In Gradient Descent, you take all the input vectors (see the sketch below)
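A small Python sketch of how the three variants pick the points used for one weight update (the array shapes and batch size are illustrative):

```python
import numpy as np

n = 1000                       # number of training points (illustrative)
X = np.random.randn(n, 3)      # input vectors
indices = np.arange(n)

# Stochastic Gradient Descent: one point per update.
i = np.random.randint(n)
batch_sgd = X[i:i + 1]

# Mini-Batch SGD: a small set of points per update.
batch_mini = X[np.random.choice(indices, size=32, replace=False)]

# (Full) Gradient Descent: all points per update.
batch_full = X

print(batch_sgd.shape, batch_mini.shape, batch_full.shape)  # (1, 3) (32, 3) (1000, 3)
```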
AI vs Machine Learning vs Deep Learning
Deep Learning
• A type of machine learning based on artificial
neural networks in which multiple layers of
processing are used to extract progressively higher
level features from data.
- "Deep Learning with Python", François Chollet
DEEP LEARNING APPROACH
• Standard Approach (the mathematician's way)
– Build new theories
– Perform experiments
• New Deep Learning Approach (the engineer's way)
– Given a huge amount of computational power,
– people first experiment and then try to build a theory
Why Deep Learning ? Why Now ?
• Computer Vision: Convolutional Neural Networks and backpropagation, well understood since 1989
• Time Series Forecasting: Long Short-Term Memory (LSTM), well understood since 1997
- "Deep Learning with Python", François Chollet
Algorithmic Advancements…
• Better activation functions for neural layers.
• Better weight-initialization schemes, starting with layer-wise pretraining.
• Concepts like Dropout were introduced to avoid overfitting.
• Better optimization schemes, such as RMSProp and Adam.
Activation Functions…
• An Activation Function (Transfer Function) maps the weighted summation of the inputs to the output.
• An activation function is used to add nonlinearity so that the network can learn complex patterns.
Sigmoid Activation Function
• Characteristics:
– Differentiable
– Nonlinear
– Output lies in [0, 1]
– Fast
– Suffers from the Vanishing Gradient Problem
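A minimal Python sketch of the sigmoid and its derivative (NumPy assumed):

```python
import numpy as np

def sigmoid(x):
    """Sigmoid activation: output lies between 0 and 1."""
    return 1.0 / (1.0 + np.exp(-x))

def sigmoid_derivative(x):
    """Derivative is sigmoid(x) * (1 - sigmoid(x)), at most 0.25."""
    s = sigmoid(x)
    return s * (1.0 - s)

print(sigmoid(0.0))              # 0.5
print(sigmoid_derivative(0.0))   # 0.25 (its maximum value)
```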
VANISHING GRADIENT PROBLEM
• With the sigmoid activation function the derivative is less than 1, and when these derivatives are multiplied together along the chain the result is a very small number, which changes the weights only very slightly.
• It usually occurs when the derivatives are less than 1.
• It occurs frequently with the sigmoid and tanh activation functions.
$\frac{dL}{dw} = \frac{dL}{df_1} \times \frac{df_1}{df_2} \times \frac{df_2}{df_3} \times \dots \times \frac{df_n}{dw}$
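A small Python sketch of the effect: the sigmoid derivative is at most 0.25, so a product of many such factors becomes tiny (the depth of 10 layers is an illustrative choice):

```python
import numpy as np

def sigmoid_derivative(x):
    s = 1.0 / (1.0 + np.exp(-x))
    return s * (1.0 - s)     # never larger than 0.25

# Multiplying many per-layer derivatives (each < 1) shrinks the gradient.
grad = 1.0
for layer in range(10):
    grad *= sigmoid_derivative(0.0)   # 0.25 at its largest
print(grad)   # 0.25**10 is about 9.5e-7, so the early weights barely change
```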
ReLU Activation Function
• f(x)= x, when x>0
= 0, when x<=0
• Avoids Vanishing Gradient Problem.
• Derivative is Simple
– f’(x)= 1 for x>=0
= 0 for x<0
• Problem:
– Dead ReLU Units
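A minimal Python sketch of ReLU and its derivative, following the definition above (NumPy assumed):

```python
import numpy as np

def relu(x):
    """ReLU: f(x) = x for x > 0, else 0."""
    return np.maximum(0.0, x)

def relu_derivative(x):
    """f'(x) = 1 for x >= 0, else 0 (the convention used on the slide)."""
    return np.where(x >= 0, 1.0, 0.0)

print(relu(np.array([-2.0, 0.0, 3.0])))             # [0. 0. 3.]
print(relu_derivative(np.array([-2.0, 0.0, 3.0])))  # [0. 1. 1.]
```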
Leaky ReLU Activation Function
• f(x)= x, when x>0
= 0.1x, when x<=0
• The advantages of Leaky ReLU are the same as those of ReLU.
• In addition, it enables Backpropagation, even for
negative input values.
• Avoids Dead ReLU
• Simple Derivative
– f’(x)= 1 for x>=0
= 0.1 for x<0
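A minimal Python sketch of Leaky ReLU with the 0.1 slope used above (NumPy assumed):

```python
import numpy as np

def leaky_relu(x, alpha=0.1):
    """Leaky ReLU: f(x) = x for x > 0, else alpha * x (alpha = 0.1 as on the slide)."""
    return np.where(x > 0, x, alpha * x)

def leaky_relu_derivative(x, alpha=0.1):
    """f'(x) = 1 for x >= 0, else alpha, so gradients still flow for negative inputs."""
    return np.where(x >= 0, 1.0, alpha)

print(leaky_relu(np.array([-2.0, 3.0])))             # [-0.2  3. ]
print(leaky_relu_derivative(np.array([-2.0, 3.0])))  # [0.1 1. ]
```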
WEIGHT INITIALIZATION
[Figure: the error plotted against the weights]
WEIGHT INITIALIZATION
• Mostly used
– We should never initialize to same values.
• Asymmetry is necessary
– We should not initialize to large –ve values
• Vanishing Gradient problems
– Weights should be small (not too small)
– Weights should have good variance
– Weights should come from a Normal distribution with
mean zero and small variance
– Should have some +ve and Some –ve values
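A minimal Python sketch of such an initialization; the layer sizes and the standard deviation of 0.01 are illustrative choices, not prescribed by the slides:

```python
import numpy as np

fan_in, fan_out = 256, 128          # layer sizes (illustrative)

# Small, zero-mean normal initialization: the weights get good variance,
# a mix of positive and negative values, and no two neurons start identical.
W = np.random.normal(loc=0.0, scale=0.01, size=(fan_out, fan_in))
b = np.zeros(fan_out)

print(W.mean(), W.std())   # close to 0 and 0.01
```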