Unit 3

The document outlines key concepts in neural networks, including the differences between biological and artificial neurons, activation functions, and learning methods such as parameter and structured learning. It covers the perceptron model, backpropagation algorithms, and advanced topics like recurrent neural networks and swarm intelligence. Additionally, it discusses concepts like overfitting, underfitting, and the importance of loss functions in training neural networks.

Uploaded by

G0REM0ND


UNIT3

NEURAL NETWORKS

MISSION: CHRIST is a nurturing ground for an individual’s holistic development to make effective contribution to the society in a dynamic environment
VISION: Excellence and Service
CORE VALUES: Faith in God | Moral Uprightness | Love of Fellow Beings | Social Responsibility | Pursuit of Excellence
CHRIST
Deemed to be University

UNIT 3 - NEURAL NETWORKS

– Neural Network Representation (T2-4.2)
– Problems (T2-4.3)
– Perceptrons (T2-4.4)
– Multilayer Networks (T2-4.5)
– Back Propagation Algorithms (T2-4.6)
– Advanced Topics (T2-4.8)


ANN INTRODUCTION


Biological Neuron Vs Artificial Neuron


Artificial Neural Network(ANN) MODEL


ANN


Parameter learning vs Structured learning

● Parameter learning optimizes the values of the parameters within a predefined model structure to best fit the data, for example by adjusting the weights of a neural network or the coefficients of a regression model.

● In other words, parameter learning fine-tunes the model's performance within a given structure.

● Structured learning, on the other hand, learns the structure of the model itself, which may include the relationships between variables, the model architecture, or the network topology.

● In other words, structured learning searches for the model structure that most accurately represents the underlying data patterns.


• Activation functions help the network learn complex patterns and relationships in non-linear data.

• Without activation functions, a neural network would be limited to linear transformations, which would severely restrict its capability.

• They determine the output of each neuron, allowing the network to make more accurate predictions.

• Common activation functions such as ReLU, sigmoid, and tanh each have distinct properties that influence the network's performance, its convergence speed, and its susceptibility to gradient issues such as vanishing gradients.


The purpose of an activation function is to transform the summed weighted input of a node into an output value that is passed on to the next layer or used as the final output.
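The transformation described above can be sketched in a few lines of Python. This is only an illustration: the input values, weights and bias are hypothetical and do not come from the slides, and the three activation functions are the common ones named earlier (ReLU, sigmoid, tanh).

```python
import math

def relu(z):
    """ReLU: passes positive values through and clips negatives to 0."""
    return max(0.0, z)

def sigmoid(z):
    """Logistic sigmoid: squashes z into the open interval (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))

def tanh(z):
    """Hyperbolic tangent: squashes z into the open interval (-1, 1)."""
    return math.tanh(z)

def neuron_output(inputs, weights, bias, activation):
    """Compute the summed weighted input, then apply the activation."""
    z = sum(x * w for x, w in zip(inputs, weights)) + bias
    return activation(z)

# Hypothetical values chosen only for illustration.
inputs = [0.5, -1.0, 2.0]
weights = [0.4, 0.3, -0.2]
bias = 0.1

for name, f in [("relu", relu), ("sigmoid", sigmoid), ("tanh", tanh)]:
    print(name, round(neuron_output(inputs, weights, bias, f), 4))
```

The same weighted sum yields a different output under each activation, which is why the choice of activation affects what the next layer sees.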


Function may be summation or addition


[Figure: activation function output ranges, (0 to 1) and (-1 to +1)]

McCulloch-Pitts Neuron


Ex1: Sigmoid Activation Function


The binary sigmoid output lies between 0 and 1.

Thresholding that output gives 0 or 1.

The bipolar sigmoid output ranges from -1 to +1.
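A small sketch of the two sigmoid variants and the thresholding step. The bipolar form used here, 2·sigmoid(z) − 1, is the usual textbook definition, and the cutoff of 0.5 is an assumption, since the slides do not state one.

```python
import math

def binary_sigmoid(z):
    """Binary (logistic) sigmoid: output in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))

def bipolar_sigmoid(z):
    """Bipolar sigmoid: the binary sigmoid rescaled to (-1, +1)."""
    return 2.0 * binary_sigmoid(z) - 1.0

def threshold(y, cutoff=0.5):
    """Turn a binary-sigmoid output into a hard 0/1 decision.

    The cutoff of 0.5 is an assumed default, not taken from the slides.
    """
    return 1 if y >= cutoff else 0

for z in (-2.0, 0.0, 2.0):
    y = binary_sigmoid(z)
    print(z, round(y, 4), threshold(y), round(bipolar_sigmoid(z), 4))
```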

Example2: McCulloch-Pitts Neuron


Perceptron Learning
The perceptron model first multiplies every input value by its weight and then adds these products to obtain the weighted sum.

This weighted sum is then passed to the activation function “f” to obtain the predicted output. For the perceptron, this activation function is the step function.
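The forward computation of a perceptron can be sketched as follows. The weights and bias are hypothetical values, picked so that the unit behaves like a logical AND gate, and the step threshold of 0 is an assumption.

```python
def step(z, threshold=0.0):
    """Step activation: 1 when z reaches the threshold, otherwise 0."""
    return 1 if z >= threshold else 0

def perceptron_output(inputs, weights, bias=0.0):
    """Weighted sum of the inputs followed by the step function."""
    weighted_sum = sum(x * w for x, w in zip(inputs, weights)) + bias
    return step(weighted_sum)

# Hypothetical parameters that make the perceptron act as an AND gate.
weights = [1.0, 1.0]
bias = -1.5

for x1 in (0, 1):
    for x2 in (0, 1):
        print((x1, x2), perceptron_output([x1, x2], weights, bias))
```

Only the input (1, 1) pushes the weighted sum above zero, so only that case outputs 1.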


Perceptron Rule Flow Chart


Where
s: training input vector
t: target output

If the computed output and the target output are not equal, update the weights.


Ex4: Apply the perceptron learning rule to classify the given dataset into two classes, i.e. 1 and -1, using zero initial weights


Initialize
Learning rate α = 1
b = 0
t = target
w1 = w2 = w3 = w4 = 0
New w1 = w1 + Δw1


Observe Weights


• No change in weights and bias, so we can stop further iterations.

• The model is now fully trained and ready for testing.
• Final weights: w1 = -2, w2 = 2, w3 = 0, w4 = 2, b = 0


Final Neural Network

The final neural network with these final weights (-2, 2, 0, 2) can classify the given data with respect to their targets.
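The training loop behind this exercise can be sketched in Python. The training table did not survive extraction, so the four bipolar training vectors below are an assumption, chosen because they lead to the final weights reported in the slides (w1 = -2, w2 = 2, w3 = 0, w4 = 2, b = 0). The three-valued activation (1, 0 or -1) and the update Δw = α·t·x are the usual textbook form of the perceptron rule and are also assumed here.

```python
def activation(net):
    """Three-valued step: 1 if net > 0, -1 if net < 0, else 0."""
    if net > 0:
        return 1
    if net < 0:
        return -1
    return 0

def train_perceptron(samples, alpha=1, max_epochs=100):
    """Perceptron rule: update weights only when output != target."""
    n_inputs = len(samples[0][0])
    w = [0] * n_inputs   # initial weights are zero
    b = 0                # initial bias is zero
    for epoch in range(1, max_epochs + 1):
        changed = False
        for x, t in samples:
            net = b + sum(wi * xi for wi, xi in zip(w, x))
            if activation(net) != t:
                # New w_i = w_i + alpha * t * x_i, new b = b + alpha * t
                w = [wi + alpha * t * xi for wi, xi in zip(w, x)]
                b += alpha * t
                changed = True
        if not changed:      # a full pass with no update: stop
            return w, b, epoch
    return w, b, max_epochs

# Assumed dataset (not recoverable from the slides): (inputs, target).
samples = [
    ((1, 1, 1, 1), 1),
    ((-1, 1, -1, -1), 1),
    ((1, 1, 1, -1), -1),
    ((1, -1, -1, 1), -1),
]

w, b, epochs = train_perceptron(samples)
print(w, b, epochs)
```

With this assumed data the weights stop changing during the third pass, which matches the stopping condition described in the slides.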


Back Propagation Algorithm


What is Backpropagation?

● Backpropagation is a method for training neural networks. It fine-tunes the weights of the network based on the error obtained in the previous epoch (i.e., iteration). Proper tuning of the weights reduces the error and makes the model more reliable by improving its generalization.

● Backpropagation is short for “backward propagation of errors.”

● It is a useful mathematical tool for improving the accuracy of predictions in machine learning.

● It is applied to feedforward neural networks, in which the nodes never form a cycle (no feedback).


Types of Backpropagation Networks

1. Static back-propagation:
It maps a static input to a static output; the weights are updated after all data points (e.g. images) have been processed. It is useful for static classification problems such as optical character recognition (e.g. digits, alphabets).

2. Recurrent backpropagation:
● The activations are fed forward until a fixed value is reached. After that, the error is computed and propagated backward. It adjusts the weights based on sequences, taking temporal dependencies into account.

● The main difference between the two methods is that the mapping is rapid in static back-propagation, while it is non-static in recurrent backpropagation.


Back Propagation Algorithm Steps


Repeat the steps 1-4 until the stopping condition is met.

Steps in BPNN algorithm: Construct FFNN


Step2: Find Output O4, O5, O6, O7 for Nodes 4-7 respectively


Step3: Calculate Error rate


Exercise: Backpropagation NN

Epoch1

Epoch2

The previous error was -0.19; it is now reduced to -0.182.

Repeat the above steps until the error is minimal or zero; the resulting weights are then used for classification.
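The epoch-by-epoch procedure can be sketched for a tiny network. This is not the network from the exercise (its figure and numbers were lost in extraction): the 2-2-1 layout, the input, the target, the initial weights and the learning rate are all hypothetical, and sigmoid units with a squared-error loss are assumed.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def train_step(x, target, w_hidden, b_hidden, w_out, b_out, lr):
    """One forward pass, one backward pass and one weight update."""
    # Forward pass: hidden activations, then the single output.
    h = [sigmoid(sum(w * xi for w, xi in zip(ws, x)) + b)
         for ws, b in zip(w_hidden, b_hidden)]
    o = sigmoid(sum(w * hi for w, hi in zip(w_out, h)) + b_out)
    error = target - o

    # Backward pass: error terms for the output and hidden units.
    delta_o = o * (1.0 - o) * error
    delta_h = [hi * (1.0 - hi) * w * delta_o for hi, w in zip(h, w_out)]

    # Weight updates (gradient descent on the squared error).
    w_out = [w + lr * delta_o * hi for w, hi in zip(w_out, h)]
    b_out = b_out + lr * delta_o
    w_hidden = [[w + lr * dh * xi for w, xi in zip(ws, x)]
                for ws, dh in zip(w_hidden, delta_h)]
    b_hidden = [b + lr * dh for b, dh in zip(b_hidden, delta_h)]
    return error, w_hidden, b_hidden, w_out, b_out

# Hypothetical starting point, for illustration only.
x, target, lr = [1.0, 0.0], 1.0, 0.5
w_hidden, b_hidden = [[0.1, 0.2], [-0.3, 0.4]], [0.0, 0.0]
w_out, b_out = [0.5, -0.5], 0.0

errors = []
for epoch in range(50):
    error, w_hidden, b_hidden, w_out, b_out = train_step(
        x, target, w_hidden, b_hidden, w_out, b_out, lr)
    errors.append(error)

print("first error:", round(errors[0], 4), "last error:", round(errors[-1], 4))
```

As in the exercise, the error shrinks a little each epoch; training stops once it is small enough.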
Exercise: DIY


Some Basic Concepts in Machine Learning


GRADIENT DESCENT or Delta Rule
Perceptron Learning for Non Linearity


GRADIENT DESCENT or Delta Rule

Always look for Global Minima


Error Term

Where Td: target output

Od: calculated output

Weights are updated:
Weights are decreased when the slope is positive.
Weights are increased when the slope is negative.
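The formula on this slide did not survive extraction. For reference, the standard textbook form of the delta rule for a linear unit is given below; that the slide showed exactly this form is an assumption. Here η is the learning rate, D the set of training examples, and x_{id} the i-th input of example d.

```latex
E(\vec{w}) = \frac{1}{2} \sum_{d \in D} (T_d - O_d)^2,
\qquad
\Delta w_i = -\eta \, \frac{\partial E}{\partial w_i}
           = \eta \sum_{d \in D} (T_d - O_d)\, x_{id},
\qquad
w_i \leftarrow w_i + \Delta w_i
```

Because the update is the negative of the slope scaled by η, a positive slope decreases the weight and a negative slope increases it.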


How to Decide Learning Rate

In case of a higher learning rate: the updates can overshoot the minimum, so training may never converge to a solution.

In case of a smaller learning rate: training converges to the optimum solution but takes time. Hence parameter tuning is required.
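The effect of the learning rate can be seen on a toy problem. The sketch below minimizes the hypothetical function f(w) = w², whose gradient is 2w and whose minimum is at w = 0; the starting point, step count and the three learning rates are assumptions chosen for illustration.

```python
def gradient_descent(lr, w0=5.0, steps=20):
    """Run plain gradient descent on f(w) = w**2 and return the final w."""
    w = w0
    for _ in range(steps):
        grad = 2.0 * w      # derivative of w**2
        w -= lr * grad      # step against the slope
    return w

for lr in (1.1, 0.01, 0.3):
    print("lr =", lr, "-> final w =", gradient_descent(lr))
```

With lr = 1.1 every step overshoots and the value moves further from 0. With lr = 0.01 it approaches 0 but is still far away after 20 steps. With lr = 0.3 it gets very close to 0 in the same number of steps.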


Global Minima is Winner


Model Overfitting Vs Underfitting


Overfitting | Underfitting | Good fit
Tries to cover all points | Many points missed | All points covered
No best fit line | No best fit line | Best fit line
Training is ok | Training is not ok | Training is ok
Testing is not ok | Testing is not ok | Testing is ok


Final Test

Class Test


Low bias together with low variance is the desired combination

Bias: a measure of the gap between the actual and the predicted values
Variance: a measure of how widely the predictions are scattered relative to one another


Concept learning
● Concept learning in machine learning involves identifying and understanding a
target concept or pattern from a set of examples.

● It aims to distinguish between positive examples (those that fit the concept) and
negative examples (those that do not).

● This process involves generating hypotheses or rules that correctly classify new
instances based on the learned concept.

● Techniques include decision trees, rule-based systems, and neural networks.

● Concept learning is fundamental for tasks such as classification and regression,
where the goal is to generalize from observed data to make accurate predictions
on new, unseen data.

Example: Boolean valued Function

A hypothesis can be a Boolean-valued function, a linear function, or a non-linear function.


Error Functions

Which Line fits best?


Objective function: minimize the loss function


Types of Loss Functions


The error is low because there are few outliers.

The error is high because there are more outliers.


Low error

High error


Probability distributions


Hinge loss penalizes data points that are misclassified or that fall inside the margin
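The list of loss functions on these slides was lost in extraction. As a sketch, four commonly used losses are shown below; that these are the ones the slides covered is an assumption, and the sample values are hypothetical. Hinge loss expects labels of -1 or +1 and raw scores, while binary cross-entropy expects labels of 0 or 1 and predicted probabilities.

```python
import math

def mse(y_true, y_pred):
    """Mean squared error: large errors (outliers) dominate."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)

def mae(y_true, y_pred):
    """Mean absolute error: less sensitive to outliers than MSE."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)

def binary_cross_entropy(y_true, y_prob, eps=1e-12):
    """Cross-entropy between 0/1 labels and predicted probabilities."""
    total = 0.0
    for t, p in zip(y_true, y_prob):
        p = min(max(p, eps), 1.0 - eps)   # avoid log(0)
        total += -(t * math.log(p) + (1 - t) * math.log(1.0 - p))
    return total / len(y_true)

def hinge(y_true, scores):
    """Hinge loss: zero only for points beyond the margin (t * s >= 1)."""
    return sum(max(0.0, 1.0 - t * s) for t, s in zip(y_true, scores)) / len(y_true)

# Hypothetical values for illustration.
print(mse([1, 2, 3], [1, 2, 5]))
print(mae([1, 2, 3], [1, 2, 5]))
print(binary_cross_entropy([1, 0], [0.5, 0.5]))
print(hinge([1, -1], [2.0, -0.5]))
```

In the hinge example the first point lies beyond the margin and contributes nothing, while the second is correctly classified but inside the margin and is still penalized.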


How to decide the number of hidden layers and hidden neurons


ANN Model


A neural network with no hidden layer is just a linear model


When do we need hidden layers


How many Hidden layers required?


No of Neurons in Hidden layers


Advanced Topics
● RNN
● Swarm Intelligence


RNN


RNN Sequence for text Mining


SWARM INTELLIGENCE
Bio Inspired


Swarm intelligence
● Swarm intelligence is the collective behavior of decentralized, self-organized
systems, typically made up of simple agents interacting locally with one another and
their environment.

● This phenomenon is observed in nature, such as in ant colonies, bird flocking, fish
schooling, and bee hives.

● The agents follow simple rules, and although there is no centralized control, the
group exhibits complex, intelligent behavior.

● Swarm intelligence is applied in artificial intelligence and robotics, optimizing tasks
like routing, scheduling, and problem-solving by mimicking these natural processes.
Key algorithms include Ant Colony Optimization (ACO) and Particle Swarm
Optimization (PSO).
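A minimal sketch of Particle Swarm Optimization shows how simple agents following local rules can solve an optimization problem without central control. The objective (a sphere function with its minimum at the origin), the swarm size, the iteration count and the coefficients (inertia 0.7, c1 = c2 = 1.5) are all assumptions chosen for illustration, not values from the slides.

```python
import random

def sphere(position):
    """Toy objective: sum of squares, minimized at the origin."""
    return sum(x * x for x in position)

def pso(objective, dim=2, n_particles=20, iterations=100,
        inertia=0.7, c1=1.5, c2=1.5, bound=5.0, seed=0):
    rng = random.Random(seed)
    positions = [[rng.uniform(-bound, bound) for _ in range(dim)]
                 for _ in range(n_particles)]
    velocities = [[0.0] * dim for _ in range(n_particles)]

    # Each particle remembers its own best position so far.
    personal_best = [p[:] for p in positions]
    personal_best_val = [objective(p) for p in positions]

    # The swarm shares the best position found by any particle.
    best_idx = min(range(n_particles), key=lambda i: personal_best_val[i])
    global_best = personal_best[best_idx][:]
    global_best_val = personal_best_val[best_idx]

    for _ in range(iterations):
        for i in range(n_particles):
            for d in range(dim):
                r1, r2 = rng.random(), rng.random()
                # Blend momentum, own memory and the swarm's memory.
                velocities[i][d] = (
                    inertia * velocities[i][d]
                    + c1 * r1 * (personal_best[i][d] - positions[i][d])
                    + c2 * r2 * (global_best[d] - positions[i][d]))
                positions[i][d] += velocities[i][d]
            val = objective(positions[i])
            if val < personal_best_val[i]:
                personal_best[i] = positions[i][:]
                personal_best_val[i] = val
                if val < global_best_val:
                    global_best = positions[i][:]
                    global_best_val = val
    return global_best, global_best_val

best_pos, best_val = pso(sphere)
print(best_pos, best_val)
```

No particle knows where the minimum is; each one only reacts to its own best position and the best one shared by the swarm, yet the group as a whole homes in on the origin.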


SWARM INTELLIGENCE
AI is inspired not only by human intelligence but also by swarm intelligence, as in Particle Swarm Optimization (PSO)

● Bio Inspired (e.g. ants, birds, bees)

5 Principles
● Awareness: no collisions
● Autonomy: self-coordinated (no slaves)
● Solidarity: collective behavior, but members remain independent
● Scalability: members can be added dynamically
● Resilience: self-healing when members are removed

APPLICATIONS: Military Robots


Example1: Bio Inspired Computing

Ant Colony Optimization (ACO)

More accumulation of pheromones

Ants find the shortest path by following the trail with the highest concentration of pheromones (chemicals)


Ant Colony Optimization(ACO)


Example2: Swarm Intelligence(Auklets)


Demo on Swarm Intelligence

Swarm Robots DEMO


Future Military Technology


UNIT3 SUMMARY

● Activation Functions
● Neural Network Model
● Problems
● Perceptron Learning
● Multilayer Networks
● Back Propagation Algorithms-Examples
● Delta Rule/Grad Descent
● Error Rate
● Overfitting/UnderFitting
● How to select Hidden layers
● Hypothesis Space
● RNN-Introduction
● Swarm Intelligence


Digit Classification Video-30 Min

https://www.youtube.com/watch?v=zfiSAzpy9NM
