NEURAL NETWORK AND DEEP LEARNING​

UNIT 1 - NEURAL NETWORKS

12M:

Learning Rule

● A learning rule is a method or algorithm that helps a neural network improve during training.

Hebbian Learning Rule


● A simple neural learning rule that strengthens the connection between neurons when they are activated together. It follows the principle: "Neurons that fire together, wire together."
● Proposed by Donald O. Hebb; it is one of the earliest learning rules.
●​ Used for pattern classification.
●​ It is a single-layer neural network with one input layer (with n units)
and one output unit.
●​ The rule updates weights between neurons after each training
sample.

Hebbian Learning Rule Algorithm

1. Initialize weights and bias: Set all weights and the bias to zero.
2. For each input vector and target output, repeat steps 3-5.
3. Set input activations: xᵢ = sᵢ (each input unit takes the value of the corresponding training input).
4. Set output: y = t.
5. Update weights and bias using: wᵢ(new) = wᵢ(old) + xᵢ·y and b(new) = b(old) + y.

Implementing AND Gate

● 4 training samples → 4 iterations.
● Activation Function: Bipolar Sigmoid (range: [-1, 1])
● Formula: f(x) = (1 − e⁻ˣ) / (1 + e⁻ˣ)

Step 1: Initialize

●​ Weights = [0, 0, 0]ᵀ, Bias = 0.

Step 2: Input Vectors

● X1 = [-1, -1, 1]ᵀ
● X2 = [-1, 1, 1]ᵀ
● X3 = [1, -1, 1]ᵀ
● X4 = [1, 1, 1]ᵀ

Step 3: Assign Outputs

●​ Set y = t for each input.


So, the target values (t) become:

Input (X₁, X₂) | Target (t)
(-1, -1)       | -1
(-1, 1)        | -1
(1, -1)        | -1
(1, 1)         |  1

Step 4: Update Weights Using Hebbian Rule


W = W + tX

Iteration 1:

Using X1 = [-1, -1, 1]ᵀ and t = −1,

W = [0,0,0] + (−1) × [−1,−1,1] = [1, 1, -1]ᵀ

Iteration 2:

Using X2 = [-1, 1, 1]ᵀ and t = -1,

W = [1,1,−1] + (−1) × [−1,1,1] = [2, 0, -2]ᵀ

Iteration 3:

Using X3 = [1, -1, 1]ᵀ and t=−1,

W = [2,0,−2] + (−1) × [1,−1,1] = [1, 1, -3]ᵀ


Iteration 4:

Using X4 = [1, 1, 1]ᵀ and t=1,

W = [1,1,−3] + (1) × [1,1,1] = [2, 2, -2]ᵀ

Final Weights:

W = [2, 2, -2]ᵀ

Testing the Network

Each input is represented as an augmented vector (including the bias component), as listed in Step 2.

The output Y is computed using the dot product:

Y = W ⋅ X = (w1 × x1) + (w2 × x2) + (w3 × x3)

1. For (-1, -1):

Y = (2 × −1) + (2 × −1) + (−2 × 1) = −2 − 2 − 2 = −6

2. For (-1, 1):

Y = (2 × −1) + (2 × 1) + (−2 × 1) = −2 + 2 − 2 = −2

3. For (1, -1):

Y = (2 × 1) + (2 × −1) + (−2 × 1) = 2 − 2 − 2 = −2

4. For (1, 1):

Y = (2 × 1) + (2 × 1) + (−2 × 1) = 2 + 2 − 2 = 2

The outputs for the four test inputs are therefore:

Y = [−6, −2, −2, 2]

Applying the bipolar activation (negative → −1, positive → +1) gives [−1, −1, −1, 1], which matches the AND-gate targets.
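A minimal NumPy sketch of the same Hebbian procedure (variable names are my own; the notes do not give code, and the final sign-based classification step is an assumption added to show that the raw outputs map onto the AND targets):

```python
import numpy as np

# Bipolar AND-gate training set: each row is [x1, x2, bias], targets are t
X = np.array([[-1, -1, 1],
              [-1,  1, 1],
              [ 1, -1, 1],
              [ 1,  1, 1]])
t = np.array([-1, -1, -1, 1])

# Step 1: initialise weights (the bias is folded in as the third weight)
W = np.zeros(3)

# Steps 2-5: one Hebbian update per training sample, W = W + t * X
for x_i, t_i in zip(X, t):
    W = W + t_i * x_i

print("Final weights:", W)                      # [ 2.  2. -2.]

# Testing: Y = W . X for each augmented input vector
Y = X @ W
print("Raw outputs:", Y)                        # [-6. -2. -2.  2.]
print("Classified:", np.where(Y >= 0, 1, -1))   # [-1 -1 -1  1] = AND targets
```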

Decision Boundary

(Figure not reproduced: with W = [2, 2, −2], the boundary is 2x₁ + 2x₂ − 2 = 0, i.e. x₁ + x₂ = 1, which separates (1, 1) from the other three inputs.)

Perceptron Learning Rule

● Purpose: Used in supervised learning for binary classification tasks (output: +1 or −1).
● Created by: Frank Rosenblatt as a binary classifier.
●​ Components:
1.​ Input Nodes: Accept numerical values for processing.
2.​ Weights & Bias:
■​ Weights determine the strength of connections between
neurons.
■​ Bias acts as an intercept in a linear equation, helping
improve model performance.
3.​ Activation Function: Decides if the neuron will activate or not
(commonly a step function).

Types of Activation Functions:

1. Sign Function
2. Step Function
3. Sigmoid Function (output between 0 and 1)

(Figure not reproduced: range values of the sign and step functions; a code sketch follows below.)
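A short sketch of the three functions listed above, using the usual textbook definitions (the threshold conventions, e.g. step(0) = 1, are assumptions since the notes do not state them):

```python
import numpy as np

def sign_fn(x):
    """Sign function: +1 for x > 0, 0 at x = 0, -1 for x < 0."""
    return np.sign(x)

def step_fn(x):
    """Binary step: 1 if x >= 0, else 0."""
    return np.where(x >= 0, 1, 0)

def sigmoid(x):
    """Sigmoid: smooth output between 0 and 1."""
    return 1.0 / (1.0 + np.exp(-x))

z = np.array([-2.0, 0.0, 3.0])
print(sign_fn(z))    # [-1.  0.  1.]
print(step_fn(z))    # [0 1 1]
print(sigmoid(z))    # approximately [0.12 0.5  0.95]
```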

How Perceptron Works:

1. Step 1: Calculate the weighted sum of the inputs: z = w₁x₁ + w₂x₂ + … + wₙxₙ + b
2. Step 2: Apply the activation function to the weighted sum to get the output: y = f(z)

The output can be binary or continuous, depending on the activation function used.

Example of Perceptron Learning Rule:

Dataset:

X1 | X2 | Y
0  | 0  | 0
0  | 1  | 0
1  | 0  | 0
1  | 1  | 1
Initial Weights:

●​ w1 = 0.9 , w2 = 0.9
●​ Activation Threshold: 0.5
●​ Learning Rate: 0.5

Step-by-step Calculation:

1. First Instance (X1 = 0, X2 = 0):
○ Weighted sum: 0 × 0.9 + 0 × 0.9 = 0
○ Output = 0 (no error, no weight update).
2. Second Instance (X1 = 0, X2 = 1):
○ Weighted sum: 0 × 0.9 + 1 × 0.9 = 0.9
○ Output = 1, since 0.9 exceeds the 0.5 threshold (but the actual output is 0, so error = −1).
○ Update weights:

w1 = 0.9 + 0.5 × (−1) = 0.4

w2 = 0.9 + 0.5 × (−1) = 0.4

3. Third Instance (X1 = 1, X2 = 0):
○ Weighted sum: 1 × 0.4 + 0 × 0.4 = 0.4
○ Output = 0 (no error, no weight update).
4. Fourth Instance (X1 = 1, X2 = 1):
○ Weighted sum: 1 × 0.4 + 1 × 0.4 = 0.8
○ Output = 1 (correct, no weight update).

Feedforward with Updated Weights:

● After updating the weights, reapply the process to all instances.
● First Instance (X1 = 0, X2 = 0):
○ Weighted sum: 0 × 0.4 + 0 × 0.4 = 0
○ Output = 0 (correct, no weight update).
● Second Instance (X1 = 0, X2 = 1):
○ Weighted sum: 0 × 0.4 + 1 × 0.4 = 0.4
○ Output = 0 (correct, no weight update).
● Third & Fourth Instances: Already classified correctly in the previous round.

This process repeats for all training data until the model consistently classifies
instances correctly.
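A minimal sketch of the training loop above. It follows the example's own convention of shifting every weight by learning_rate × error on a misclassification (a common textbook variant would also scale the update by the corresponding input); the variable names are mine:

```python
import numpy as np

X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]])   # AND-gate inputs
y = np.array([0, 0, 0, 1])                        # target outputs

w = np.array([0.9, 0.9])    # initial weights from the example
threshold = 0.5
learning_rate = 0.5

for epoch in range(10):
    errors = 0
    for x_i, t_i in zip(X, y):
        weighted_sum = np.dot(w, x_i)
        output = 1 if weighted_sum >= threshold else 0
        error = t_i - output
        if error != 0:
            # Example's convention: every weight is shifted by learning_rate * error
            w = w + learning_rate * error
            errors += 1
    if errors == 0:           # stop once a full pass makes no mistakes
        break

print("Final weights:", w)    # [0.4 0.4], matching the worked example
```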

_____________________________________________________________________________

Activation Functions

● A mathematical function applied to a neuron's output.
● Introduces non-linearity, allowing neural networks to learn complex patterns.
● Without it, networks behave like linear regression models and cannot handle complex tasks.
● Determines neuron activation based on the weighted sum of inputs and bias.

Need for Non-Linearity in Neural Networks

● Neurons rely on weights, biases, and activation functions for learning.
● Backpropagation updates weights and biases based on errors.
● Activation functions provide gradients that help in efficient learning.

Why Activation Functions Are Necessary

Without Non-Linearity

● Neurons passing weighted sums directly keep the network linear.
● Multiple layers then still act like a single-layer perceptron, limiting learning ability.

With Non-Linearity

● Non-linear activation functions (e.g., ReLU) help in learning complex patterns.
● Example (ReLU), sketched in code below:
○ ReLU: f(x) = max(0, x)
○ Hidden Layer: h = ReLU(W₁x + b₁)
○ Output Layer: y = W₂h + b₂
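A short sketch of the point above: two stacked linear layers collapse into one linear map, while inserting ReLU between them does not (the weight matrices here are random placeholders, chosen only for the demonstration):

```python
import numpy as np

rng = np.random.default_rng(0)
W1 = rng.normal(size=(4, 3))   # hidden-layer weights
W2 = rng.normal(size=(2, 4))   # output-layer weights
x = rng.normal(size=3)

# Without non-linearity: two layers are equivalent to the single matrix W2 @ W1
two_linear_layers = W2 @ (W1 @ x)
one_linear_layer = (W2 @ W1) @ x
print(np.allclose(two_linear_layers, one_linear_layer))   # True

# With ReLU between the layers: no single matrix reproduces the mapping
relu = lambda z: np.maximum(0, z)
with_relu = W2 @ relu(W1 @ x)
print(np.allclose(with_relu, one_linear_layer))            # generally False
```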

Types of Activation Functions

1. Linear Activation Function

● y = x, produces a straight-line output.
● Limitations: Cannot model complex patterns, so it is only used in output layers for regression tasks.

Non-Linear Activation Functions

2. Sigmoid Function

● Formula: σ(x) = 1 / (1 + e⁻ˣ)
● Range: 0 to 1 (useful for binary classification).
● Issues:
○ Vanishing Gradient Problem: for large positive or negative values of x, gradients become very small, slowing learning.

3. Hyperbolic Tangent (Tanh) Function

● Formula: tanh(x) = (eˣ − e⁻ˣ) / (eˣ + e⁻ˣ)
● Range: -1 to 1 (zero-centered, better than Sigmoid).
● Problem: Still suffers from vanishing gradients.

4. Rectified Linear Unit (ReLU)

● Formula: f(x) = max(0, x)
● Range: [0, ∞) (outputs non-negative values).
● Advantages:
○ Simple, fast computation.
○ Helps avoid the vanishing gradient problem.
● Issue: Can cause "dead neurons" (neurons that always output 0 for negative inputs).

5. Leaky ReLU

● A modification of ReLU that allows small negative values instead of 0.
● Formula: f(x) = x for x > 0, and f(x) = αx for x ≤ 0 (α is a small constant, e.g. 0.01).
● Fixes the dead neuron problem, but choosing α is crucial.

6. Parametric ReLU (PReLU)

● Similar to Leaky ReLU, but α is learned during training.
● Formula: f(x) = x for x > 0, and f(x) = αx for x ≤ 0 (α is a trainable parameter).
● Advantage: Optimized performance.
● Issue: Increases model complexity, risking overfitting.

7. Softmax Function

● Used in multi-class classification to convert outputs into probabilities.
● Formula: softmax(xᵢ) = e^(xᵢ) / Σⱼ e^(xⱼ)
● Advantage: Helps handle multiple classes effectively.
● Issue: Computationally expensive when there are many classes.
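The formulas above can be sketched in a few lines of NumPy; this is an illustrative implementation, and the max-shift inside softmax is a common numerical-stability trick rather than something the notes specify:

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))         # range (0, 1)

def tanh(x):
    return np.tanh(x)                        # range (-1, 1), zero-centered

def relu(x):
    return np.maximum(0, x)                  # range [0, inf)

def leaky_relu(x, alpha=0.01):
    return np.where(x > 0, x, alpha * x)     # small slope for negative x

def softmax(x):
    e = np.exp(x - np.max(x))                # shift for numerical stability
    return e / e.sum()                       # probabilities summing to 1

z = np.array([-2.0, 0.0, 2.0])
print(relu(z))         # [0. 0. 2.]
print(leaky_relu(z))   # [-0.02  0.    2.  ]
print(softmax(z))      # approximately [0.016 0.117 0.867]
```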

Impact of Activation Functions on Model Performance

● Training Speed: ReLU is faster; Sigmoid/Tanh can slow learning.
● Gradient Flow: ReLU lets deeper layers learn better, while Sigmoid/Tanh struggle with vanishing gradients.
● Model Complexity:
○ Softmax: Best for multi-class problems.
○ ReLU/Leaky ReLU: Good for hidden layers.

Choosing the Right Activation Function

Function   | Best For                      | Limitations
Sigmoid    | Binary classification         | Vanishing gradient problem
Tanh       | Hidden layers, zero-centered  | Still suffers from vanishing gradients
ReLU       | Hidden layers, fast training  | Dead neurons for negative values
Leaky ReLU | Fixes dead neurons            | Choosing α is tricky
Softmax    | Multi-class classification    | Computationally expensive

_____________________________________________________________________________

Single Layer Perceptron and MultiLayer Perceptron

Single-Layer Perceptron (SLP)

●​ A basic neural unit that classifies data into two categories (Binary
Classifier).
●​ Works only for linearly separable problems (e.g., AND, OR).
●​ Uses a step function to give binary output (0 or 1).
●​ No hidden layers—just input and output layers.
●​ Uses a simple learning rule (no backpropagation).

Demonstration of Single Layer Perceptron using OR and AND Function

(Figures not reproduced: design of a single neuron implementing the AND function and the OR function; a code sketch follows below.)
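As a stand-in for the missing figures, here is a minimal sketch of a single step-function neuron realising AND and OR. The specific weight and bias values are illustrative choices, not taken from the notes:

```python
import numpy as np

def slp_neuron(x, w, bias):
    """Single-layer perceptron: step activation applied to the weighted sum."""
    return 1 if np.dot(w, x) + bias >= 0 else 0

inputs = [(0, 0), (0, 1), (1, 0), (1, 1)]

for x in inputs:
    x = np.array(x)
    # Illustrative weights/biases: AND fires only when both inputs are 1,
    # OR fires when at least one input is 1.
    and_out = slp_neuron(x, w=np.array([1, 1]), bias=-1.5)
    or_out = slp_neuron(x, w=np.array([1, 1]), bias=-0.5)
    print(tuple(x), "AND:", and_out, "OR:", or_out)
```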
Multilayer Perceptron

● An MLP is a type of neural network that moves data in the forward direction only.
● It contains input, hidden, and output layers.
● All nodes are fully connected, and each node passes values only forward.
● Uses the Backpropagation algorithm to improve accuracy during training.

Working of MultiLayer Perceptron Neural Network

1. Input Layer:
○ Represents the features of the dataset.
○ Passes the input vector values to the hidden layer.
2. Hidden Layer:
○ Each edge has a weight that is multiplied by the input variable.
○ The weighted values from all nodes are summed together.
○ An activation function determines which nodes should activate.
3. Output Layer:
○ Processes the activated values and generates the final output.
4. Error Calculation:
○ The difference between the predicted output and the actual output is calculated.
5. Backpropagation:
○ The network adjusts weights to reduce the error and improve accuracy.

Designing of Non-Linear Problem using Multilayer Perceptron

(Figure not reproduced: an MLP solving a non-linearly separable problem such as XOR; a code sketch follows below.)
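A minimal sketch of how an MLP handles a non-linearly separable problem such as XOR. The hidden-layer weights below are hand-picked for illustration (one OR-like unit and one NAND-like unit combined by an AND-like output unit), not learned by backpropagation:

```python
import numpy as np

def step(z):
    """Step activation: 1 where z >= 0, else 0."""
    return (z >= 0).astype(int)

# Hand-picked weights: hidden layer computes OR and NAND, output combines them
W_hidden = np.array([[ 1,  1],     # OR-like hidden unit
                     [-1, -1]])    # NAND-like hidden unit
b_hidden = np.array([-0.5, 1.5])
w_out = np.array([1, 1])           # AND-like output unit
b_out = -1.5

for x in [(0, 0), (0, 1), (1, 0), (1, 1)]:
    h = step(W_hidden @ np.array(x) + b_hidden)   # hidden activations
    y = step(np.dot(w_out, h) + b_out)            # final output = XOR(x1, x2)
    print(x, "->", int(y))
```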
Difference Between SLP, MLP, and Deep Learning Networks
Aspect              | Single-Layer Perceptron (SLP)                         | Multi-Layer Perceptron (MLP)                               | Deep Learning Networks
Architecture        | One input layer, one output layer (no hidden layers)  | Input, hidden, and output layers                           | Many layers (deep architecture)
Problem Solvability | Solves only linearly separable problems (e.g., AND)   | Solves linear & non-linear problems (e.g., XOR)            | Handles highly complex tasks
Activation Function | Step function (binary output)                         | Non-linear functions (ReLU, Sigmoid, Tanh)                 | Advanced activations (Softmax, Leaky ReLU, etc.)
Learning Algorithm  | Perceptron Learning Rule, no backpropagation          | Backpropagation and Gradient Descent                       | Advanced optimization techniques (e.g., Adam, RMSprop)
Output              | Binary (0 or 1)                                       | Continuous (regression) or multi-class (classification)   | Highly flexible output (text, images, signals, etc.)
Applications        | Basic binary classification                           | Complex tasks (classification, regression, image recognition) | AI-based applications (computer vision, NLP, robotics)
Complexity          | Simple and limited                                    | More complex and powerful                                  | Highly complex and scalable

_____________________________________________________________________________
