
6. Perceptron Algorithm
Unit-2: Supervised Learning

The Perceptron algorithm, developed by Frank Rosenblatt in 1958, is a type of artificial neural network used
primarily for binary classification tasks. This foundational model has evolved significantly, influencing
modern deep learning networks. The following sections cover its components, working, variations, and
learning procedure.

6.1. Basic Components of Perceptron


• Input Values or Features: Each input represents a feature of the dataset.

• Weights and Bias: Weights scale the influence of each input feature; the bias term shifts the decision
boundary so the model is not forced through the origin.

• Activation Function: Determines the neuron's output based on the weighted sum of inputs.

• Output: Typically binary, based on the threshold set by the activation function.

6.2. Types of Activation Functions


• Step Function: Traditional binary output, useful for basic decision-making tasks.

• Sigmoid and Hyperbolic Tangent (tanh): Provide smooth, gradual outputs, making them suitable for
two-class classification and for gradient-based training.

• ReLU: Popular in deeper networks for its training efficiency and for mitigating the vanishing gradient
problem. The common options are sketched below.
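A minimal sketch of these activation functions in NumPy (the function names and the use of NumPy are our illustrative choices, not part of the source material):

```python
import numpy as np

def step(z):
    """Classic perceptron activation: outputs 1 if z >= 0, else 0."""
    return np.where(z >= 0, 1, 0)

def sigmoid(z):
    """Smooth squashing to (0, 1); suited to binary classification."""
    return 1.0 / (1.0 + np.exp(-z))

def tanh(z):
    """Squashes to (-1, 1); a zero-centred relative of the sigmoid."""
    return np.tanh(z)

def relu(z):
    """Rectified Linear Unit: passes positive values, zeroes the rest."""
    return np.maximum(0.0, z)
```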

6.3. Working of Perceptrons


• Computation: The perceptron computes a weighted sum of the inputs and adds a bias:
$z = w_1 x_1 + w_2 x_2 + \dots + w_n x_n + b$.

• Activation: The activation function is applied to $z$ to produce the output, either 0 or 1 for simple tasks,
or continuous values when non-linear activations are used in multi-layer networks.

• Thresholding: The basic perceptron uses a threshold to decide the output: the function emits one value
if $z$ is above the threshold and the other if it is below, as in the sketch after this list.
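A minimal sketch of this computation, assuming NumPy and hand-picked illustrative weights:

```python
import numpy as np

def perceptron_output(x, w, b, threshold=0.0):
    """Weighted sum plus bias, then hard thresholding."""
    z = np.dot(w, x) + b             # z = w1*x1 + ... + wn*xn + b
    return 1 if z > threshold else 0

# Example: a 2-input perceptron (weights chosen for illustration only)
x = np.array([1.0, 0.0])
w = np.array([0.5, 0.5])
b = -0.2
print(perceptron_output(x, w, b))    # -> 1, since z = 0.3 > 0
```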

6.4. Types of Perceptron Models


• Single-layer Perceptron

• Characteristics: Consists of a single neuron with no hidden layers, suitable for simple, linearly
separable problems.

• Advantages:

• Simplicity: Easy to implement and understand.

• Efficiency: Requires less computational resources.

• Disadvantages:

• Limited Capacity: Cannot solve non-linear problems (e.g., XOR problem).

• Prone to errors in non-linearly separable datasets.

6.5. Multi-layer Perceptron (MLP)


• Characteristics: Comprises an input layer, one or more hidden layers, and an output layer.

• Advantages:

• Versatility: Can approximate virtually any function and solve complex, non-linear problems.

• Robustness: Better at generalizing from the training data.

• Disadvantages:

• Complexity: More parameters to train, requiring more data and computational power.

• Overfitting Risk: More prone to fitting noise in the training data without proper regularization.

6.6. Perceptron Learning Algorithm


• Iterative Process: Adjust weights based on the errors made in predictions.

• Weight Update: Weights are updated using the rule $w \leftarrow w + \eta\,(y - \hat{y})\,x$, where $\eta$ is the
learning rate, $y$ is the actual output, and $\hat{y}$ is the predicted output.

• Convergence: The algorithm iterates until it reaches a solution where the errors are minimized
(convergence is guaranteed only for linearly separable data) or a maximum number of iterations is
reached, as in the sketch below.
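A hedged sketch of the full learning loop, applying the update rule above once per training example (the toy AND dataset and the hyperparameters are illustrative assumptions):

```python
import numpy as np

def train_perceptron(X, y, eta=0.1, max_epochs=100):
    """Perceptron learning: w <- w + eta * (y - y_hat) * x per example."""
    w = np.zeros(X.shape[1])
    b = 0.0
    for _ in range(max_epochs):
        errors = 0
        for xi, yi in zip(X, y):
            y_hat = 1 if np.dot(w, xi) + b > 0 else 0
            update = eta * (yi - y_hat)
            w += update * xi
            b += update
            errors += int(update != 0.0)
        if errors == 0:   # converged: a full pass with no mistakes
            break
    return w, b

# Toy linearly separable problem: logical AND
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
y = np.array([0, 0, 0, 1])
w, b = train_perceptron(X, y)
```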

6.7. Multi-Layer Perceptron Model


• Forward Propagation: Inputs are passed forward through the network, layer by layer, until the output
layer.

• Backpropagation Algorithm: Used to update the weights in the network by propagating the error
back through the network, adjusting weights to minimize the error.

6.7.1. Forward Propagation


Forward propagation is a fundamental concept in neural networks, particularly in the context of training deep
learning models. It refers to the process by which inputs are passed through a network to generate outputs.

The inputs are processed layer by layer, with each layer applying weights, biases, and typically a non-linear
activation function to the inputs before passing them to the next layer.
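A minimal sketch of forward propagation through a two-layer network (the layer sizes, random weights, and sigmoid activation are illustrative assumptions):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def forward(x, params):
    """Propagate the input layer by layer: affine map, then activation."""
    W1, b1, W2, b2 = params
    h = sigmoid(W1 @ x + b1)   # hidden layer: weights, bias, non-linearity
    y = sigmoid(W2 @ h + b2)   # output layer
    return y

# Illustrative shapes: 3 inputs -> 4 hidden units -> 1 output
rng = np.random.default_rng(0)
params = (rng.normal(size=(4, 3)), np.zeros(4),
          rng.normal(size=(1, 4)), np.zeros(1))
print(forward(np.array([0.5, -1.0, 2.0]), params))
```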

6.7.1.1. Types of Forward Propagation


1. Standard Forward Propagation: Used in most feedforward neural networks, where each layer's
output is calculated from only the previous layer's output.

2. Convolutional Forward Propagation: Employed in Convolutional Neural Networks (CNNs), where
filters are applied to local regions of the input data. This is effective for data that has a grid-like
topology, such as images.

3. Recurrent Forward Propagation: Utilized in Recurrent Neural Networks (RNNs), which involves
loops in the network that allow information to persist. In this case, the output from the network can be
fed back into the network as part of the input for subsequent steps, which is useful for sequence
prediction tasks like language modeling.

6.7.1.2. Areas of Application


• Image Recognition: Neural networks using forward propagation have been tremendously successful
in tasks like object detection and face recognition, where CNNs can identify and classify objects within
images effectively.

• Natural Language Processing (NLP): Forward propagation in models like RNNs and Transformers
enables significant advancements in understanding and generating human language, applicable in
machine translation, sentiment analysis, and chatbots.

• Financial Forecasting: Deep learning models predict stock movements, evaluate portfolio risk, and
automate trading decisions by analyzing vast amounts of financial data through forward propagation.

• Healthcare: Used in diagnostic systems, for example in radiology, to interpret complex medical images
and predict diseases from patterns that are not apparent to the human eye.

6.7.1.3. Case Study: Image Classification


Scenario:

A simple case study involves using a small Convolutional Neural Network (CNN) to classify images of
handwritten digits from the MNIST dataset.

Process:

1. Input Layer: The 28x28-pixel grayscale image of a digit is fed into the network.

2. Convolutional Layers: Several filters are applied to detect low-level features such as edges and
curves.

3. Pooling Layers: Reduce the spatial size of the representation, decreasing the number of parameters
and computation in the network.
4. Fully Connected Layers: Higher-level reasoning is performed on the features extracted by the
convolutional and pooling layers.

5. Output Layer: Consists of 10 neurons (for digits 0 through 9), where the softmax activation function
is applied to classify the input image into one of the 10 digit classes.

Outcome:

Each layer's output in the CNN is computed based on the outputs from the previous layer, effectively using
forward propagation to transform the raw pixel values into class probabilities. The model learns to recognize
patterns corresponding to each digit through training, and the forward propagation allows it to predict the digit
represented in new images.
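A hedged sketch of such a network in Keras; the exact filter counts and layer sizes below are illustrative assumptions, not the precise architecture the case study prescribes:

```python
from tensorflow import keras
from tensorflow.keras import layers

# Illustrative MNIST-style CNN: conv -> pool -> dense -> softmax
model = keras.Sequential([
    layers.Input(shape=(28, 28, 1)),                     # 28x28 grayscale digit
    layers.Conv2D(8, kernel_size=3, activation="relu"),  # low-level features
    layers.MaxPooling2D(pool_size=2),                    # spatial downsampling
    layers.Flatten(),
    layers.Dense(64, activation="relu"),                 # higher-level reasoning
    layers.Dense(10, activation="softmax"),              # one neuron per digit
])
model.summary()   # a forward pass maps raw pixels to 10 class probabilities
```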

Forward propagation is the backbone of data flow in neural networks, enabling the practical application of
deep learning to a wide range of industries and problems. By understanding and optimizing this process,
significant advancements can be made in artificial intelligence applications.

6.8. Backpropagation Algorithm


Backward propagation, commonly referred to as backpropagation, is a fundamental algorithm used for training
artificial neural networks. This method calculates the gradient of the loss function of a neural network with
respect to its weights by applying the chain rule of calculus, allowing efficient optimization of complex neural
network architectures.

6.8.1. Mechanism of Backward Propagation


1. Forward Pass: Initially, inputs are passed through the network (forward propagation) to generate
outputs and subsequently the loss, which measures the difference between the predicted output and the
true label.

2. Backward Pass: In this stage, the gradient of the loss function with respect to each weight is computed,
starting from the output layer and moving backward through the network. This involves:

• Calculating the partial derivatives of the loss with respect to each weight via the chain rule.

• Propagating these derivatives back through the network, layer by layer.

• Updating the gradients at each layer based on the output from the subsequent layer.
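A hedged sketch of one forward/backward step for a single-hidden-layer network, assuming sigmoid activations and a squared-error loss (both illustrative choices):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def backprop_step(x, t, W1, b1, W2, b2, eta=0.1):
    """Forward pass, chain-rule gradients, then a gradient-descent update."""
    # Forward pass
    h = sigmoid(W1 @ x + b1)
    y = sigmoid(W2 @ h + b2)
    # Backward pass for loss L = 0.5 * (y - t)^2
    delta2 = (y - t) * y * (1 - y)           # output-layer error signal
    delta1 = (W2.T @ delta2) * h * (1 - h)   # propagated back via chain rule
    # Weight updates
    W2 -= eta * np.outer(delta2, h)
    b2 -= eta * delta2
    W1 -= eta * np.outer(delta1, x)
    b1 -= eta * delta1
    return W1, b1, W2, b2
```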

6.8.2. Types of Backward Propagation


• Standard Backpropagation: Used in standard feedforward neural networks, where gradients are
calculated for each layer sequentially in reverse order from the last to the first layer.

• Stochastic Backpropagation: Involves updating weights incrementally after each training example,
which is characteristic of Stochastic Gradient Descent.

• Batch Backpropagation: Computes gradients for a batch of data before updating the weights,
commonly used to stabilize the updates in training. Both schedules are contrasted in the sketch below.
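A brief sketch contrasting the two update schedules, using an illustrative squared-error gradient for a linear model (the `grad` helper is our placeholder, not a standard API):

```python
import numpy as np

def grad(w, x, y):
    """Illustrative gradient of L = 0.5 * (w.x - y)^2 with respect to w."""
    return (np.dot(w, x) - y) * x

def stochastic_updates(w, X, Y, eta=0.01):
    """Stochastic backpropagation: update after every single example."""
    for x, y in zip(X, Y):
        w = w - eta * grad(w, x, y)
    return w

def batch_updates(w, X, Y, eta=0.01):
    """Batch backpropagation: one update from the averaged gradient."""
    g = np.mean([grad(w, x, y) for x, y in zip(X, Y)], axis=0)
    return w - eta * g
```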

6.8.3. Areas of Application


• Deep Learning: Fundamental to training deep neural networks for complex tasks like image and
speech recognition, natural language processing, and autonomous driving.

• Optimization Problems: Used in various scientific and engineering disciplines to optimize functions
and solve complex equations that are modelled by neural networks.

• Reinforcement Learning: Employed to optimize policies in reinforcement learning by adjusting
network parameters that estimate value functions or model environment dynamics.

6.8.4. Case Study: Training a Neural Network for Image Classification


Scenario:

Training a Convolutional Neural Network (CNN) to classify images from a dataset that includes 60,000 32x32
colour images in 10 classes.

Process:

1. Forward Pass: Input images are passed through several convolutional, activation, and pooling layers
to extract features. The final output layer uses softmax to predict the class probabilities.

2. Loss Calculation: The loss is calculated using cross-entropy, comparing the predicted probabilities
with the actual class labels.

3. Backward Pass:

• Compute the gradient of the loss function with respect to the output layer's weights.

• Propagate these gradients back through the network, updating the weights in each
convolutional and fully connected layer to minimize the loss.

Outcome:

The model iteratively adjusts its weights based on the computed gradients, improving its accuracy over
multiple epochs. The network learns to recognize and differentiate between the various image categories
effectively.
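A hedged sketch of this training loop in Keras; the tiny architecture and hyperparameters are illustrative assumptions, and CIFAR-10 is used only because it matches the dataset description above:

```python
from tensorflow import keras
from tensorflow.keras import layers

# Illustrative CNN for 32x32 colour images in 10 classes
model = keras.Sequential([
    layers.Input(shape=(32, 32, 3)),
    layers.Conv2D(16, kernel_size=3, activation="relu"),
    layers.MaxPooling2D(pool_size=2),
    layers.Flatten(),
    layers.Dense(10, activation="softmax"),
])

# Cross-entropy loss; fit() runs forward pass, loss, backward pass, update
model.compile(optimizer="sgd", loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])

(x_train, y_train), _ = keras.datasets.cifar10.load_data()
model.fit(x_train / 255.0, y_train, epochs=5, batch_size=64)
```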

Backward propagation is an essential mechanism in the training process of neural networks, enabling the
practical application of deep learning models across a diverse range of fields. By efficiently computing
gradients and updating model parameters, backward propagation helps improve model performance, making
neural networks more accurate and effective at tasks ranging from simple classification to complex
decision-making scenarios.


6.9. Non-Linear Regression


Non-linear regression is a form of regression analysis in which observational data is modeled by a function
which is a nonlinear combination of the model parameters and depends on one or more independent variables.
It provides a way to model the non-linear relationships often found in real-world data.

6.9.1. Types of Non-Linear Regression


1. Polynomial Regression: Models data using polynomial functions of varying degrees. Commonly used
for its simplicity in representing non-linear trends; a sketch follows this list.

2. Logistic Regression: Used for binary classification tasks where the dependent variable is categorical.

3. Exponential Regression: Useful when data rises or falls at increasingly higher rates over time.

4. Log-Linear Regression: Applied when the rate of change is constant in relative (logarithmic) terms,
suitable for modeling biological growth phenomena or decay processes.
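A minimal sketch of the first type, polynomial regression, with synthetic data (the generating function, noise level, and polynomial degree are illustrative assumptions):

```python
import numpy as np

# Synthetic non-linear data: y = 2x^2 - 3x + 1 plus Gaussian noise
rng = np.random.default_rng(42)
x = np.linspace(-2, 2, 50)
y = 2 * x**2 - 3 * x + 1 + rng.normal(scale=0.5, size=x.shape)

# Least-squares fit of a degree-2 polynomial
coeffs = np.polyfit(x, y, deg=2)       # [a, b, c] for a*x^2 + b*x + c
y_hat = np.polyval(coeffs, x)

print("estimated coefficients:", coeffs)   # expect roughly [2, -3, 1]
```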

6.9.2. Uses of Non-Linear Regression


• Ecological and Biological Modeling: Useful in growth models where growth accelerates or
decelerates in a non-linear manner.

• Economic Data Analysis: Models complex relationships between economic indicators.

• Engineering: Used in signal processing and control systems where system behavior is inherently non-
linear.

• Medical Research: Helps in dose-response models and modeling of biological systems.

6.9.3. Advantages of Non-Linear Regression


• Flexibility: Can fit a wide range of curvilinear patterns, making it adaptable to various types of data.

• Accuracy: Provides a more accurate fit for non-linear data than linear models, reducing model bias.

• Insightful: Offers deep insights into the data's underlying mechanisms by fitting complex
relationships.

6.9.4. Disadvantages of Non-Linear Regression


• Complexity: More complex to understand and fit than linear regression, requiring more sophisticated
computational tools and techniques.

• Overfitting: More prone to overfitting, especially with high-degree polynomial regressions or
insufficient data points.

• Sensitivity: Parameter estimates can be highly sensitive to changes in model specifics, such as the
form of the non-linear function or the initial parameter estimates.

• Convergence Issues: Finding the best fit might be challenging as non-linear regression often relies on
iterative estimation techniques that may not converge.

Non-linear regression is a powerful analytical tool for modelling complex relationships between variables. It
excels in environments where the relationship between variables is not straightforward, offering a nuanced
understanding of data dynamics. However, its implementation requires careful consideration of model
selection, potential overfitting, and computational cost.

The Perceptron, in its various forms from simple to multi-layered, plays a critical role in the field of machine
learning. Understanding its workings, advantages, and limitations is essential for leveraging its capabilities in
practical applications and advancing to more sophisticated neural network architectures.
