Unit-IV
Neural Network-Based Algorithms
Solving a classification problem using NNs involves several steps:
1. Determine the number of output nodes as well as what attributes should be used as input.
The number of hidden layers also must be decided. This step is performed by a domain
expert.
2. Determine weights (labels) and functions to be used for the graph.
3. For each tuple in the training set, propagate it through the network and compare the
output prediction with the actual result.
4. For each tuple ti ∈ D, propagate ti through the network and make the appropriate
classification.
Issues
• Attributes (number of source nodes): This is the same issue as determining which attributes
to use as splitting attributes.
• Number of hidden layers: In the simplest case, there is only one hidden layer.
• Number of hidden nodes: Choosing the best number of hidden nodes per hidden layer is one
of the most difficult problems when using NNs. There have been many empirical and theoretical
studies attempting to answer this question. The answer depends on the structure of the NN, types
of activation functions, training algorithm, and problem being solved. If too few hidden nodes
are used, the target function may not be learned (underfitting). If too many nodes are used,
overfitting may occur.
• Training data: As with DTs, with too much training data the NN may suffer from overfitting,
while too little and it may not be able to classify accurately enough.
• Number of sinks: Although it is usually assumed that the number of output nodes is the same
as the number of classes, this is not always the case.
• Interconnections: In the simplest case, each node is connected to all nodes in the next level.
• Weights: The weight assigned to an arc indicates the relative weight between those two nodes.
Initial weights are usually assumed to be small positive numbers and are assigned randomly.
• Activation functions: Many different types of activation functions can be used.
• Learning technique: The technique for adjusting the weights is called the learning technique.
Although many approaches can be used, the most common approach is some form of
backpropagation, which is discussed in a subsequent subsection.
• Stop: The learning may stop when all the training tuples have propagated through the network
or may be based on time or error rate.
Advantages to the use of NNs for classification:
• NNs are more robust than DTs because of the weights.
• The NN improves its performance by learning. This may continue even after the training set
has been applied.
• The use of NNs can be parallelized for better performance.
• There is a low error rate and thus a high degree of accuracy once the appropriate training has
been performed.
• NNs are more robust than DTs in noisy environments.
NNs disadvantages:
• NNs are difficult to understand. Nontechnical users may have difficulty understanding how
NNs work. While it is easy to explain decision trees, NNs are much harder to interpret.
• Generating rules from NNs is not straightforward.
• Input attribute values must be numeric.
• Testing the trained network is difficult.
• Verification of its behavior is likewise difficult.
• As with DTs, overfitting may result.
• The learning phase may fail to converge.
• NNs may be quite expensive to use.
Propagation
The normal approach used for processing is called propagation.
Given a tuple of values input to the NN, X = (x1, ..., xh), one value is input at each node in the
input layer. Then the summation and activation functions are applied at each node, with an
output value created for each output arc from that node. These values are in turn sent to the
subsequent nodes. This process continues until a tuple of output values, Y = (y1, ..., ym), is
produced from the nodes in the output layer.
The overall process of propagation is sketched below.
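The source's propagation algorithm is not reproduced here; the following is a minimal Python sketch of the same idea. The sigmoid activation, layer sizes, and weight values are illustrative assumptions, not values from the text.

```python
import math

def sigmoid(s):
    """Sigmoidal activation function applied to a node's weighted sum."""
    return 1.0 / (1.0 + math.exp(-s))

def propagate(x, layers):
    """Propagate input tuple x through the network.
    layers is a list of weight matrices; layers[k][j][i] is the weight on the
    arc from node i in layer k to node j in layer k+1."""
    values = list(x)
    for weights in layers:
        # summation function: weighted sum of incoming values at each node
        sums = [sum(w * v for w, v in zip(row, values)) for row in weights]
        # activation function applied to each sum
        values = [sigmoid(s) for s in sums]
    return values

# Illustrative network: 2 input nodes, 2 hidden nodes, 1 output node
hidden_weights = [[0.5, -0.2], [0.3, 0.8]]
output_weights = [[1.0, -1.0]]
print(propagate((1.0, 0.5), [hidden_weights, output_weights]))
```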
Example
Figure shows a very simple NN used to classify university students as short, medium, or tall.
Activation function f3 is associated with the short class, f4 is associated with the medium class,
and f5 is associated with the tall class. In this case, the weight of each arc from the height node
is 1, while the weight on each gender arc is 0. This implies that in this case the gender values are
ignored.
NN Supervised Learning
The NN starting state is modified based on feedback of its performance with the data in the
training set. This type of learning is referred to as supervised because it is known a priori what
the desired output should be. Unsupervised learning can also be performed if the output is not
known.
Supervised learning in an NN is the process of adjusting the arc weights based on its
performance with a tuple from the training set. The behavior of the training data is known a
priori and thus can be used to fine-tune the network for better behavior in future similar
situations.
Suppose the output from node i is yi but should be di. The error produced by a node in any layer
can be found by
|yi - di|
The mean squared error (MSE) at that node is found by
(yi - di)^2 / 2
The total MSE error over all m output nodes in the NN is
Σ i=1..m (yi - di)^2 / m
This formula could be expanded over all tuples in the training set to see the total error over all of
them.
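As a small illustration of these error measures, the sketch below computes the per-node error, the per-node MSE, and the total MSE over the output nodes; the output and desired values are made-up numbers.

```python
def node_error(y, d):
    """Absolute error |y - d| produced at a single node."""
    return abs(y - d)

def node_mse(y, d):
    """Mean squared error (y - d)^2 / 2 for a single output node."""
    return (y - d) ** 2 / 2

def total_mse(outputs, desired):
    """Total MSE over all m output nodes."""
    m = len(outputs)
    return sum((y - d) ** 2 for y, d in zip(outputs, desired)) / m

outputs = [0.8, 0.1, 0.3]   # actual outputs y_i (illustrative)
desired = [1.0, 0.0, 0.0]   # desired outputs d_i (illustrative)
print(node_error(0.8, 1.0), node_mse(0.8, 1.0), total_mse(outputs, desired))
```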
The Hebb and delta rules are approaches to change the weight on an input arc to a node based on
the knowledge that the output value from that node is incorrect. With both techniques, a learning
rule is used to modify the input weights.
The change in a weight using the Hebb rule is represented by the following rule:
∆w = c · x · y
where x is the input value on that arc and y is the output of the node.
Here c is a constant often called the learning rate.
A rule of thumb is that c = 1 / |# entries in training set|
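A minimal sketch of the Hebb and delta weight-update rules under these definitions; the weight, input, output, and desired values below are illustrative assumptions.

```python
def hebb_update(w, c, x, y):
    """Hebb rule: change the weight in proportion to input times output."""
    return w + c * x * y

def delta_update(w, c, x, y, d):
    """Delta rule: change the weight in proportion to input times the error (d - y)."""
    return w + c * x * (d - y)

c = 1 / 100               # learning rate: 1 / (number of training entries), per the rule of thumb
w = 0.4                   # current weight on the input arc (illustrative)
x, y, d = 0.9, 0.2, 1.0   # input value, actual output, desired output (illustrative)
print(hebb_update(w, c, x, y), delta_update(w, c, x, y, d))
```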
Backpropagation is a learning technique that adjusts weights in the NN by propagating weight
changes backward from the sink to the source nodes. Backpropagation is the most well-known
form of learning because it is easy to understand and generally applicable.
Figure shows the structure and use of one node, j, in a neural network graph.
The basic node structure is shown in part (a). Here the representative input arc has a weight of
w?j, where ? is used to show that the input to node j is coming from another node, shown here
as ?. Of course, there probably are multiple input arcs to a node. The output weight is similarly
labeled wj?.
During propagation, data values input at the input layer flow through the network, with final
values coming out of the network at the output layer. The propagation technique is shown in part
(b). The activation function fj is applied to all the input values and weights, with output values
resulting.
Weights are changed based on the changes that were made in weights in subsequent arcs. This
backward learning process is called backpropagation and is illustrated in Figure (c). Weight wj? is
modified to become wj? + ∆wj?. A learning rule is applied to this ∆wj? to determine the change at the
next higher level, ∆w?j.
ALGORITHM
The MSE is used to calculate the error. The last step of the algorithm uses gradient descent as
the technique to modify the weights in the graph. The basic idea of gradient descent is to find the
set of weights that minimizes the MSE.
Figure and Algorithm illustrate the concept.
The stated algorithm assumes only one hidden layer. More hidden layers would be handled in the
same manner with the error propagated backward.
Figure shows the structure we use to discuss the gradient descent algorithm.
Here node i is at the output layer and node j is at the hidden layer just before it; yi is the output
of i and yj is the output of j.
The learning function in the gradient descent technique is based on using the following value for
the weight change at the output layer:
∆wij = c · yj · (di - yi) · f'(Si)
Here the weight wij is that on the arc coming into i from j, and Si is the weighted sum of the
inputs to node i. Assuming a sigmoidal activation function in the output layer, f'(Si) = yi (1 - yi),
so the change becomes
∆wij = c · yj · (di - yi) · yi · (1 - yi)
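A minimal sketch of this output-layer update, assuming the sigmoidal activation above; the learning rate and node outputs are illustrative values.

```python
def output_weight_change(c, y_j, y_i, d_i):
    """Gradient descent change for the weight on the arc from hidden node j
    to output node i, assuming a sigmoidal activation at node i so that the
    derivative of the activation is y_i * (1 - y_i)."""
    return c * y_j * (d_i - y_i) * y_i * (1 - y_i)

c = 0.1        # learning rate (illustrative)
y_j = 0.6      # output of hidden node j
y_i = 0.2      # actual output of output node i
d_i = 1.0      # desired output of output node i
delta_w = output_weight_change(c, y_j, y_i, d_i)
print(delta_w)  # positive change: the weight is increased to push y_i toward d_i
```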
Radial Basis Function Networks
A radial function or a radial basis function (RBF) is a class of functions whose value decreases
(or increases) with the distance from a central point.
An RBF has a Gaussian shape, and an RBF network is typically an NN with three layers.
1. The input layer is used simply to input the data.
2. A Gaussian activation function is used at the hidden layer.
3. A linear activation function is used at the output layer.
The objective is to have the hidden nodes learn to respond only to a subset of the input,
namely, that where the Gaussian function is centered. This is usually accomplished via
supervised learning.
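A minimal sketch of such a three-layer RBF network; the Gaussian centers, the common width, and the output weights are illustrative assumptions.

```python
import math

def rbf_output(x, centers, width, out_weights):
    """Three-layer RBF network: the hidden layer applies a Gaussian activation
    centered at each hidden node's center; the output layer is linear."""
    hidden = [math.exp(-sum((xi - ci) ** 2 for xi, ci in zip(x, c)) / (2 * width ** 2))
              for c in centers]
    # linear activation at the output layer: weighted sum of hidden responses
    return [sum(w * h for w, h in zip(row, hidden)) for row in out_weights]

centers = [(0.0, 0.0), (1.0, 1.0)]   # Gaussian centers of the two hidden nodes
width = 0.5                          # common Gaussian width
out_weights = [[1.0, -1.0]]          # weights from hidden nodes to one output node
print(rbf_output((0.9, 1.1), centers, width, out_weights))
```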
Perceptrons
The simplest NN is called a perceptron.
A perceptron is a single neuron with multiple inputs and one output.
The original perceptron proposed the use of a step activation function, but it is more
common to see another type of function such as a sigmoidal function.
A simple perceptron can be used to classify into two classes.
o Using a unipolar activation function, an output of 1 would be used to classify into
one class,
o while an output of 0 would be used to place the tuple in the other class.
Here x1 is shown on the horizontal axis and x2 is shown on the vertical axis. The area of the
plane to the right of the line x2 = 3 - (3/2)x1 represents one class and the rest of the plane
represents the other class.
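A minimal sketch of such a perceptron. The weights 3 and 2 and the threshold 6 are one assumed choice that reproduces the boundary x2 = 3 - (3/2)x1; any positive scaling of the same line would serve equally well.

```python
def perceptron(x1, x2, w1=3.0, w2=2.0, threshold=6.0):
    """Single neuron with a unipolar step activation:
    outputs 1 for points on one side of the line w1*x1 + w2*x2 = threshold,
    and 0 for points on the other side."""
    s = w1 * x1 + w2 * x2               # summation function
    return 1 if s >= threshold else 0   # step activation

print(perceptron(3.0, 1.0))  # right of the line x2 = 3 - (3/2)x1 -> class 1
print(perceptron(0.0, 0.0))  # left of the line -> class 0
```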
RULE-BASED ALGORITHMS
One way to perform classification is to generate if-then rules that cover all cases.
Example
If 90 <= grade, then class = A
If 80 <= grade and grade < 90, then class = B
Definition
A classification rule, r = (a, c), consists of the if or antecedent, a, part and the then or
consequent portion, c. The antecedent contains a predicate that can be evaluated as true or false
against each tuple in the database.
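A small sketch of how rules r = (a, c) might be represented and applied to tuples; the antecedents mirror the grade example above.

```python
# Each rule is a pair (antecedent, consequent); the antecedent is a predicate
# that evaluates to True or False against a tuple.
rules = [
    (lambda t: 90 <= t["grade"], "A"),
    (lambda t: 80 <= t["grade"] < 90, "B"),
]

def classify(tuple_, rules):
    """Return the consequent of the first rule whose antecedent is satisfied."""
    for antecedent, consequent in rules:
        if antecedent(tuple_):
            return consequent
    return None  # no rule covers this tuple

print(classify({"grade": 85}, rules))  # -> "B"
```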
The differences between rules and trees:
• The tree has an implied order in which the splitting is performed. Rules have no order.
• A tree is created based on looking at all classes. When generating rules, only one class must be
examined at a time.
Generating Rules from a DT
The process to generate a rule from a DT is straightforward and is outlined in Algorithm
ALGORITHM
Input: T // Decision tree
Output: R // Rules
Gen algorithm:
// Illustrate simple approach to generating classification rules from a DT
R = ∅
for each path from root to a leaf in T do
a = True
for each non-leaf node do
a = a ∧ (label of node combined with label of incident outgoing arc)
c = label of leaf node
R = R ∪ {r = (a, c)}
This algorithm will generate a rule for each leaf node in the decision tree. All rules with the same
consequent could be combined together by ORing the antecedents of the simpler rules.
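A minimal sketch of this Gen procedure over a simple dictionary-based tree; the tree structure and its labels are illustrative assumptions, not any particular library's representation.

```python
# A decision tree as nested dictionaries: an internal node maps an attribute
# name to {arc_label: subtree}; a leaf is simply a class label.
tree = {"height": {"<1.7m": "short",
                   "1.7m-1.95m": "medium",
                   ">1.95m": "tall"}}

def gen_rules(node, antecedent=()):
    """Walk every root-to-leaf path, ANDing the (attribute, arc label) pairs
    along the path; each leaf yields one rule r = (antecedent, consequent)."""
    if not isinstance(node, dict):            # leaf node: emit a rule
        return [(antecedent, node)]
    rules = []
    (attribute, branches), = node.items()
    for arc_label, subtree in branches.items():
        rules += gen_rules(subtree, antecedent + ((attribute, arc_label),))
    return rules

for a, c in gen_rules(tree):
    print("IF", " AND ".join(f"{attr} {val}" for attr, val in a), "THEN class =", c)
```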
Generating Rules from a Neural Net
The source NN may still be used for classification; the derived rules can be used to verify or
interpret the network. The problem is that the rules do not explicitly exist. They are buried in the
structure of the graph itself.
The basic idea of the RX algorithm is to cluster output values with the associated hidden nodes
and input.
A major problem with rule extraction is the potential size of the extracted rule set. For
example, if you have a node with n inputs each having 5 values, there are 5^n different input
combinations to this one node alone.
To overcome this problem and that of having continuous ranges of output values from nodes, the
output values for both the hidden and output layers are first discretized.
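A small sketch of this discretization step; the three-level binning below is an illustrative choice, not the exact RX procedure.

```python
def discretize(value, boundaries=(0.33, 0.66)):
    """Map a continuous node output in [0, 1] to one of a small number of
    discrete levels, so rules can be stated over a finite set of values."""
    for level, bound in enumerate(boundaries):
        if value < bound:
            return level
    return len(boundaries)

hidden_outputs = [0.05, 0.48, 0.91]             # continuous activations (illustrative)
print([discretize(v) for v in hidden_outputs])  # -> [0, 1, 2]
```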
Generating Rules without a DT or NN
These techniques are sometimes called covering algorithms because they attempt to generate
rules that exactly cover a specific class.
Tree algorithms work in a top-down, divide-and-conquer fashion, but this need not be the case
for covering algorithms. They generate the best rule possible by optimizing the desired
classification probability.
Suppose we want to generate a rule to classify persons as tall. The basic format for the rule is then
If ? then class = tall
The objective for the covering algorithms is to replace the "?" in this statement with predicates
that can be used to obtain the "best" probability of being tall.
The basic idea of the 1R approach is to choose the best single attribute to perform the
classification based on the training data. "Best" is defined here by counting the number of errors.
As with ID3, 1R tends to choose attributes with a large number of values, leading to overfitting.
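A minimal sketch of the error-counting idea described above, in the style of 1R; the tiny training set is made up for illustration.

```python
from collections import Counter, defaultdict

# Training tuples: attribute values plus the known class (illustrative data)
data = [({"gender": "F", "height": "short_bucket"}, "short"),
        ({"gender": "F", "height": "tall_bucket"},  "tall"),
        ({"gender": "M", "height": "tall_bucket"},  "tall"),
        ({"gender": "M", "height": "short_bucket"}, "medium")]

def one_r(data, attributes):
    """For each attribute, predict the majority class per attribute value;
    pick the attribute whose rules make the fewest errors on the training data."""
    best_attr, best_rules, best_errors = None, None, None
    for attr in attributes:
        counts = defaultdict(Counter)
        for tup, cls in data:
            counts[tup[attr]][cls] += 1
        rules = {val: c.most_common(1)[0][0] for val, c in counts.items()}
        errors = sum(cls != rules[tup[attr]] for tup, cls in data)
        if best_errors is None or errors < best_errors:
            best_attr, best_rules, best_errors = attr, rules, errors
    return best_attr, best_rules, best_errors

print(one_r(data, ["gender", "height"]))  # height wins: it makes fewer errors
```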
Another approach to generating rules without first having a DT is called PRISM. PRISM generates
rules for each class by looking at the training data and adding rules that completely describe all
tuples in that class.
COMBINING TECHNIQUES
Given a classification problem, no one classification technique always yields the best results.
Therefore, there have been some proposals that look at combining techniques.
Two basic techniques can be used to accomplish this:
• A synthesis of approaches takes multiple techniques and blends them into a new approach.
Example: linear regression, to predict a future value for an attribute that is then used as input to a
classification NN. In this way the NN is used to predict a future classification value.
• Multiple independent approaches can be applied to a classification problem, each yielding its
own class prediction. This approach has been referred to as combination of multiple classifiers
( CMC).
The values are combined with a weighted linear combination of the individual classifiers' outputs.
Example
Two classifiers exist to classify tuples into two classes. A target tuple, X, needs to be classified.
Using a nearest neighbor approach, the 10 tuples closest to X are identified.
Figure shows the 10 tuples closest to X
Here the weights, wk, can be assigned by a user or learned based on the past accuracy of each
classifier. Another technique is to choose the classifier that has the best accuracy on a database
sample. This is referred to as dynamic classifier selection (DCS).
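A minimal sketch of the CMC weighted linear combination and of DCS selection; the per-class scores, weights, and accuracies below are illustrative values.

```python
def cmc_combine(predictions, weights):
    """Combination of multiple classifiers: weighted linear combination of the
    per-class scores produced by each classifier; return the winning class."""
    n_classes = len(predictions[0])
    combined = [sum(w * p[k] for w, p in zip(weights, predictions))
                for k in range(n_classes)]
    return combined.index(max(combined))

def dcs_select(accuracies):
    """Dynamic classifier selection: use the classifier with the best
    accuracy on a database sample."""
    return accuracies.index(max(accuracies))

# Two classifiers scoring tuple X for classes 0 and 1 (illustrative values)
predictions = [[0.7, 0.3], [0.4, 0.6]]
weights = [0.8, 0.2]                      # e.g., learned from past accuracy
print(cmc_combine(predictions, weights))  # -> class 0
print(dcs_select([0.92, 0.88]))           # -> classifier 0
```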