Artificial Intelligence
Dr. Tran Quang Huy
OUTLINE
Chapter 1: Overview of AI
Chapter 2: Artificial Neural Networks
Chapter 3: Searching, Knowledge, Reasoning, and Planning
Chapter 4: Machine Learning
Week:     W1   W2   W3   W4   W5   W6   W7   W8   W9   W10
Activity: L    L    L    I-T  L    L    L    L    P    P
L: Lesson; I-T: In-class Test; P: Project
Objectives
1. Understand the basics of Neural Networks
2. Be able to move on to the more advanced Convolutional Neural Networks
Main contents
1. Artificial Neural Networks (ANN) and their relation to biology
2. The seminal Perceptron algorithm
3. Backpropagation
4. How to train Neural Networks using the Keras library
What are Neural Networks?
Question:
- How does your family dog recognize you, the owner, versus a complete and total stranger?
- How does a small child learn to recognize the difference between a school bus and a transit bus?
- How do our own brains subconsciously perform complex pattern recognition tasks each and every day without us even noticing?
What are Neural Networks?
Answer: Each of us contains a real-life biological neural network that is connected to our nervous system – this network is made up of a large number of interconnected neurons (nerve cells).
The word “neural” is the adjective form of “neuron”, and “network” denotes a graph-like structure; therefore, an “Artificial Neural Network” is a computation system that attempts to mimic (or at least, is inspired by) the neural connections in our nervous system. Artificial neural networks are also referred to as “neural networks” or “artificial neural systems”.
It is common to abbreviate Artificial Neural Network and refer to them as “ANN” or simply “NN”.
ANN
A simple neural network architecture. Inputs are presented to the network. Each connection carries a signal through the two hidden layers in the network. A final function computes the output class label.
Read the following and explain the meaning of each part in the figure and equations
Activation Functions
What is an activation function?
How does an activation function work?
Why do we use activation functions?
List some types of popular activation functions.
What is an activation function?
How does an activation function work?
Why do we use activation functions?
1. To introduce non-linearity into the model.
2. To keep the output within a specific range, such as [0, 1] or [-1, 1].
Popular Activation Functions
Find the equation of each activation function.
Activation Functions
Step function: f(net) = 1 if net > 0, otherwise 0
Sigmoid function: f(net) = 1 / (1 + e^(-net))
ReLU function: f(net) = max(0, net)
Activation Functions
Step function: f(net) = 1 if net > 0, otherwise 0
This is a very simple threshold function. If the weighted sum net > 0, we output 1; otherwise, we output 0. The output of f is always zero when net is less than or equal to zero; if net is greater than zero, f returns one.
What are the problems of the step function?
Activation Functions
Sigmoid function: f(net) = 1 / (1 + e^(-net))
The sigmoid function has historically been one of the most commonly used activation functions in neural networks.
Activation Functions
Sigmoid function:
The sigmoid function has historically been one of the most commonly used activation functions in neural networks. Why?
The primary advantage here is that the smoothness of the sigmoid function makes it easier to devise learning algorithms.
The sigmoid function is a better choice for learning than the simple step function since it:
1. Is continuous and differentiable everywhere.
2. Is symmetric about its midpoint (0, 0.5).
3. Asymptotically approaches its saturation values.
Activation Functions
Sigmoid function:
Disadvantages of the sigmoid function:
1. The outputs of the sigmoid are not zero-centered.
2. Saturated neurons essentially kill the gradient, since the gradient will be extremely small.
Activation Functions
Tanh function: f(net) = tanh(net) = (e^net - e^(-net)) / (e^net + e^(-net))
The hyperbolic tangent, or tanh (with a similar shape to the sigmoid), was also heavily used as an activation function up until the late 1990s. The tanh function is zero-centered, but the gradients are still killed when neurons become saturated.
Activation Functions
ReLU function: f(net) = max(0, net)
Rectified Linear Units (ReLUs) are also called “ramp functions” due to how they look when plotted.
Activation Functions
ReLU function:
Note: the function is zero for negative inputs but then linearly increases for positive values. The ReLU function is not saturable and is also extremely computationally efficient. The ReLU activation function tends to outperform both the sigmoid and tanh functions in nearly all applications.
Activation Functions
ReLU function:
As of 2015, ReLU is the most popular activation function used in deep learning. However, a problem arises when the input is exactly zero – the gradient cannot be taken there.
Activation Functions
ReLU6 function: f(net) = min(max(0, net), 6)
This function limits the problem of exploding gradients.
Activation Functions
Leaky ReLU function: f(net) = net if net > 0, otherwise α·net (for a small constant α > 0)
Leaky ReLUs allow for a small, non-zero gradient when the unit is not active.
Activation Functions
Leaky ReLU function:
The function is indeed allowed to take on a negative value, unlike traditional ReLUs which “clamp” the function output at zero.
Parametric ReLUs build on Leaky ReLUs and allow the parameter α to be learned on an activation-by-activation basis, implying that each node in the network can learn a different “coefficient of leakage” separate from the other nodes.
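To make the preceding definitions concrete, here is a minimal NumPy sketch of the activation functions discussed above; the leak coefficient alpha is an assumed constant here (in a Parametric ReLU it would be learned):

import numpy as np

# Plain NumPy versions of the activation functions discussed above.
def step(net): return np.where(net > 0, 1.0, 0.0)
def sigmoid(net): return 1.0 / (1.0 + np.exp(-net))
def tanh(net): return np.tanh(net)
def relu(net): return np.maximum(0.0, net)
def relu6(net): return np.minimum(np.maximum(0.0, net), 6.0)
def leaky_relu(net, alpha=0.01): return np.where(net > 0, net, alpha * net)

# Evaluate each function on a few sample net inputs.
net = np.array([-3.0, -0.5, 0.0, 0.5, 3.0])
for f in (step, sigmoid, tanh, relu, relu6, leaky_relu):
    print(f.__name__, f(net))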
Feedforward Network Architectures
In this type of architecture, a connection between nodes is only allowed from nodes in layer i to nodes in layer i+1 (hence the term, feedforward). There are no backward or inter-layer connections allowed.
When feedforward networks include feedback connections (output connections that feed back into the inputs), they are called recurrent neural networks.
Feedforward Network Architectures
This figure is a 3-2-3-2 feedforward network.
Layer 0 contains 3 inputs, our xi values. These could be raw pixel intensities of an image or a feature vector extracted from the image.
Layers 1 and 2 are hidden layers containing 2 and 3 nodes, respectively.
Layer 3 is the output layer or the visible layer – this is where we obtain the overall output classification from our network. The output layer typically has as many nodes as class labels; one node for each potential output. For example, if we were to build an NN to classify handwritten digits, our output layer would consist of 10 nodes, one for each digit 0-9.
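A forward pass through this 3-2-3-2 network can be sketched in a few lines of NumPy; the weights and input below are made-up values purely for illustration, and sigmoid activations are an assumption:

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

rng = np.random.default_rng(42)
x = rng.random(3)                 # layer 0: 3 inputs (e.g., image features)
W1 = rng.normal(size=(3, 2))      # layer 0 -> layer 1 (2 hidden nodes)
W2 = rng.normal(size=(2, 3))      # layer 1 -> layer 2 (3 hidden nodes)
W3 = rng.normal(size=(3, 2))      # layer 2 -> layer 3 (2 output nodes)

a1 = sigmoid(x @ W1)              # first hidden layer activations
a2 = sigmoid(a1 @ W2)             # second hidden layer activations
out = sigmoid(a2 @ W3)            # output layer: one score per class
print("predicted class:", out.argmax())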
PERCEPTRON ALGORITHM
The Perceptron was introduced by Frank Rosenblatt in 1957. He proposed a Perceptron learning rule based on the original MCP neuron. A Perceptron is an algorithm for supervised learning of binary classifiers. This algorithm enables neurons to learn and to process elements in the training set one at a time.
https://www.javatpoint.com/perceptron-in-machine-learning
TYPES OF PERCEPTRON
1. Single layer (a): A single-layer perceptron can learn only linearly separable patterns.
2. Multilayer (b): Multilayer perceptrons, built from two or more layers, have greater processing power.
https://www.javatpoint.com/perceptron-in-machine-learning
TYPES OF PERCEPTRON
A single-layer perceptron model consists of a feed-forward network and also includes a threshold transfer function inside the model. The main objective of the single-layer perceptron model is to analyze linearly separable objects with binary outcomes.
A single-layer perceptron model does not contain any recorded data, so it begins with randomly allocated values for the weight parameters. It then sums up all the weighted inputs. If the total sum is more than a pre-determined value, the model is activated and shows the output value as +1.
If the outcome matches the pre-determined (threshold) value, the performance of this model is considered satisfactory, and the weights are left unchanged. However, this model runs into discrepancies when multiple input values are fed into it; hence, to reach the desired output and minimize errors, some changes to the weights are necessary.
TYPES OF PERCEPTRON
The multi-layer perceptron model is also known as the Backpropagation algorithm, which executes in two stages as follows:
• Forward Stage: Activations start from the input layer in the forward stage and terminate on the output layer.
• Backward Stage: In the backward stage, weight and bias values are modified as per the model's requirement: the error between the actual and desired output is propagated backward, starting at the output layer and ending at the input layer.
A compact numerical sketch of these two stages is given below.
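Here is a minimal NumPy sketch of those two stages on a tiny 2-2-1 network; the network size, input, target, and learning rate are all illustrative assumptions:

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

rng = np.random.default_rng(0)
W1, b1 = rng.normal(size=(2, 2)), np.zeros(2)    # input -> hidden
W2, b2 = rng.normal(size=(2, 1)), np.zeros(1)    # hidden -> output
x = np.array([1.0, -1.0])                        # one training input (assumed)
t = np.array([1.0])                              # desired output (assumed)
lr = 0.5                                         # learning rate (assumed)

for _ in range(100):
    # Forward stage: activations flow from the input layer to the output layer.
    h = sigmoid(x @ W1 + b1)
    y = sigmoid(h @ W2 + b2)

    # Backward stage: the output error is propagated back toward the input
    # layer, and weights and biases are adjusted along the way.
    delta_out = (y - t) * y * (1 - y)               # output-layer error term
    delta_hid = (delta_out @ W2.T) * h * (1 - h)    # hidden-layer error term
    W2 -= lr * np.outer(h, delta_out)
    b2 -= lr * delta_out
    W1 -= lr * np.outer(x, delta_hid)
    b1 -= lr * delta_hid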
TYPES OF PERCEPTRON
Advantages of the Multi-Layer Perceptron:
• A multi-layered perceptron model can be used to solve complex non-linear problems.
• It works well with both small and large input data.
• It helps us to obtain quick predictions after the training.
• It helps to obtain the same accuracy ratio with large as well as small data.
Disadvantages of the Multi-Layer Perceptron:
• Computations are difficult and time-consuming.
• It is difficult to determine how much each independent variable affects the dependent variable.
• The model's functioning depends on the quality of the training.
Basic Components of Perceptron
Frank Rosenblatt invented the perceptron model as a binary classifier, which contains three main components:
• Input Nodes or Input Layer
• Weight and Bias
• Activation Function
https://www.javatpoint.com/perceptron-in-machine-learning
Basic Components of Perceptron
Types of Activation functions:
https://www.javatpoint.com/perceptron-in-machine-learning
How does the Perceptron work?
In Machine Learning, the Perceptron is considered a single-layer neural network that consists of four main parameters: input values (input nodes), weights and bias, net sum, and an activation function.
The perceptron model begins with the multiplication of all input values and their weights, then adds these values together to create the weighted sum. This weighted sum is then applied to the activation function ‘f’ to obtain the desired output. This activation function is also known as the step function and is represented by ‘f’.
Exercise: Write the final equation based on this information.
https://www.javatpoint.com/perceptron-in-machine-learning
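Restating that description as a single equation (with n inputs, and a bias term b, as listed among the perceptron's components above):

output = f(w1*x1 + w2*x2 + ... + wn*xn + b)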
How does the Perceptron work?
For example, suppose x1 = 2, x2 = 3, x3 = 1, the weights wn are certain numbers in the range [0, 1], and the step function is used. Estimate the output.
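The exercise leaves the weights unspecified, so the values below are one illustrative choice in [0, 1], not the intended answer; the bias is assumed to be 0 since none is given:

def step(net):
    return 1 if net > 0 else 0

x = [2, 3, 1]
w = [0.5, 0.2, 0.9]    # assumed weights in [0, 1]
b = 0                  # assumed bias
net = sum(wi * xi for wi, xi in zip(w, x)) + b   # 0.5*2 + 0.2*3 + 0.9*1 = 2.5
print(step(net))       # net = 2.5 > 0, so the output is 1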
Problem 1: The input to a single-input neuron is 2.0, its weight is 2.3 and its bias is -3.
i. What is the net input to the transfer function?
ii. What is the neuron output?
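A quick sketch of part i; part ii cannot be computed until a transfer function is specified, which Problem 2 works through:

# Problem 1, part i: the net input is n = w * p + b.
p, w, b = 2.0, 2.3, -3.0
n = w * p + b          # 2.3 * 2.0 - 3 = 1.6
print(n)               # 1.6
# Part ii: the output a = f(n) depends on the (unspecified) transfer function f.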
Problem 2: The input to a single-input neuron is 2.0, its weight is 2.3 and its bias is -3.
What is the output of the neuron if it has the following transfer functions?
i. Hard limit
ii. Linear
iii. Log-sigmoid
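Using the net input n = 1.6 from Problem 1, a short sketch of the three outputs (taking the hard limit as 1 for n >= 0 and 0 otherwise):

import math

n = 2.3 * 2.0 - 3.0                 # net input = 1.6
hardlim = 1 if n >= 0 else 0        # i.   hard limit -> 1
linear = n                          # ii.  linear -> 1.6
logsig = 1 / (1 + math.exp(-n))     # iii. log-sigmoid -> ~0.832
print(hardlim, linear, round(logsig, 3))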
Problem 3:
Given a two-input neuron with the following parameters: b = 1.2, W = [3 2], and p = [-5 6]^T, calculate the neuron output for the following transfer functions:
i. A symmetrical hard limit transfer function
ii. A saturating linear transfer function
iii. A hyperbolic tangent sigmoid (tansig) transfer function
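A short sketch of Problem 3, using the usual conventions for these transfer functions (symmetrical hard limit: outputs ±1; saturating linear: clamps to [0, 1]):

import math

W = [3, 2]
p = [-5, 6]
b = 1.2
n = sum(wi * pi for wi, pi in zip(W, p)) + b   # 3*(-5) + 2*6 + 1.2 = -1.8

hardlims = 1 if n >= 0 else -1     # i.   symmetrical hard limit -> -1
satlin = min(max(n, 0.0), 1.0)     # ii.  saturating linear -> 0
tansig = math.tanh(n)              # iii. tansig -> ~-0.947
print(n, hardlims, satlin, round(tansig, 3))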
An illustrative example
There is a conveyor belt on which fruit is loaded. This conveyor passes through a set of sensors, which measure three properties of the fruit: shape, texture, and weight.
Property   Value = 1    Value = -1
Shape      round        elliptical
Texture    smooth       rough
Weight     > 1 pound    <= 1 pound
The three sensor outputs will then be input to a neural network. The purpose of the network is to decide which kind of fruit is on the conveyor. Let’s assume that there are only two kinds of fruit on the conveyor: apples and oranges.
An illustrative example
Apply the following perceptron model to the previous problem in the case of two inputs.
An illustrative example
If w1,1 = -1 and w1,2 = 1, find a.
An illustrative example
Therefore, if the inner product of the weight matrix (a single row vector in this case) with the input vector is greater than or equal to -b, the output will be 1. If the inner product of the weight vector and the input is less than -b, the output will be -1.
This divides the input space into two parts. The figure illustrates this for the case where b = -1. The blue line in the figure represents all points for which the net input is equal to 0:
n = [-1 1]p - 1 = 0
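A quick numerical check of this boundary, assuming the symmetrical hard limit convention a = hardlims(n), with hardlims(n) = 1 for n >= 0 and -1 otherwise:

def hardlims(n):
    return 1 if n >= 0 else -1

W = [-1, 1]   # w1,1 = -1, w1,2 = 1
b = -1
for p in ([1, 1], [1, -1], [-1, 1], [-1, -1]):   # sample +/-1 sensor readings
    n = W[0] * p[0] + W[1] * p[1] + b
    print(p, "->", hardlims(n))   # output is 1 exactly when Wp >= -b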
An illustrative example
The decision boundary between the categories is determined by the equation:
Wp + b = 0
Because the boundary must be linear, the single-layer perceptron can only be used to recognize patterns that are linearly separable.
An illustrative example
Apply the following perceptron model to the previous problem in the case of three inputs. Find a.
An illustrative example
We want to choose the bias and the elements of the weight matrix so that the perceptron will be able to distinguish between apples and oranges. For example, we may want the output of the perceptron to be 1 when an apple is input and -1 when an orange is input.
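One choice that works is to key on the texture sensor alone. The ±1 prototype vectors below follow the classic version of this example and are an assumption, since the slides do not list them explicitly:

def hardlims(n):
    return 1 if n >= 0 else -1

W = [0, 1, 0]                 # only the texture sensor matters
b = 0
orange = [1, -1, -1]          # round, rough, <= 1 pound (assumed prototype)
apple = [1, 1, -1]            # round, smooth, <= 1 pound (assumed prototype)
for name, p in (("orange", orange), ("apple", apple)):
    n = sum(wi * pi for wi, pi in zip(W, p)) + b
    print(name, "->", hardlims(n))   # apple -> 1, orange -> -1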
AND, OR, and XOR Datasets
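A minimal construction of these three datasets as NumPy arrays; note that AND and OR are linearly separable while XOR is not, which is what makes XOR a useful test case:

import numpy as np

X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]])   # all 2-bit inputs
y_and = np.array([0, 0, 0, 1])
y_or = np.array([0, 1, 1, 1])
y_xor = np.array([0, 1, 1, 0])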
Perceptron Training Procedure and the Delta Rule (step 2c)
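In outline, the training procedure loops over the dataset and, whenever a prediction is wrong, nudges the weights by the delta rule w := w + α(target - prediction)x. A minimal sketch follows; the step labels are an assumption about the numbering the slide title refers to:

def step(net):
    return 1 if net > 0 else 0

def train(X, y, alpha=0.1, epochs=10):
    w = [0.0] * (len(X[0]) + 1)            # weights plus a bias entry
    for _ in range(epochs):                # step 1: loop over epochs
        for x, target in zip(X, y):        # step 2: loop over data points
            x = list(x) + [1.0]            # step 2a: append the bias input
            pred = step(sum(wi * xi for wi, xi in zip(w, x)))   # step 2b: predict
            if pred != target:             # step 2c: delta rule update
                w = [wi + alpha * (target - pred) * xi for wi, xi in zip(w, x)]
    return w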
Implementing the Perceptron in Python
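Since this section is about a Python implementation, here is a minimal NumPy sketch of a Perceptron class; the class layout, learning rate, and initialization scheme are one reasonable choice, not the only one:

import numpy as np

class Perceptron:
    def __init__(self, N, alpha=0.1):
        # N weights plus one bias entry, randomly initialized
        self.W = np.random.randn(N + 1) / np.sqrt(N)
        self.alpha = alpha

    def step(self, x):
        return np.where(x > 0, 1, 0)

    def fit(self, X, y, epochs=10):
        X = np.c_[X, np.ones((X.shape[0]))]   # bias trick: append a column of 1s
        for _ in range(epochs):
            for x, target in zip(X, y):
                p = self.step(np.dot(x, self.W))
                if p != target:               # update weights only on a mistake
                    self.W += self.alpha * (target - p) * x

    def predict(self, X):
        X = np.atleast_2d(X)
        X = np.c_[X, np.ones((X.shape[0]))]
        return self.step(np.dot(X, self.W))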
Evaluating the Perceptron on Bitwise Datasets
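Reusing the Perceptron class sketched above, the bitwise datasets can be evaluated as follows; AND and OR are learned, while XOR cannot be, since no single line separates its classes:

import numpy as np

# Assumes the Perceptron class from the previous section is in scope.
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]])
datasets = {
    "AND": np.array([0, 0, 0, 1]),
    "OR": np.array([0, 1, 1, 1]),
    "XOR": np.array([0, 1, 1, 0]),
}
for name, y in datasets.items():
    p = Perceptron(N=2, alpha=0.1)
    p.fit(X, y, epochs=20)
    print(name, "predictions:", p.predict(X), "targets:", y)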