Thanks to visit codestin.com
Credit goes to www.scribd.com

0% found this document useful (0 votes)
84 views18 pages

Chapter 1

Uploaded by

Quang Nhân Võ
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPT, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
84 views18 pages

Chapter 1

Uploaded by

Quang Nhân Võ
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPT, PDF, TXT or read online on Scribd
You are on page 1/ 18

Chapter 1

Overview on Machine
Learning
Assoc. Prof. Dr. Duong Tuan Anh
Faculty of Computer Science and Engineering,
HCMC Univ. of Technology
1/2023

1
Outline
 Pattern Recognition
 Machine learning
 Related fields of pattern recognition
 Classification

2
1. Pattern recognition
 Humans are good at recognizing objects (or patterns).
 We find it difficult to write a computer program to recognize
objects.
 Ex: By analyzing sample images of faces, a program should
be able to capture the pattern specific to a face and identify it
as a face. This is pattern recognition.
 There may be several classes and we have to classify a

particular face into a certain category (or class). This is


classification.
 In pattern recognition, the term pattern is used to include all
objects that we want to classify.
 A class is a collection of objects that are similar, but not
necessarily identical, and which is distinguishable from other
classes.

3
Figure 1.1 illustrates the difference between classification
where the classes are known beforehand and classification
where classes are created after inspecting the objects.

Figure 1.1 Classification when the classes are (a) known and b)
unknown beforehand.

4
Applications of pattern
recognitions
Interest in pattern recognition has grown due to emerging
applications. These include:
 Data mining
 Biometrics
 personal identification based on physical attributes of the face, iris,
fingerprints, etc.
 Machine vision
 automatic visual inspection in an assembly line
 Character recognition

automatic mail sorting by zip code, automatic check scanners at at ATMs.
 Document recognition
 recognize whether an email is spam or not, based on the message
header and content.
 Speech recognition
 helping handicapped patients to control machines.

5
 Computer-added diagnosis
 helping doctors make diagnostic decisions based on interpreting
medical data such as ultrasound images, electrocardiograms
(ECGs) or electroencephalograms (EEGs).
 Medical imaging
 classifying cells as malignant or benign based on magnetic
resonance imaging (MRI) scans, or classify different emotional
and cognitive states from the images of brain activity in functional
MRI
 Bioinformatics
 DNA sequence analysis to detect genes related to particular
diseases.
 Remote sensing
 land use and crop yield
 Astronomy
 classifying galaxies based on their shapes

6
2. Machine learning
 What learning is?
 Learning from experience
 Remembering, adapting and generalization
 Reasoning and logical deduction
 Machine learning is about making computers modify or adapt
their actions so that their actions get more accurate, where
accuracy is measured by how well the chosen actions reflect
the correct ones.
 Through learning, computers can recognize patterns correctly.
 So machine learning “helps” pattern recognition. Without
machine learning, computers can not achieve pattern
recognition.

7
Some paradigms of machine learning
 Supervised learning
 A training set of examples (patterns) with the correct responses
(class labels) are provided and based on this training set, the
algorithm generalizes to respond correctly to all possible
inputs. This is also called learning from examplars.
 Unsupervised learning
 Correct responses (class labels) are not provided, instead the
algorithm tries to identify similarities between the inputs so that
inputs that have something in common are categorized
together.
 The statistical approach to unsupervised learning is called

density estimation.
Note: pattern = example, instance, point, examplar

8
3. Related fields of pattern recognition

 The methods used for pattern recognition have been


developed in various fields, often independently.
 In statistics, going from particular observations to general
descriptions is called inference, learning (i.e., using training
data) is called estimating and classification is discriminant
analysis.
 In engineering, classification is called pattern recognition and
the approach is nonparametric and much more empirical.
 Other methods have their origins in machine learning, artificial
intelligence, artificial neural networks and data mining.
 We will incorporate techniques from these different fields to
give pattern recognition a more unified treatment.

9
Figure 1.2 Pattern recognition and
related fields

10
4. Classification
 Classification is often the final step in a general process. It
involves sorting objects into separate classes.
 In the case of an image, the acquired image is segmented
to isolate different objects from each other and from the
background, and the different objects are labeled.
 A typical pattern recognition system contains
 a sensor,
 a preprocessing mechanism (prior to segmentation),
 a feature extraction mechanism,
 a set of examples (training data) already classified and
 a classification algorithm.

11
Figure 1.3 A general classification system
The feature extraction step reduces the data by measuring certain
characteristic properties or features (such as size, shape, and texture)
of the labeled objects.
These features (i.e. values of these features) are then passed to a
classifier that evaluates the evidence presented and makes a decision
regarding to class each object should be assigned, depending on
whether a value of its feature fall inside or outside the tolerance of that
class.
This process is used, e.g., in classifying lesions as benign or malignant .
12
 The quality of the acquired image depends on the resolution,
sensitivity, bandwidth and signal-to-noise ratio of the imaging
system. Pre-processing step such as image enhancement and
image restoration may be required prior to segmentation, which is
often a challenging process.
 The quality of the features is related to their ability to discriminate
examples from different classes. Examples from the same class
should have similar feature values, while examples from different
classes should have different feature values, i.e. good features
should have small intra-class variations and large inter-class
variations (Figure 1.4).

Figure 1.4 A good


feature, x, measured for
two different classes
(blue and red) should
have intra-class
variations and large
inter-class variations

13
 The features can be continuous or categorical or non-metric (i.e.
qualitative). Categorical features can either be nominal (i.e.
unordered) or ordinal.
 Humans are adept at recognize objects within an image, using size,
shape, color, and other visual clues. They can do this despite the
fact that the objects may appear from different viewpoints and under
different lightning conditions, have different sizes, be rotated or
when the images are partially obstructed from view.

Fig. 1.5 Face recognition needs to be able to handle different expressions,


lighting and occlusions 14
 The goal of the classifier is to classify new data (test data) to
one of the classes, characterized by a decision region. The
borders between decision regions are called decision
boundaries.

Figure 1.6 Classes mapped as decision regions, with decision boundaries

15
Datasets for pattern
recognition
 There are a wide variety of datasets available on the
Internet.
 One popular site is the machine learning repository at
Univ. of California, Irvine:
http://archive.ics.uci.edu/ml/datasets/

 Large datasets used for data mining tasks are available at


kdd.ics.uci.edu
www.kdnuggets.com/datasets/

kdd: knowledge discovery and data mining

16
References
 G. Dougerty, 2013, Pattern Recognition and
Classification – An Introduction, Springer
 S. Marshland, 2015, Machine Learning – An
Algorithmic Approach, Chapman & Hall/CRC.
 M. N. Murty and V. S. Devi, 2011, Pattern
Recognition – An Algorithmic Approach, Springer.

17
Terminology
 Machine learning: học máy, pattern: mẫu, pattern recognition:
nhận dạng mẫu, feature: đặc trưng, feature extraction: rút trích
đặc trưng, classification: phân lớp, discriminant analysis: phân
tích phân biệt, training data: dữ liệu huấn luyện, test data: dữ liệu
thử, image segmentation: phân mảnh hình ảnh, decision
boundary: biên quyết định, decision region: vùng quyết định, pre-
processing: tiền xử lý, continuous feature: đặc trưng liên tục,
categorical feature: đặc trưng rời rạc, supervised learning: học có
giám sát, unsupervised learning: học không giám sát, intra-class
variation: độ sai biệt trong nội bộ lớp, inter-class variation: độ sai
biệt giữa các lớp, sensor: bộ cảm biến, artificial neural network:
mạng nơ ron nhân tạo, bioinformatics: sinh tin học, machine
vision: thị giác máy tính, biometrics: sinh trắc học, remote
sensing: viễn thám, density estimation: ước lượng mật độ,

18

You might also like