Lec 01 - Intro MachineLearning

The document provides an overview of machine learning, defining it as a process where systems improve their performance through experience. It discusses the motivations for machine learning, including data availability and computational power, and outlines various applications such as spam filtering and data mining. Additionally, it covers key concepts like supervised and unsupervised learning, the importance of model evaluation, and the challenges of overfitting and dimensionality reduction.

Uploaded by

naqvihassan200

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

25 views59 pages

Lec 01 - Intro MachineLearning

Uploaded by

naqvihassan200

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 59

Machine Learning

• Herbert Alexander Simon:

“Learning is any process by
which a system improves
performance from experience.”
• “Machine Learning is concerned
with computer programs that
automatically improve their
performance through
Herbert Simon
experience. “ Turing Award 1975
Nobel Prize in Economics 1978
Why Machine Learning?
• Develop systems that can automatically adapt and customize
themselves to individual users.
– Personalized news or mail filter
• Discover new knowledge from large databases (data mining).
– Market basket analysis (e.g. diapers and beer)
• Ability to mimic human and replace certain monotonous tasks -
which require some intelligence.
• like recognizing handwritten characters
• Develop systems that are too difficult/expensive to construct
manually because they require specific detailed skills or
knowledge tuned to a specific task (knowledge engineering
bottleneck).

4
Why now?
• Flood of available data (especially with the
advent of the Internet)
• Increasing computational power
• Growing progress in available algorithms and
theory developed by researchers
• Increasing support from industries

5
ML Applications
The concept of learning in a ML system
• Learning = Improving with experience at some
task
– Improve over task T,
– With respect to performance measure, P
– Based on experience, E.

7
Motivating Example
Learning to Filter Spam
Example: Spam Filtering
Spam - is all email the user does not
want to receive and has not asked to
receive
T: Identify Spam Emails
P:
% of spam emails that were filtered
% of ham/ (non-spam) emails that
were incorrectly filtered-out
E: a database of emails that were
labelled by users
The Learning Process

Model Learning Model

Testing
The Learning Process in our Example

Model Learning Model

Testing

● Number of recipients
● Size of message
● Number of attachments
● Number of "re's" in the
subject line
Email Server …
Data Set
Target
Input Attributes
Attribute

Number of Email Country Customer Email Type

new Length (K) (IP) Type
Recipients
0 2 Germany Gold Ham
1 4 Germany Silver Ham
5 2 Nigeria Bronze Spam
Instances

2 4 Russia Bronze Spam

3 4 Germany Bronze Ham
0 1 USA Silver Ham
4 2 USA Silver Spam

Numeric Nominal Ordinal

Step 4: Model Learning

Learner Classifier
Database
Inducer Classification Model
Training Set
Induction Algorithm
Step 5: Model Testing

Database Learner
Training Set Inducer
Classifier
Induction Algorithm
Classification Model
Learning Algorithms
Error
Linear Classifiers

Email Length