0% found this document useful (0 votes)

14 views62 pages

1 ML Overview

Uploaded by

luticia

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

14 views62 pages

1 ML Overview

Uploaded by

luticia

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 62

Statistical Learning

Machine Learning Overview

Outline
• What is machine learning?
• Supervised Learning
• Classi cation
• Regression
• Unsupervised Learning
• Clustering
• Reinforcement Learning
fi
Part I: What is machine learning?
What is machine learning?
• Arthur Samuel (1959): Machine learning is the eld of study that gives the
computer the ability to learn without being explicitly programmed.

fi
https://tung-dn.github.io/programming.html
What is machine learning?
• Arthur Samuel (1959): Machine learning is the eld of study that gives the
computer the ability to learn without being explicitly programmed.

• Tom Mitchell (1997): A computer program is said to learn from experience

E with respect to some class of tasks T and performance measure P, if
its performance at tasks in T as measured by P, improves with experience
E.

fi
Supervised
Learning

Taxonomy of ML
Unsupervised
Reinforcement
Learning
Learning
Part II: Supervised Learning
Example 1: Predict whether a user likes a song or not

model
Example 1: Predict whether a user likes a song or not

Intensity

User Sharon

Tempo
Example 1: Predict whether a user likes a song or not

Intensity

User Sharon

DisLike
Like

Relaxed Tempo Fast

Example 1: Predict whether a user likes a song or not

Intensity

User Sharon

DisLike
Like

Relaxed Tempo Fast

Example 1: Predict whether a user likes a song or not

Intensity New data

?
User Sharon

DisLike
Like

Relaxed Tempo Fast

Example 1: Predict whether a user likes a song or not

Intensity New data

User Sharon

DisLike
Like

Relaxed Tempo Fast

Example 2: Classify Images http://www.image-net.org/
Example 2: Classify Images

Experience/Data:
images with labels

indoor outdoor
Example 2: Classify Images
Label: outdoor

Label: indoor

Training data Test data

learning (i.e.,training) testing

performance
Label: outdoor

Label: indoor

Training data Test data

learning (i.e.,training) testing

performance
How to represent data?
input data
d
x∈ℝ Intensity

d: feature dimension
x
x1 Tempo
x=
x2 Intensity

There can be many features!

Relaxed Tempo Fast
How to represent data?

Label Intensity
y ∈ {0,1}
y=1

Where “supervision”
comes from y=0
Relaxed Tempo Fast
Represent various types of data
• Image
- Pixel values

• Bank account
- Credit rating, balance, # deposits in last day, week,
month, year, #withdrawals
Two Types of Supervised Learning Algorithms

Classification Regression
Example of regression: housing price prediction
Given: a dataset that contains samples
(x1, y2), (x2, y2), . . . , (xn, yn) Price

Task: if a residence has x squares

feet, predict the price?

Square feet
𝑛
Example of regression: housing price prediction
Given: a dataset that contains samples
(x1, y2), (x2, y2), (x3, y3), . . . , (xn, yn)

Task: if a residence has x squares

feet, predict the price?
y∈ℝ

Square feet
𝑛
Example of regression: housing price prediction

Input with more features (e.g., lot size)

x
(credit: stanford CS229)
Supervised Learning: More examples
x = raw pixels of the image y = bounding boxes

Russakovsky et al. 2015

Two Types of Supervised Learning Algorithms

Classification Regression

• the label is a discrete variable • the label is a continuous variable

y ∈ {1,2,3,...,K} y∈ℝ
Training Data for Supervised Learning

Training data is a collection of input instances to the

learning algorithm:

(x1, y2), (x2, y2), (x3, y3), . . . , (xn, yn)

input label

A training data is the “experience” given to a learning algorithm

Goal of Supervised Learning

Given training data

(x1, y2), (x2, y2), (x3, y3), . . . , (xn, yn)

Learn a function mapping f : X → Y, such that f(x) predicts

the label y on future data x (not in training data)
Goal of Supervised Learning

Training set error

n
1
∑
0-1 loss for classification ℓ = ( f(xi) ≠ yi)
• n i=1
n
1 2
∑
Squared loss for regression: ℓ = ( f(xi) − yi)
• n i=1
A learning algorithm optimizes the training objective

f* = arg min (x,y)ℓ( f(x), y) Details in upcoming

lectures :)
𝔼
Quiz Break
Q1-1: Which is true about feature vectors?

A. Feature vectors can have at most 10 dimensions

B. Feature vectors have only numeric values
C. The raw image can also be used as the feature vector
D. Text data don’t have feature vectors
Quiz Break
Q1-1: Which is true about feature vectors?

A. Feature vectors can have at most 10 dimensions

B. Feature vectors have only numeric values
C. The raw image can also be used as the feature vector
D. Text data don’t have feature vectors

A. Feature vectors can be in high dimen.

B. Some feature vectors can have other types of values like strings
D. Bag-of-words is a type of feature vector for text
Quiz Break
Q1-2: Which of the following is not a common task of supervised learning?

A. Object detection (predicting bounding box from raw images)

B. Classi cation
C. Regression
D. Dimensionality reduction
fi
Quiz Break
Q1-2: Which of the following is not a common task of supervised learning?

A. Object detection (predicting bounding box from raw images)

B. Classi cation
C. Regression
D. Dimensionality reduction
fi
Part II: Unsupervised Learning
(no teacher)
Unsupervised Learning
• Given: dataset contains no label x1, x2, . . . , xn
• Goal: discover interesting patterns and structures in the data
Unsupervised Learning
• Given: dataset contains no label x1, x2, . . . , xn
• Goal: discover interesting patterns and structures in the data

y=1
Intensity

y=0

Tempo
Unsupervised Learning
• Given: dataset contains no label x1, x2, . . . , xn
• Goal: discover interesting patterns and structures in the data

y=1
Intensity Intensity

y=0

Tempo Tempo
Clustering
• Given: dataset contains no label x1, x2, . . . , xn
• Output: divides the data into clusters such that there are
intra-cluster similarity and inter-cluster dissimilarity
Intensity

Tempo
Clustering

Clustering Irises using three di erent features

The colors represent clusters identi ed by the algorithm, not y’s provided as input
ff
fi
Clustering
• You probably have >1000 digital photos stored on your phone
• After this class you will be able to organize them better
(based on visual similarity)
Clustering Genes
Clustering Words with Similar Meanings

[Arora-Li-Liang-Ma-Risteski, TACL’17,18]
How do we perform clustering?
• Many clustering algorithms. We will look at the two most
frequently used ones:
• K-means clustering: we specify the desired number of
clusters, and use an iterative algorithm to find them
• Hierarchical clustering: we build a binary tree over the
dataset
K-means clustering
• Very popular clustering method

• Don’t confuse it with k-NN classifier

• Input: a dataset x1, x2, . . . , xn , and assume the number of

clusters k is given
K-means clustering
Step 1: Randomly picking 2 positions as initial cluster centers (not necessarily a data
point)

Intensity

Tempo
K-means clustering
Step 2: for each point x, determine its cluster: nd the closest center in Euclidean space

Intensity

Tempo
fi
K-means clustering
Step 3: update all cluster centers as the centroids

Intensity

Tempo
K-means clustering
Repeat step 2 & 3 until convergence

Intensity

Converged solution!

No labels required!

Tempo
K-means clustering: A demo
https://www.naftaliharris.com/blog/visualizing-k-means-clustering/
Hierarchical Clustering (more to follow next lecture)
Quiz Break
Q2-1: Which is true about machine learning?

A. The process doesn’t involve human inputs

B. The machine is given the training and test data for learning
C. In clustering, the training data also have labels for learning
D. Supervised learning involves labeled data
Quiz Break
Q2-1: Which is true about machine learning?

A. The process doesn’t involve human inputs

B. The machine is given the training and test data for learning
C. In clustering, the training data also have labels for learning
D. Supervised learning involves labeled data

A. The labels are human inputs

B. The machine should not have test data for learning
C. No labels available for clustering
Quiz Break
Q2-2: Which is true about unsupervised learning?

A. There are only 2 unsupervised learning algorithms

B. Kmeans clustering is a type of hierarchical clustering
C. Kmeans algorithm automatically determines the number of clusters k
D. Unsupervised learning is widely used in many applications
Part III: Reinforcement Learning
(Learn from reward)
Reinforcement Learning
• Given: an agent that can take actions and a reward function
specifying how good an action is.
• Goal: learn to choose actions that maximize future reward
total.

Google Deepmind
Reinforcement Learning Key Problems
1. Problem: actions may have delayed effects.
• Requires credit-assignment
2. Problem: maximal reward action is unknown
• Exploration-exploitation trade-off

“..the problem [exploration-exploitation]

was proposed [by British scientist] to be
dropped over Germany so that German
scientists could also waste their time on it.”

- Peter Whittle

Multi-armed Bandit
Today’s recap
• What is machine learning?
• Supervised Learning
• Classi cation
• Regression
• Unsupervised Learning
• Reinforcement Learning
fi
Thanks!

EHS Propaganda Poster Assignment
100% (2)
EHS Propaganda Poster Assignment
1 page
0457 Example Candidate Responses Paper 3 (For Examination From 2018)
78% (9)
0457 Example Candidate Responses Paper 3 (For Examination From 2018)
26 pages
Career Profile: Marie Anandhu
No ratings yet
Career Profile: Marie Anandhu
3 pages
Stuart Stress Adaptation Model
No ratings yet
Stuart Stress Adaptation Model
10 pages
Organizational Power & Politics Guide
No ratings yet
Organizational Power & Politics Guide
20 pages
Part IV DP - Final
100% (10)
Part IV DP - Final
3 pages
Fail Predicate in Prolog PDF
0% (1)
Fail Predicate in Prolog PDF
2 pages
Studen Worksheet of Research Methods
No ratings yet
Studen Worksheet of Research Methods
11 pages
Unit 1: Introduction To Short Vowels
No ratings yet
Unit 1: Introduction To Short Vowels
8 pages
Verbs See 2
No ratings yet
Verbs See 2
17 pages
I Want To Study String Theory. Where Do I Start - Quora
No ratings yet
I Want To Study String Theory. Where Do I Start - Quora
6 pages
Glocal Law School Glocal University, Saharanpur: Course Teacher Dr. Sonal Shukla
No ratings yet
Glocal Law School Glocal University, Saharanpur: Course Teacher Dr. Sonal Shukla
5 pages
Theory Building in Management
No ratings yet
Theory Building in Management
18 pages
Machine Learning Algorithms
No ratings yet
Machine Learning Algorithms
25 pages
Cooperation of The Eye
No ratings yet
Cooperation of The Eye
5 pages
Machine Learning
No ratings yet
Machine Learning
28 pages
Costala Advice PDF
No ratings yet
Costala Advice PDF
1 page
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
19 pages
Artificial Intelligence and Machine Learning Applications in Musculoskeletal PT
No ratings yet
Artificial Intelligence and Machine Learning Applications in Musculoskeletal PT
22 pages
Learners' Packet Template For Special Education, Alive/Madrasah and Iped
No ratings yet
Learners' Packet Template For Special Education, Alive/Madrasah and Iped
2 pages
Keefektifan Pemberian Terapi Guided Imagery Untuk Mengurangi Tingkat Kecemasan Pada Pasien Gangguan Jiwa Skizofrenia
No ratings yet
Keefektifan Pemberian Terapi Guided Imagery Untuk Mengurangi Tingkat Kecemasan Pada Pasien Gangguan Jiwa Skizofrenia
8 pages
Grade 3 Antonyms Lesson Plan
100% (2)
Grade 3 Antonyms Lesson Plan
3 pages
Models For Machine Learning: M. Tim Jones
No ratings yet
Models For Machine Learning: M. Tim Jones
10 pages
Intro to Machine Learning Concepts
No ratings yet
Intro to Machine Learning Concepts
70 pages
Machine Learning - Data
No ratings yet
Machine Learning - Data
11 pages
ML PDF
No ratings yet
ML PDF
17 pages
Module 1 Quizzes, TEFL FULL CIRCLE
No ratings yet
Module 1 Quizzes, TEFL FULL CIRCLE
4 pages
Machine Learning and Deep Learning Supervised Learning 1682688720
No ratings yet
Machine Learning and Deep Learning Supervised Learning 1682688720
121 pages
Present Simple, Past Simple and Future Simple
100% (1)
Present Simple, Past Simple and Future Simple
9 pages
Short 'A' Sounds Lesson Plan
No ratings yet
Short 'A' Sounds Lesson Plan
2 pages
ML Interview Questions
No ratings yet
ML Interview Questions
21 pages
Completing The Typology Evidence For Floating Segments From Ende
No ratings yet
Completing The Typology Evidence For Floating Segments From Ende
64 pages
Intro To ML
No ratings yet
Intro To ML
107 pages
Machine Learning File
No ratings yet
Machine Learning File
7 pages
Chapter 1 Introduction To Machine Learning
100% (1)
Chapter 1 Introduction To Machine Learning
19 pages
NeuralNetwork Learning
No ratings yet
NeuralNetwork Learning
22 pages
MSC Psych 2020 Syllabus Edited by Aachal
No ratings yet
MSC Psych 2020 Syllabus Edited by Aachal
279 pages
Machine Learning Types Explained
No ratings yet
Machine Learning Types Explained
26 pages
Portafolio A11 MR CESAR
No ratings yet
Portafolio A11 MR CESAR
22 pages
Lecture 2
No ratings yet
Lecture 2
22 pages
Machine Learning Types and Algorithms
No ratings yet
Machine Learning Types and Algorithms
11 pages
Machine Learning and Web Scraping Lesson02
No ratings yet
Machine Learning and Web Scraping Lesson02
29 pages
Machine Learning Section4 Ebook v03
No ratings yet
Machine Learning Section4 Ebook v03
20 pages
Machine Learning Fundamentals
No ratings yet
Machine Learning Fundamentals
19 pages
Grade-8-TOS 1st Q
No ratings yet
Grade-8-TOS 1st Q
2 pages
Module1 And2
No ratings yet
Module1 And2
122 pages
Machine Learning for Beginners
No ratings yet
Machine Learning for Beginners
27 pages
Basic Notes
No ratings yet
Basic Notes
26 pages
NLP Chapter 2
No ratings yet
NLP Chapter 2
79 pages
AI Chapter 5
No ratings yet
AI Chapter 5
31 pages
3 Introduction To Machine Learning
No ratings yet
3 Introduction To Machine Learning
21 pages
Machine Learning Basics for Beginners
No ratings yet
Machine Learning Basics for Beginners
122 pages
Machine Learning Concepts Guide
No ratings yet
Machine Learning Concepts Guide
122 pages
Unsupervised Lec
No ratings yet
Unsupervised Lec
12 pages
The Johari Window - Building Self-Awareness and Trust
No ratings yet
The Johari Window - Building Self-Awareness and Trust
15 pages
Machine Learning-Lecture 01
No ratings yet
Machine Learning-Lecture 01
28 pages
Classification of Machine Learning
No ratings yet
Classification of Machine Learning
73 pages
Machine Learning Is The Branch of
No ratings yet
Machine Learning Is The Branch of
12 pages
ML Basics Theory
No ratings yet
ML Basics Theory
16 pages
Introduction of Machine Learning
No ratings yet
Introduction of Machine Learning
9 pages
chp5 (14) Fam
No ratings yet
chp5 (14) Fam
13 pages
W9 ML Overview NRG
No ratings yet
W9 ML Overview NRG
21 pages
Ashley Foster ELA Lesson Plan 2
No ratings yet
Ashley Foster ELA Lesson Plan 2
9 pages
Machine Learning
No ratings yet
Machine Learning
56 pages
Threeofakind B Eds157 A2
No ratings yet
Threeofakind B Eds157 A2
51 pages
Supervised Unsupervised Reinforcement
No ratings yet
Supervised Unsupervised Reinforcement
39 pages
Machine Learning - Introduction
No ratings yet
Machine Learning - Introduction
73 pages
Collaborative Transformative Learning Cyberspace Report
No ratings yet
Collaborative Transformative Learning Cyberspace Report
4 pages
Machine Learning Interview Questions
No ratings yet
Machine Learning Interview Questions
20 pages
Meta Motion Fitness Tracker 241109 213742 (1) Removed
No ratings yet
Meta Motion Fitness Tracker 241109 213742 (1) Removed
20 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
17 pages
DM Chapter 0
No ratings yet
DM Chapter 0
4 pages
Unit-5 Machine Learning
No ratings yet
Unit-5 Machine Learning
25 pages
LKSK ML typesToStudents
No ratings yet
LKSK ML typesToStudents
18 pages
Session 3 Types of Machine Learning
No ratings yet
Session 3 Types of Machine Learning
22 pages
Introduction To AI
No ratings yet
Introduction To AI
51 pages
Basics of Machine Learning and Deep Learning
100% (1)
Basics of Machine Learning and Deep Learning
49 pages
Unit 3 Material
No ratings yet
Unit 3 Material
8 pages
Module IV - Machine Learning
No ratings yet
Module IV - Machine Learning
53 pages
Week 09 Lesson 1 Intro Machine Learning 1 To 32
No ratings yet
Week 09 Lesson 1 Intro Machine Learning 1 To 32
61 pages
M Learning
No ratings yet
M Learning
11 pages
Lecture 03
No ratings yet
Lecture 03
28 pages
Unit 3 and Unit 4 Notes - Data Science - III BCA 2
No ratings yet
Unit 3 and Unit 4 Notes - Data Science - III BCA 2
27 pages
Ml-Unit 1
No ratings yet
Ml-Unit 1
53 pages
Machine Learning
No ratings yet
Machine Learning
33 pages
ML Day2
No ratings yet
ML Day2
31 pages
Introduction To ML
No ratings yet
Introduction To ML
46 pages
ML Assignment 1
No ratings yet
ML Assignment 1
12 pages
Financial Machine Learning-Unit-1: Dr. J.Dhanalakshmi
No ratings yet
Financial Machine Learning-Unit-1: Dr. J.Dhanalakshmi
70 pages

1 ML Overview

Uploaded by

1 ML Overview

Uploaded by

Statistical Learning

Machine Learning Overview

• Tom Mitchell (1997): A computer program is said to learn from experience

Relaxed Tempo Fast

Relaxed Tempo Fast

Intensity New data

Relaxed Tempo Fast

Intensity New data

Relaxed Tempo Fast

Training data Test data

learning (i.e.,training) testing

Training data Test data

learning (i.e.,training) testing

There can be many features!

Task: if a residence has x squares

Task: if a residence has x squares

Input with more features (e.g., lot size)

Russakovsky et al. 2015

• the label is a discrete variable • the label is a continuous variable

Training data is a collection of input instances to the

(x1, y2), (x2, y2), (x3, y3), . . . , (xn, yn)

A training data is the “experience” given to a learning algorithm

Given training data

Learn a function mapping f : X → Y, such that f(x) predicts

Training set error

f* = arg min (x,y)ℓ( f(x), y) Details in upcoming

A. Feature vectors can have at most 10 dimensions

A. Feature vectors can have at most 10 dimensions

A. Feature vectors can be in high dimen.

A. Object detection (predicting bounding box from raw images)

A. Object detection (predicting bounding box from raw images)

Clustering Irises using three di erent features

• Don’t confuse it with k-NN classifier

• Input: a dataset x1, x2, . . . , xn , and assume the number of

A. The process doesn’t involve human inputs

A. The process doesn’t involve human inputs

A. The labels are human inputs

A. There are only 2 unsupervised learning algorithms

A. There are only 2 unsupervised learning algorithms

“..the problem [exploration-exploitation]

You might also like