0% found this document useful (0 votes)

26 views16 pages

Predictive Analytics Basics

The document discusses the fundamentals of predictive analytics in machine learning, focusing on data mining tasks categorized as descriptive and predictive. It explains the concepts of supervised and unsupervised learning, detailing the process of training models to make predictions based on labeled data. Key components of learning include representation, evaluation, and optimization, culminating in the development of a prediction rule from training data.

Uploaded by

Dhruv Jain

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

26 views16 pages

Predictive Analytics Basics

Uploaded by

Dhruv Jain

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 16

DS605: Fundamentals of Machine Learning

Lecture 07

Fundamentals of Predictive Analytics

[Representation, Evaluation, and Optimization]

Arpit Rana
5th August 2024
Data Mining Tasks

Disclaimer: Most images incorporated within the presentation slides

have been sourced from different sources on the web and ML books.
Data Mining Tasks

Data Mining Tasks

The actual data mining task is the semi-automatic or automatic
analysis of large quantities of data to extract interesting patterns.

Descriptive Predictive
Find human-interpretable patterns Use some variables to predict future
that describe the data. or unknown values of other variables.

● Cluster Analysis ● Regression

● Outlier Analysis ● Classiﬁcation
● Association Rule Mining
● Sequence Pattern Mining

In Machine Learning terminology, these In Machine Learning terminology, these

tasks are categorised as “Unsupervised tasks are categorised as “Supervised
Learning”. Learning”.
Data Mining Tasks

Data Mining Tasks

The actual data mining task is the semi-automatic or automatic
analysis of large quantities of data to extract interesting patterns.

Descriptive Predictive
Find human-interpretable patterns Use some variables to predict future
that describe the data. or unknown values of other variables.

● Cluster Analysis ● Regression

● Outlier Analysis ● Classiﬁcation
● Association Rule Mining
● Sequence Pattern Mining

In Machine Learning terminology, these In Machine Learning terminology, these

tasks are categorised as “Unsupervised tasks are categorised as “Supervised
Learning”. Learning”.
Machine Learning: Deﬁnition

Machine Learning is

● the science (and art) of programming computers

● so they can learn from data. AI

ML
– Aurelien Geron, Google
DL

Gen
-AI
Machine Learning: Example

A Spam Filter,
● a Machine Learning Program, given
○ examples of “spam” emails (e.g. ﬂagged by
users), and
○ examples of “ham” (i.e. regular) emails
● can learn to ﬂag spam
Machine Learning: A New Programming Paradigm

Data Rules Data Answers

Traditional
Programming Machine
Learning
(Symbolic AI)

Answers Rules

● A long list of complex (hard coded) rules ● Automatically learns which words or
phrases are good predictors of spam
● Keep writing new rules as the new
phrases are introduced by spammers
Machine Learning: Deﬁnition Revisited

Machine Learning is the training of a model from data that generalises a decision against a
performance measure.

● Training a model suggests training examples. Data Answers

● A model suggests state acquired through experience.

● Generalises a decision suggests the capability to make a

decision based on inputs and anticipating unseen inputs in
the future for which a decision will be required. Machine
Learning
● against a performance measure suggests a targeted need and
directed quality to the model being prepared.

Model
Learning = Representation + Evaluation + Optimization

Representation
Choosing a representation of the learner: the hypotheses
space or the model class — the set of models that it can
possibly learn.

Evaluation
Choosing an evaluation function (also called objective
function, utility function, loss function, or scoring
function) is needed to distinguish good classiﬁers from
bad ones.

Optimization
��
Choosing a method to search among the models in the
hypothesis space for the highest-scoring one.
Learning = Representation + Evaluation + Optimization

✔
✔ ✔ ✔
✔ ✔ ✔

✔ ✔
✔ ✔ ✔
✔
✔

✔
Supervised Learning
Problem Settings and Examples
Supervised Learning: A Formal Model

The learner’s input:

● Domain set
An arbitrary set (instance space), X, the set of objects (a.k.a. instances, domain points) we may wish to
label.

● Label set
A set of possible labels, Y. e.g., {0, 1}, {-1, 1}.

● Training data
S = ((x1, y1) . . . (xm, ym)) is ﬁnite sequence of pairs in X x Y, i.e., a sequence of labeled domain points.

The learner’s output:

● A prediction rule, h : X → Y , also called a predictor, a hypothesis, or a classiﬁer.
○ The learner returns h upon receiving the training sequence S.
○ It can be used to predict the label of new domain points (like the past ones).
Supervised Learning: A Formal Model

Data-generation Model:
● Let D be a probability distribution over X x Y, i.e., D is joint probability distribution over domain
points and labels.
○ A distribution Dx over unlabeled domain points (sometimes called marginal distribution),
○ A conditional probability over labels for each domain point, D((x, y) | x).

Independent and Identically Distributed (I.I.D.) Assumption

● Each domain point x has the same prior probability distribution (to be sampled):
P(xi) = P(xi+1) = P(xi+2) = · · · ,
and is independent of the previous examples:
P(xi) = P(xi | xi-1 , xi-2 , . . .) .
Supervised Learning: A Formal Model

More formally, the task of supervised learning can be deﬁned as -

Given a training set (S) of m example input-output pairs,

We call the output y(i) the

ground truth — the true answer
we are asking our model to
predict.

where each pair was generated by an unknown function y = f (x),

discover a function h that approximates the true function f .
Supervised Learning Process

Training
Phase

Inductive Learning: given

Learner a set of observations, it
Hypothesis
ﬁnds a function that is
Space 𝓗 (𝚪: S → h) applicable to the entire
instance space. .

Stationarity:
Follows the
Final Hypothesis or
Model (h) Test
same
Phase
distribution as A Test Instance Prediction
the training
instances.
Next lecture Choosing a Hypothesis Space
6th August 2024

Unit 1
No ratings yet
Unit 1
92 pages
Introduction to Machine Learning Basics
No ratings yet
Introduction to Machine Learning Basics
606 pages
Machine Learning Course Overview
No ratings yet
Machine Learning Course Overview
51 pages
Machine Learning: Professor Department of Computer Science & Engineering
No ratings yet
Machine Learning: Professor Department of Computer Science & Engineering
59 pages
UNIT 1ML Removed Removed
No ratings yet
UNIT 1ML Removed Removed
123 pages
Longman Academic Reading Series - 5 Teacher - S Manual
No ratings yet
Longman Academic Reading Series - 5 Teacher - S Manual
164 pages
Machine Learning Basics for Beginners
100% (5)
Machine Learning Basics for Beginners
134 pages
Selected T Chapter 3
No ratings yet
Selected T Chapter 3
62 pages
Presentation On ML
No ratings yet
Presentation On ML
469 pages
GML Slides 2024 04 29
No ratings yet
GML Slides 2024 04 29
206 pages
Ch7 Introduction To Machine Learning
No ratings yet
Ch7 Introduction To Machine Learning
29 pages
Unit 1 ML
No ratings yet
Unit 1 ML
93 pages
Made By: Swati Tripathi
No ratings yet
Made By: Swati Tripathi
31 pages
Data Science & ML Course Guide
No ratings yet
Data Science & ML Course Guide
83 pages
Unit 1
No ratings yet
Unit 1
93 pages
BE02000041 Funda of AI Unit 3 Basics of ML
No ratings yet
BE02000041 Funda of AI Unit 3 Basics of ML
86 pages
03 Introtoml Ueh
No ratings yet
03 Introtoml Ueh
43 pages
DSA5105 Lecture1
No ratings yet
DSA5105 Lecture1
51 pages
Chapter 5 Machine Learning
No ratings yet
Chapter 5 Machine Learning
96 pages
Machine Learning - UNIT I Notes
No ratings yet
Machine Learning - UNIT I Notes
31 pages
Machine Learning IAI
No ratings yet
Machine Learning IAI
94 pages
Q2 - Week 2 - GenMath DLP
No ratings yet
Q2 - Week 2 - GenMath DLP
12 pages
DSA5102X Lecture1
No ratings yet
DSA5102X Lecture1
51 pages
ML-Unit - 3 & 4
No ratings yet
ML-Unit - 3 & 4
33 pages
Sec 1630
No ratings yet
Sec 1630
145 pages
Machine Learning
No ratings yet
Machine Learning
56 pages
Blood Pressure Rubric
No ratings yet
Blood Pressure Rubric
1 page
Introduction To ML Unit-1
No ratings yet
Introduction To ML Unit-1
90 pages
07 Intro To ML
No ratings yet
07 Intro To ML
38 pages
Machine Learning - Introduction
No ratings yet
Machine Learning - Introduction
73 pages
Lec1 - Introduction
No ratings yet
Lec1 - Introduction
55 pages
1 - Introduction
No ratings yet
1 - Introduction
82 pages
PR & ML: CS5691: Machine Learning
No ratings yet
PR & ML: CS5691: Machine Learning
42 pages
Aiml Mca
100% (1)
Aiml Mca
38 pages
Lec1 Intoduction
No ratings yet
Lec1 Intoduction
34 pages
LM #02-ML Concepts & Frameworks
No ratings yet
LM #02-ML Concepts & Frameworks
31 pages
Machine Learning
No ratings yet
Machine Learning
33 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
15 pages
Machine Learning - ch1
No ratings yet
Machine Learning - ch1
46 pages
Pe and Health: Quarter 1 - Module 1: Exercise For Fitness
100% (1)
Pe and Health: Quarter 1 - Module 1: Exercise For Fitness
16 pages
Rpms-Ipcrf Portfolio: Cris C. Banjao
No ratings yet
Rpms-Ipcrf Portfolio: Cris C. Banjao
31 pages
ML 1
No ratings yet
ML 1
35 pages
CE880 Lecture5 Slides
No ratings yet
CE880 Lecture5 Slides
32 pages
ML Intro
No ratings yet
ML Intro
28 pages
L02 Fundamentals of ML
No ratings yet
L02 Fundamentals of ML
46 pages
Og Training
No ratings yet
Og Training
3 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
4 pages
L02 Fundamentals of ML
No ratings yet
L02 Fundamentals of ML
39 pages
Machine Learning Fundamentals Guide
No ratings yet
Machine Learning Fundamentals Guide
46 pages
Machine Learning Fundamentals
No ratings yet
Machine Learning Fundamentals
19 pages
ML Intro Theory
No ratings yet
ML Intro Theory
10 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
24 pages
AI Chapter 5
No ratings yet
AI Chapter 5
31 pages
ML - Module 1
No ratings yet
ML - Module 1
30 pages
Introduction to Machine Learning
No ratings yet
Introduction to Machine Learning
14 pages
Ai Unit5 Learning
No ratings yet
Ai Unit5 Learning
62 pages
Machine Learning for Beginners
No ratings yet
Machine Learning for Beginners
27 pages
National Foreign Languages Project 2020 (E 2020 PROJECT)
No ratings yet
National Foreign Languages Project 2020 (E 2020 PROJECT)
14 pages
Module2 ch2
No ratings yet
Module2 ch2
36 pages
Fundamentals of ML 1
No ratings yet
Fundamentals of ML 1
38 pages
Lecture 2
No ratings yet
Lecture 2
22 pages
Supervised Learning Insights
No ratings yet
Supervised Learning Insights
22 pages
1b Different Types
No ratings yet
1b Different Types
26 pages
Topic: School Improvement Plan (Sip) and Managing Programs and Projects
No ratings yet
Topic: School Improvement Plan (Sip) and Managing Programs and Projects
4 pages
Field Engagement Jabalpur Grey-Color
40% (5)
Field Engagement Jabalpur Grey-Color
21 pages
Project Create Proposal
No ratings yet
Project Create Proposal
5 pages
Trainees Progress Sheet 4.2-2b
100% (1)
Trainees Progress Sheet 4.2-2b
7 pages
Machine Learning Basics
No ratings yet
Machine Learning Basics
16 pages
Political Party LP Day 1
No ratings yet
Political Party LP Day 1
5 pages
English Greetings for Kids
No ratings yet
English Greetings for Kids
8 pages
Lesson Plan
No ratings yet
Lesson Plan
10 pages
Months, Days of The Week, & Time: Unit Plan
No ratings yet
Months, Days of The Week, & Time: Unit Plan
30 pages
Kindergarten Family Lessons
No ratings yet
Kindergarten Family Lessons
9 pages
Overview of The Curriculum Development Process
No ratings yet
Overview of The Curriculum Development Process
13 pages
Good Education in An Age of Measurement
No ratings yet
Good Education in An Age of Measurement
14 pages
Kohn's
No ratings yet
Kohn's
13 pages
Bragg - 2012 - The Effect of Mathematical Games On On-Task Behaviours in The Primary Classroom
No ratings yet
Bragg - 2012 - The Effect of Mathematical Games On On-Task Behaviours in The Primary Classroom
18 pages
Mbhte Format
No ratings yet
Mbhte Format
6 pages
Activity Sheets: Quarter 3 - MELC 20
100% (1)
Activity Sheets: Quarter 3 - MELC 20
10 pages
TIME IS GOLD New
No ratings yet
TIME IS GOLD New
28 pages
Assertive Discipline
100% (2)
Assertive Discipline
10 pages
Lesson Plan 15-1-2025 - G2
No ratings yet
Lesson Plan 15-1-2025 - G2
2 pages
Lesson Plan - Ancient Egypt 8th Grade
No ratings yet
Lesson Plan - Ancient Egypt 8th Grade
2 pages
Systems Practice Workbook
No ratings yet
Systems Practice Workbook
94 pages
The 4 Skills in Learning A Foreign Language
No ratings yet
The 4 Skills in Learning A Foreign Language
2 pages
Group 5 Research Final
No ratings yet
Group 5 Research Final
77 pages
Effects of E-Learning On Students Motivation
No ratings yet
Effects of E-Learning On Students Motivation
9 pages

Predictive Analytics Basics

Uploaded by

Predictive Analytics Basics

Uploaded by

DS605: Fundamentals of Machine Learning

Fundamentals of Predictive Analytics

Disclaimer: Most images incorporated within the presentation slides

Data Mining Tasks

● Cluster Analysis ● Regression

In Machine Learning terminology, these In Machine Learning terminology, these

Data Mining Tasks

● Cluster Analysis ● Regression

In Machine Learning terminology, these In Machine Learning terminology, these

● the science (and art) of programming computers

Data Rules Data Answers

● Training a model suggests training examples. Data Answers

● A model suggests state acquired through experience.

● Generalises a decision suggests the capability to make a

The learner’s input:

The learner’s output:

Independent and Identically Distributed (I.I.D.) Assumption

More formally, the task of supervised learning can be deﬁned as -

We call the output y(i) the

where each pair was generated by an unknown function y = f (x),

Inductive Learning: given

You might also like