Module 1
i) VAPNIK-CHERVONENKIS (VC) DIMENSION
ii) PROBABLY APPROXIMATELY CORRECT (PAC) LEARNING
iii) MODEL SELECTION AND GENERALIZATION
Vapnik-Chervonenkis dimension (1971)
Measures the complexity of the hypothesis space H.
It does not count the number of hypotheses in H.
It counts the maximum number of distinct instances of X that can be completely
discriminated by H (shattering).
VC DIMENSION
Dichotomy: a labeling of the instances in a 2-class problem
X: each element belongs to class 0 or class 1
H: the set of all straight lines in 2D
Linear classifier with two data points: every one of the 2^2 = 4 possible labelings can be realized by some line.
Linear classifier with three data points (not collinear): every one of the 2^3 = 8 labelings can be realized, so a line shatters three points.
Shattering: a hypothesis class H shatters a set of N instances if, for each of the 2^N possible labelings (dichotomies), some h ∈ H classifies the instances accordingly.
Linear classifier with four data points: no line can realize the XOR-style labeling (opposite corners in the same class), so no set of four points can be shattered. Hence the VC dimension of straight lines in 2D is 3.
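The figures for these slides do not survive in text form, so here is a brute-force check of the two facts above (a minimal sketch, assuming NumPy and SciPy are available; "some line realizes this labeling" is cast as a linear-programming feasibility problem):

```python
from itertools import product
import numpy as np
from scipy.optimize import linprog

def line_realizes(points, labels):
    """Is there a line w.x + b = 0 separating this labeling?
    Separable iff some (w, b) satisfies y_i * (w.x_i + b) >= 1."""
    y = np.where(np.asarray(labels) == 1, 1.0, -1.0)
    X = np.asarray(points, dtype=float)
    # Feasibility LP over variables (w1, w2, b): -y_i*(w.x_i + b) <= -1
    A = -y[:, None] * np.hstack([X, np.ones((len(X), 1))])
    res = linprog(c=[0.0, 0.0, 0.0], A_ub=A, b_ub=-np.ones(len(X)),
                  bounds=[(None, None)] * 3, method="highs")
    return res.success

def line_shatters(points):
    """True if every one of the 2^N dichotomies is realizable by a line."""
    return all(line_realizes(points, labs)
               for labs in product([0, 1], repeat=len(points)))

print(line_shatters([(0, 0), (1, 0), (0, 1)]))          # True: 3 points shattered
print(line_shatters([(0, 0), (1, 1), (1, 0), (0, 1)]))  # False: XOR labeling fails
```

The LP asks for a strict separator; rescaling (w, b) lets us demand margin 1 without loss of generality.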
Rectangle Classifier
Vapnik-Chervonenkis (VC) Dimension
An axis-aligned rectangle can shatter 4 points (for example, four points in a diamond arrangement), but not 5: the fifth point inside the bounding box of the other four cannot be labeled 0 while they are labeled 1. Hence the VC dimension of axis-aligned rectangles in 2D is 4.
Illustration - Vapnik-Chervonenkis dimension.
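The same brute-force idea verifies this, and for rectangles the realizability test is elementary: a labeling is achievable iff the bounding box of the positive points contains no negative point (a minimal sketch in plain Python; the diamond example is illustrative):

```python
from itertools import product

def rectangle_realizes(points, labels):
    """An axis-aligned rectangle realizes a labeling iff the bounding
    box of the positive points contains no negative point."""
    pos = [p for p, lab in zip(points, labels) if lab == 1]
    if not pos:
        return True  # a rectangle away from all points labels everything 0
    xmin = min(x for x, _ in pos); xmax = max(x for x, _ in pos)
    ymin = min(y for _, y in pos); ymax = max(y for _, y in pos)
    return not any(xmin <= x <= xmax and ymin <= y <= ymax
                   for (x, y), lab in zip(points, labels) if lab == 0)

def rectangle_shatters(points):
    """True if rectangles realize all 2^N labelings of the points."""
    return all(rectangle_realizes(points, labs)
               for labs in product([0, 1], repeat=len(points)))

diamond = [(0, 1), (0, -1), (1, 0), (-1, 0)]
print(rectangle_shatters(diamond))             # True  -> VC dim >= 4
print(rectangle_shatters(diamond + [(0, 0)]))  # False (5th point trapped inside)
```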
Probably Approximately Correct Learning (PAC)
In computational learning theory, probably approximately correct learning
(PAC learning) is a framework for the mathematical analysis of machine
learning algorithms.
It was proposed in 1984 by Leslie Valiant.
The goal is to guarantee that a learning algorithm will probably
(with high probability) find an approximately correct
hypothesis from a limited amount of training data.
PAC-learnability
To fully define the Probably Approximately Correct (PAC) learning framework,
several parameters are crucial.
These parameters provide the necessary constraints and criteria for the learning
algorithm to ensure its performance guarantees.
The key parameters include:
1) Hypothesis class (H): The hypothesis class defines the set of possible hypotheses
that the learning algorithm can select as the output. It represents the space of
functions from which the learning algorithm can choose the best approximation to
the target function.
2) Sample complexity (m): The sample complexity refers to the minimum number of training
examples required for the learning algorithm to find an approximately correct hypothesis
with the required confidence.
3) Error tolerance (ϵ): The error tolerance parameter specifies the acceptable level of error in
the output hypothesis. It quantifies how closely the learned hypothesis must approximate
the true target function.
4) Confidence parameter (δ): δ is the probability that the learning algorithm fails to find an
approximately correct hypothesis; the performance guarantees must hold with probability at
least 1 − δ. It is typically set to a small value.
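For a finite hypothesis class H and a learner that outputs a hypothesis consistent with the training sample, these parameters are tied together by the standard textbook bound m ≥ (1/ϵ)(ln |H| + ln(1/δ)), not derived in these notes. A minimal sketch (the example numbers are illustrative):

```python
import math

def pac_sample_bound(H_size, epsilon, delta):
    """m >= (1/eps) * (ln|H| + ln(1/delta)): enough i.i.d. examples for a
    consistent learner to be epsilon-accurate with probability 1 - delta."""
    return math.ceil((math.log(H_size) + math.log(1.0 / delta)) / epsilon)

# |H| = 2**10 hypotheses, 5% error tolerance, 99% confidence (delta = 0.01)
print(pac_sample_bound(2**10, epsilon=0.05, delta=0.01))  # 231 examples
```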
PAC-learnability
Terminology and notation required to define PAC-learnability
Let X be a set called the instance space which may be finite or infinite. For example, X may be
the set of all points in a plane.
A concept class C for X is a family of functions c ∶ X → {0, 1}. A member of C is called a
concept. A concept can also be thought of as a subset of X: a subset c ⊆ X defines a
unique function µc ∶ X → {0, 1} with µc(x) = 1 if x ∈ c and µc(x) = 0 otherwise.
A hypothesis h is also a function h ∶ X → {0, 1}. So, as in the case of concepts, a hypothesis can
also be thought of as a subset of X. H will denote a set of hypotheses.
We assume that F is an arbitrary, but fixed, probability distribution over X.
Training examples are obtained by taking random samples from X. We assume that the samples
are randomly generated from X according to the probability distribution F.
Definition
Let X be an instance space, C a concept class for X, h a hypothesis in C and F an
arbitrary, but fixed, probability distribution. The concept class C is said to be
PAC-learnable if there is an algorithm A which, for samples drawn with any
probability distribution F and any concept c ∈ C, will with probability at least
1 − δ produce a hypothesis h ∈ C whose error is at most ϵ, for any given
ϵ, δ ∈ (0, 1).
PAC Learning
[Figure slides: false negative and false positive regions; the error region between the target concept and the hypothesis; what "approximately correct" and "probably approximately correct" mean pictorially.]
PAC Learning for Axis-Aligned Rectangles
The classic worked example: the target concept is an axis-aligned rectangle, and the learner outputs the tightest rectangle enclosing the positive training examples. The error region is then the strip between the target rectangle and the learned one, and PAC learning asks for enough samples that this region has probability mass at most ϵ with probability at least 1 − δ.
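A minimal sketch of that learner (the target rectangle, the uniform distribution, and the sample size are illustrative assumptions; the sketch assumes at least one positive example is drawn):

```python
import random

def tightest_rectangle(examples):
    """Hypothesis: the smallest axis-aligned rectangle containing every
    positive example; returns (xmin, xmax, ymin, ymax)."""
    pos = [p for p, label in examples if label == 1]
    xs = [x for x, _ in pos]
    ys = [y for _, y in pos]
    return min(xs), max(xs), min(ys), max(ys)

# Target concept c: the rectangle [2, 6] x [1, 5]; F is uniform on [0, 10]^2.
def c(point):
    x, y = point
    return 1 if 2 <= x <= 6 and 1 <= y <= 5 else 0

random.seed(0)
points = [(random.uniform(0, 10), random.uniform(0, 10)) for _ in range(500)]
h = tightest_rectangle([(p, c(p)) for p in points])
print(h)  # close to (2, 6, 1, 5); h errs only on thin strips just inside c
```

In the standard analysis of this learner, m ≥ (4/ϵ) ln(4/δ) samples suffice: each of the four strips between the target and the learned rectangle then has mass at most ϵ/4 with probability at least 1 − δ.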
[Worked example slides: Example problems 1 and 2.]
Model Selection and Generalization
What is a Model in Machine Learning?
Constructing a Model
In regression, assuming a linear function is an inductive bias.
Among all lines, choosing the one that minimizes squared error is another inductive bias.
Each hypothesis class has a certain capacity and can learn only certain functions.
Underfitting: the hypothesis class is less complex than the target function (e.g., fitting a line when the data come from a higher-order curve).
Overfitting: the hypothesis class is more complex than needed, so the model also fits the noise in the training data.
Model Selection
A model is a mathematical or logical representation of a solution space.
In order to formulate a hypothesis for a problem, we have to choose some model.
Model selection may also refer to the process of choosing one particular approach from among several alternatives:
possible algorithms
possible sets of features
initial values for certain parameters
Inductive bias
The set of assumptions we make to make learning possible is called the
inductive bias of the learning algorithm.
One way we introduce inductive bias is by assuming a hypothesis class.
Examples
• In learning the class of family car, assuming the shape of a rectangle is
an inductive bias (used to keep the model simple).
• In regression, assuming a linear function is an inductive bias.
Advantages of a simple model
Easy to use
Easy to train (fewer parameters)
Easy to explain
Easy to arrive at good generalization
A simple model would generalize better than a complex model. This principle is
known as Occam's razor, which states that simpler explanations are more
plausible and any unnecessary complexity should be shaved off.
Generalization
How well a model trained on the training set predicts the right
output for new instances is called generalization.
The model with the best generalization should be selected.
Main causes for poor performance of learning algorithms
Underfitting
Overfitting
Testing Generalization: Cross-Validation
Generalization can be tested only if we have data outside the training set.
We simulate this by dividing the dataset into two parts: a training set and a
validation (testing) set.
The hypothesis that is the most accurate on the validation set is the
best one (the one that has the best inductive bias). This process is
called cross-validation.
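A minimal sketch of this procedure (synthetic data; polynomial degree stands in for the choice of hypothesis class, and the names are illustrative):

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.uniform(-1, 1, 60)
y = np.sin(3 * x) + rng.normal(0.0, 0.1, 60)  # unknown target plus noise

# Simulate unseen data: 40 points to train on, 20 held out for validation.
x_tr, x_va = x[:40], x[40:]
y_tr, y_va = y[:40], y[40:]

best_degree, best_err = None, float("inf")
for degree in range(1, 10):                # candidate hypothesis classes
    coef = np.polyfit(x_tr, y_tr, degree)  # fit on the training set only
    val_err = np.mean((np.polyval(coef, x_va) - y_va) ** 2)
    if val_err < best_err:
        best_degree, best_err = degree, val_err

print("degree chosen by validation error:", best_degree)
```

Too low a degree underfits (high training and validation error); too high a degree overfits (low training error, high validation error); the validation set picks the one in between.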
Underfitting And Overfitting
The Triple Trade-off: in all learning algorithms there is a trade-off between the complexity of the hypothesis class, the amount of training data, and the generalization error on new examples.
Training set and validation set: the training set is used to fit candidate models; the validation set is used to choose among them (model selection).
Test set: held out and used only once, to report the generalization error of the final model; it must not influence any choice made during training.