Instance Based Learning
• k-Nearest Neighbor
• Locally weighted regression
• Radial basis functions
• Case-based reasoning
• Lazy and eager learning
Instance-Based Learning
Key idea: just store all training examples ⟨xi, f(xi)⟩
Nearest neighbor (1-Nearest Neighbor):
• Given query instance xq, locate the nearest training example xn, then estimate f̂(xq) ← f(xn)
k-Nearest Neighbor:
• Given xq, take a vote among its k nearest neighbors (if discrete-valued target function)
• Take the mean of the f values of the k nearest neighbors (if real-valued):

$$\hat{f}(x_q) \leftarrow \frac{\sum_{i=1}^{k} f(x_i)}{k}$$
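A minimal sketch of both estimators in Python, assuming instances are numeric vectors under Euclidean distance and the training data is a list of (x, f(x)) pairs; the helper names (`euclidean`, `k_nearest`, etc.) are illustrative, not from the slides.

```python
import math
from collections import Counter

def euclidean(a, b):
    # standard Euclidean distance between two numeric vectors
    return math.sqrt(sum((ai - bi) ** 2 for ai, bi in zip(a, b)))

def k_nearest(examples, xq, k):
    # examples: list of (x, f(x)) pairs; return the k closest to the query xq
    return sorted(examples, key=lambda ex: euclidean(ex[0], xq))[:k]

def knn_classify(examples, xq, k):
    # discrete-valued target: majority vote among the k nearest neighbors
    votes = Counter(fx for _, fx in k_nearest(examples, xq, k))
    return votes.most_common(1)[0][0]

def knn_regress(examples, xq, k):
    # real-valued target: mean of the k nearest neighbors' f values
    return sum(fx for _, fx in k_nearest(examples, xq, k)) / k
```

For 1-Nearest Neighbor, call either function with k = 1.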
When to Consider Nearest Neighbor
• Instances map to points in Rn
• Less than 20 attributes per instance
• Lots of training data
Advantages
• Training is very fast
• Learn complex target functions
• Do not lose information
Disadvantages
• Slow at query time
• Easily fooled by irrelevant attributes
k-NN Classification
[Figure: 5-Nearest Neighbor classification of a query point xq, and the 1-NN decision surface]
Behavior in the Limit
Define p(x) as probability that instance x will be
labeled 1 (positive) versus 0 (negative)
Nearest Neighbor
• As number of training examples approaches infinity,
approaches Gibbs Algorithm
Gibbs: with probability p(x) predict 1, else 0
k-Nearest Neighbor:
• As number of training examples approaches infinity and k
gets large, approaches Bayes optimal
Bayes optimal: if p(x) > 0.5 then predict 1, else 0
• Note Gibbs has at most twice the expected error of Bayes
optimal
Distance-Weighted k-NN
Might want to weight nearer neighbors more heavily ...
$$\hat{f}(x_q) \leftarrow \frac{\sum_{i=1}^{k} w_i\, f(x_i)}{\sum_{i=1}^{k} w_i}
\qquad\text{where}\qquad
w_i \equiv \frac{1}{d(x_q, x_i)^2}$$

and d(xq, xi) is the distance between xq and xi
Note: now it makes sense to use all training examples instead of just the k nearest
→ Shepard's method
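A minimal sketch of the distance-weighted estimate, reusing the `euclidean` and `k_nearest` helpers sketched earlier; returning the stored value when d = 0 is a common convention to avoid division by zero, not something stated on the slide.

```python
def dw_knn_regress(examples, xq, k):
    # distance-weighted k-NN: weight each neighbor's f value by 1 / d(xq, xi)^2
    num = den = 0.0
    for xi, fx in k_nearest(examples, xq, k):
        d = euclidean(xq, xi)
        if d == 0.0:
            return fx            # xq coincides with a training example
        w = 1.0 / d ** 2
        num += w * fx
        den += w
    return num / den
```

Passing k = len(examples) weights all training examples by distance, i.e. Shepard's method.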
Curse of Dimensionality
Imagine instances described by 20 attributes, but
only 2 are relevant to target function
Curse of dimensionality: nearest neighbor is easily
misled when X is high-dimensional
One approach:
• Stretch jth axis by weight zj, where z1,z2,…,zn chosen to
minimize prediction error
• Use cross-validation to automatically choose weights
z1,z2,…,zn
• Note setting zj to zero eliminates dimension j altogether
see (Moore and Lee, 1994)
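A minimal sketch of evaluating one candidate weight vector z by leave-one-out cross-validation of 1-NN classification, reusing `knn_classify` from the earlier sketch; choosing z would wrap this in an outer search loop (grid search, gradient-free optimization, etc.). The names are illustrative.

```python
def stretch(x, z):
    # scale the j-th axis by weight z[j]; z[j] = 0 drops dimension j entirely
    return [zj * xj for zj, xj in zip(z, x)]

def loo_error(examples, z, k=1):
    # leave-one-out error of k-NN classification after stretching each axis by z
    scaled = [(stretch(x, z), fx) for x, fx in examples]
    mistakes = 0
    for i, (xi, fi) in enumerate(scaled):
        rest = scaled[:i] + scaled[i + 1:]
        if knn_classify(rest, xi, k) != fi:
            mistakes += 1
    return mistakes / len(scaled)
```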
Locally Weighted Regression
k-NN forms a local approximation to f for each query point xq
Why not form an explicit approximation f̂(x) for the region around xq?
• Fit a linear function to the k nearest neighbors
• Or fit a quadratic, etc.
• Produces a "piecewise approximation" to f
Several choices of error to minimize :
• Squared error over k nearest neighbors
$$E_1(x_q) \equiv \frac{1}{2} \sum_{x \,\in\, k\ \text{nearest neighbors of}\ x_q} \bigl(f(x) - \hat{f}(x)\bigr)^2$$

• Distance-weighted squared error over all training examples D:

$$E_2(x_q) \equiv \frac{1}{2} \sum_{x \in D} \bigl(f(x) - \hat{f}(x)\bigr)^2 \, K\bigl(d(x_q, x)\bigr)$$
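A minimal sketch of fitting and evaluating a local linear approximation at one query point, using NumPy and a Gaussian kernel on distance as the weighting (one plausible choice of K; the slide leaves K unspecified). The weighted least-squares fit minimizes a criterion of the second form above.

```python
import numpy as np

def lwr_predict(X, y, xq, tau=1.0):
    # Locally weighted linear regression: fit a linear f_hat around xq by
    # minimizing the distance-weighted squared error, then evaluate at xq.
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    xq = np.asarray(xq, dtype=float)
    Xb = np.hstack([np.ones((len(X), 1)), X])        # add an intercept column
    d2 = np.sum((X - xq) ** 2, axis=1)               # squared distances d^2(xq, x)
    k = np.exp(-d2 / (2.0 * tau ** 2))               # Gaussian kernel K(d(xq, x))
    sw = np.sqrt(k)
    # weighted least squares via row scaling: argmin_w || sw * (Xb @ w - y) ||^2
    w, *_ = np.linalg.lstsq(Xb * sw[:, None], y * sw, rcond=None)
    return float(np.concatenate(([1.0], xq)) @ w)
```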
Radial Basis Function Networks
• Global approximation to target function, in terms
of linear combination of local approximations
• Used, for example, in image classification
• A different kind of neural network
• Closely related to distance-weighted regression,
but “eager” instead of “lazy”
Radial Basis Function Networks
[Figure: RBF network — a layer of kernel units over the inputs a1(x), a2(x), ..., an(x), combined linearly with weights w0, w1, ..., wk to produce the output f(x)]

where ai(x) are the attributes describing instance x, and

$$f(x) = w_0 + \sum_{u=1}^{k} w_u \, K_u\bigl(d(x_u, x)\bigr)$$

One common choice for Ku(d(xu, x)) is

$$K_u\bigl(d(x_u, x)\bigr) = e^{-\frac{1}{2\sigma_u^2} d^2(x_u, x)}$$
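A minimal NumPy sketch of this forward computation, assuming Gaussian kernels with given centers xu and widths σu; the weight vector w holds [w0, w1, ..., wk].

```python
import numpy as np

def rbf_output(x, centers, sigmas, w):
    # f(x) = w0 + sum_u  w_u * exp( -d^2(x_u, x) / (2 * sigma_u^2) )
    x = np.asarray(x, dtype=float)
    d2 = np.sum((np.asarray(centers, dtype=float) - x) ** 2, axis=1)
    k = np.exp(-d2 / (2.0 * np.asarray(sigmas, dtype=float) ** 2))
    return w[0] + float(np.dot(w[1:], k))
```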
Training RBF Networks
Q1: What xu to use for kernel function Ku(d(xu,x))?
• Scatter uniformly through instance space
• Or use training instances (reflects instance distribution)
Q2: How to train the weights (assume here Gaussian Ku)?
• First choose variance (and perhaps mean) for each Ku
– e.g., use EM
• Then hold Ku fixed, and train linear output layer
– efficient methods to fit linear function
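A minimal sketch of that second step under these assumptions: kernel centers already chosen, a single shared width σ, and the linear output layer fit in closed form by least squares. It pairs with the `rbf_output` sketch above; the function name is illustrative.

```python
import numpy as np

def train_rbf_weights(X, y, centers, sigma):
    # Hold the Gaussian kernels fixed and fit the linear output layer by least squares.
    X = np.asarray(X, dtype=float)
    C = np.asarray(centers, dtype=float)
    d2 = np.sum((X[:, None, :] - C[None, :, :]) ** 2, axis=2)   # (n, k) squared distances
    K = np.exp(-d2 / (2.0 * sigma ** 2))                        # kernel activations
    design = np.hstack([np.ones((len(X), 1)), K])               # prepend the bias unit
    w, *_ = np.linalg.lstsq(design, np.asarray(y, dtype=float), rcond=None)
    return w            # w[0] is w0; w[1:] are the k output weights
```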
Case-Based Reasoning
Can apply instance-based learning even when X ≠ Rn
→ need a different "distance" metric
Case-Based Reasoning is instance-based learning applied to
instances with symbolic logic descriptions:
((user-complaint error53-on-shutdown)
(cpu-model PowerPC)
(operating-system Windows)
(network-connection PCIA)
(memory 48meg)
(installed-applications Excel Netscape
VirusScan)
(disk 1Gig)
(likely-cause ???))
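A minimal sketch of one plausible "distance" for such symbolic descriptions: the fraction of shared attributes whose values match exactly. This is an illustrative stand-in; CADET's actual metric (next slides) matches qualitative function descriptions.

```python
def symbolic_similarity(case_a, case_b):
    # Fraction of shared attributes with exactly matching values, where each
    # case is a dict such as {"cpu-model": "PowerPC", "memory": "48meg", ...}.
    shared = set(case_a) & set(case_b)
    if not shared:
        return 0.0
    return sum(case_a[attr] == case_b[attr] for attr in shared) / len(shared)
```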
Case-Based Reasoning in CADET
CADET: 75 stored examples of mechanical devices
• each training example:
<qualitative function, mechanical structure>
• new query: desired function
• target value: mechanical structure for this function
Distance metric: match qualitative function
descriptions
Case-Based Reasoning in CADET
A stored case: T-junction pipe
[Figure: Structure — inflows (Q1, T1) and (Q2, T2) joining into outflow (Q3, T3), where Q = waterflow and T = temperature. Function — a qualitative graph in which Q1 and Q2 each influence Q3 positively, and T1 and T2 each influence T3 positively.]
A problem specification: Water faucet
[Figure: Structure — unknown (?). Function — a qualitative graph relating the control signals Cc, Ch and the inputs Qc, Qh, Tc, Th to the mixed output flow Qm and temperature Tm.]
Case-Based Reasoning in CADET
• Instances represented by rich structural
descriptions
• Multiple cases retrieved (and combined) to form
solution to new problem
• Tight coupling between case retrieval and problem
solving
Bottom line:
• Simple matching of cases useful for tasks such as
answering help-desk queries
• Area of ongoing research
Lazy and Eager Learning
Lazy: wait for query before generalizing
• k-Nearest Neighbor, Case-Based Reasoning
Eager: generalize before seeing query
• Radial basis function networks, ID3, Backpropagation, etc.
Does it matter?
• Eager learner must create global approximation
• Lazy learner can create many local approximations
• If they use the same H, the lazy learner can effectively represent more complex
functions (e.g., consider H = linear functions)
kd-trees (Moore)
• Eager version of k-Nearest Neighbor
• Idea: decrease time to find neighbors
– train by constructing a lookup (kd) tree
– recursively subdivide space
• ignore class of points
• lots of possible mechanisms: grid, maximum variance, etc.
– when looking for the nearest neighbor, search the tree
– the nearest neighbor can typically be found in O(log n) steps
– the k nearest neighbors can be found by generalizing the
process (still O(log n) steps if k is constant)
• Slower training but faster classification
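A minimal sketch of a kd-tree for 1-nearest-neighbor search, choosing the split axis round-robin and splitting at the median (just one of the possible mechanisms the slide lists); the names are illustrative.

```python
import math

def build_kd(points, depth=0):
    # points: list of (x, f(x)) pairs; split on axes round-robin at the median
    if not points:
        return None
    axis = depth % len(points[0][0])
    points = sorted(points, key=lambda p: p[0][axis])
    mid = len(points) // 2
    return {"point": points[mid], "axis": axis,
            "left": build_kd(points[:mid], depth + 1),
            "right": build_kd(points[mid + 1:], depth + 1)}

def nearest(node, xq, best=None):
    # depth-first search, pruning subtrees that cannot beat the current best
    if node is None:
        return best
    x, _ = node["point"]
    d = math.dist(xq, x)
    if best is None or d < best[0]:
        best = (d, node["point"])
    axis = node["axis"]
    diff = xq[axis] - x[axis]
    near, far = (node["left"], node["right"]) if diff < 0 else (node["right"], node["left"])
    best = nearest(near, xq, best)
    if abs(diff) < best[0]:          # far side could still contain a closer point
        best = nearest(far, xq, best)
    return best
```

Build once with `tree = build_kd(examples)` (the slower training step), then answer each query with `nearest(tree, xq)` (the faster classification step).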
kd Tree
Instance Based Learning Summary
• Lazy versus Eager learning
– lazy: work done at query (testing) time
– eager: work done at training time
– instance-based learning is sometimes lazy
• k-Nearest Neighbor (k-NN) is lazy
– classify based on k nearest neighbors
– key: determining neighbors
– variations:
• distance weighted combination
• locally weighted regression
– limitation: curse of dimensionality
• “stretching” dimensions
Instance Based Learning Summary
• k-d trees (eager version of k-nn)
– structure built at train time to quickly find neighbors
• Radial Basis Function (RBF) networks (eager)
– units active in region (sphere) of space
– key: picking/training kernel functions
• Case-Based Reasoning (CBR) is generally lazy
– like nearest neighbor, but without continuous features
– may have other types of features:
• structural (graphs in CADET)