Learning from Observations
Chapter 18
Sections 1–3
Outline
• Learning agents
• Inductive learning
• Decision tree learning
Learning
• Learning is essential for unknown environments,
– i.e., when designer lacks omniscience
• Learning is useful as a system construction
method,
– i.e., expose the agent to reality rather than trying to
write it down
• Learning modifies the agent's decision
mechanisms to improve performance
Learning agents
Learning element
• Design of a learning element is affected by
– Which components of the performance element are to
be learned
– What feedback is available to learn these components
– What representation is used for the components
• Type of feedback:
– Supervised learning: correct answers for each
example
– Unsupervised learning: correct answers not given
– Reinforcement learning: occasional rewards
Inductive learning
• Simplest form: learn a function from examples
– f is the target function
– an example is a pair (x, f(x))
• Problem: find a hypothesis h such that h ≈ f, given a training set of examples
• (This is a highly simplified model of real learning:
– Ignores prior knowledge
– Assumes examples are given)
Inductive learning method
• Construct/adjust h to agree with f on training set
• (h is consistent if it agrees with f on all examples)
• E.g., curve fitting:
• Ockham’s razor: prefer the simplest hypothesis consistent with data
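As a minimal sketch of this procedure in Python (assuming NumPy; the data points are made up for illustration), fit polynomials of increasing degree and keep the first hypothesis consistent with the training set:

```python
import numpy as np

# Illustrative training set of pairs (x, f(x)); the data points are made up.
xs = np.array([0.0, 1.0, 2.0, 3.0, 4.0])
ys = np.array([0.0, 1.1, 1.9, 3.2, 3.9])

def consistent(h, xs, ys, tol=1e-6):
    # h is consistent if it agrees with f on all training examples.
    return np.allclose(np.polyval(h, xs), ys, atol=tol)

# Ockham's razor: consider hypotheses in order of increasing complexity
# (polynomial degree) and keep the first one consistent with the data.
# A degree (m-1) polynomial always interpolates m points with distinct x,
# so the loop is guaranteed to terminate.
for degree in range(len(xs)):
    h = np.polyfit(xs, ys, degree)
    if consistent(h, xs, ys):
        print(f"Simplest consistent hypothesis has degree {degree}")
        break
```

With the noisy points above, only the full interpolating polynomial is exactly consistent; insisting on exact consistency can thus force a needlessly complex h, which is why the simplest adequate hypothesis is preferred.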
Learning decision trees
Problem: decide whether to wait for a table at a restaurant,
based on the following attributes:
1. Alternate: is there an alternative restaurant nearby?
2. Bar: is there a comfortable bar area to wait in?
3. Fri/Sat: is today Friday or Saturday?
4. Hungry: are we hungry?
5. Patrons: number of people in the restaurant (None, Some, Full)
6. Price: price range ($, $$, $$$)
7. Raining: is it raining outside?
8. Reservation: have we made a reservation?
9. Type: kind of restaurant (French, Italian, Thai, Burger)
10. WaitEstimate: estimated waiting time (0-10, 10-30, 30-60, >60)
Attribute-based representations
• Examples described by attribute values (Boolean, discrete, continuous)
• E.g., situations where I will/won't wait for a table:
• Classification of examples is positive (T) or negative (F)
[table of the 12 training examples omitted]
Decision trees
• One possible representation for hypotheses
• E.g., here is the “true” tree for deciding whether to wait:
Expressiveness
• Decision trees can express any function of the input attributes.
• E.g., for Boolean functions, truth table row → path to leaf:
• Trivially, there is a consistent decision tree for any training set, with one path to a leaf for each example (unless f is nondeterministic in x), but it probably won't generalize to new examples
• Prefer to find more compact decision trees
Hypothesis spaces
How many distinct decision trees with n Boolean attributes?
= number of Boolean functions
= number of distinct truth tables with 2^n rows = 2^(2^n)
• E.g., with 6 Boolean attributes, there are
2^(2^6) = 18,446,744,073,709,551,616 trees
How many purely conjunctive hypotheses (e.g., Hungry ∧ ¬Rain)?
• Each attribute can be in (positive), in (negative), or out
⇒ 3^n distinct conjunctive hypotheses
• More expressive hypothesis space
– increases chance that target function can be expressed
– increases number of hypotheses consistent with training set
⇒ may get worse predictions
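Both counts are easy to sanity-check (a quick Python snippet; exact integer arithmetic, so no overflow):

```python
n = 6
print(2 ** (2 ** n))  # distinct truth tables with 2**n rows: 18446744073709551616
print(3 ** n)         # purely conjunctive hypotheses: 729
```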
Decision tree learning
• Aim: find a small tree consistent with the training examples
• Idea: (recursively) choose "most significant" attribute as root of
(sub)tree
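A minimal sketch of that recursion in Python. The representation (examples as dicts with a hypothetical "WillWait" label) and the `importance` callback are my assumptions; information gain, developed on the following slides, is the intended importance measure:

```python
from collections import Counter

def plurality_value(examples):
    # Most common classification among the examples (ties broken arbitrarily).
    return Counter(e["WillWait"] for e in examples).most_common(1)[0][0]

def decision_tree_learning(examples, attributes, parent_examples, importance):
    # Each example is a dict mapping attribute -> value, plus a "WillWait"
    # classification (a hypothetical representation; the slides don't fix one).
    # A tree is either a leaf classification or (attribute, {value: subtree}).
    if not examples:
        return plurality_value(parent_examples)
    classes = {e["WillWait"] for e in examples}
    if len(classes) == 1:              # all positive or all negative
        return classes.pop()
    if not attributes:                 # no attributes left: fall back to majority
        return plurality_value(examples)
    # Choose the "most significant" attribute as the root of this (sub)tree.
    a = max(attributes, key=lambda attr: importance(attr, examples))
    tree = {}
    for value in {e[a] for e in examples}:   # only values seen in these examples
        subset = [e for e in examples if e[a] == value]
        rest = [attr for attr in attributes if attr != a]
        tree[value] = decision_tree_learning(subset, rest, examples, importance)
    return (a, tree)
```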
Choosing an attribute
• Idea: a good attribute splits the examples into subsets
that are (ideally) "all positive" or "all negative"
• Patrons? is a better choice
Using information theory
• To implement Choose-Attribute in the DTL
algorithm
• Information Content (Entropy):
I(P(v1), … , P(vn)) = Σi=1..n −P(vi) log2 P(vi)
• For a training set containing p positive examples
and n negative examples:
I(p/(p+n), n/(p+n)) = −(p/(p+n)) log2 (p/(p+n)) − (n/(p+n)) log2 (n/(p+n))
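Both formulas translate directly into Python; a short sketch (the function names are mine, not from the slides):

```python
from math import log2

def I(*probs):
    # Information content (entropy) in bits; by the convention 0*log2(0) = 0,
    # zero-probability outcomes are skipped.
    return -sum(p * log2(p) for p in probs if p > 0)

def entropy_pn(p, n):
    # Entropy of a training set with p positive and n negative examples.
    return I(p / (p + n), n / (p + n))

print(entropy_pn(6, 6))   # 1.0 bit for the 12-example restaurant set
```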
Information gain
• A chosen attribute A divides the training set E into
subsets E1, … , Ev according to their values for A, where
A has v distinct values.
remainder(A) = Σi=1..v ((pi + ni)/(p + n)) · I(pi/(pi + ni), ni/(pi + ni))
• Information Gain (IG) or reduction in entropy from the
attribute test:
IG(A) = I(p/(p+n), n/(p+n)) − remainder(A)
• Choose the attribute with the largest IG
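Continuing the sketch above and reusing entropy_pn, remainder and IG are each a one-liner; a split is given as one (pi, ni) count pair per attribute value:

```python
def remainder(subsets):
    # subsets: one (pi, ni) pair of positive/negative counts per value of A.
    total = sum(p + n for p, n in subsets)
    return sum(((p + n) / total) * entropy_pn(p, n) for p, n in subsets)

def information_gain(p, n, subsets):
    # Reduction in entropy from testing attribute A.
    return entropy_pn(p, n) - remainder(subsets)
```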
Information gain
For the training set, p = n = 6, I(6/12, 6/12) = 1 bit
Consider the attributes Patrons and Type (and others too):
IG(Patrons) = 1 − [(2/12) I(0,1) + (4/12) I(1,0) + (6/12) I(2/6, 4/6)] ≈ 0.541 bits
IG(Type) = 1 − [(2/12) I(1/2, 1/2) + (2/12) I(1/2, 1/2) + (4/12) I(2/4, 2/4) + (4/12) I(2/4, 2/4)] = 0 bits
Patrons has the highest IG of all attributes and so is chosen by the DTL
algorithm as the root
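Plugging the counts read off these equations into the sketch above reproduces both numbers:

```python
# Patrons: None -> (0 pos, 2 neg), Some -> (4, 0), Full -> (2, 4)
print(information_gain(6, 6, [(0, 2), (4, 0), (2, 4)]))          # ~0.541
# Type: French -> (1, 1), Italian -> (1, 1), Thai -> (2, 2), Burger -> (2, 2)
print(information_gain(6, 6, [(1, 1), (1, 1), (2, 2), (2, 2)]))  # 0.0
```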
Example contd.
• Decision tree learned from the 12 examples:
• Substantially simpler than the “true” tree: a more complex hypothesis isn’t justified by a small amount of data
Performance measurement
• How do we know that h ≈ f?
1. Use theorems of computational/statistical learning theory
2. Try h on a new test set of examples
(use same distribution over example space as training set)
Learning curve = % correct on test set as a function of training set size
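A sketch of such a learning curve, assuming scikit-learn is available and using the iris dataset as a stand-in for the restaurant examples:

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)
# Hold out a test set drawn from the same distribution as the training data.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=0)

# Learning curve: % correct on the test set vs. number of training examples.
for m in range(5, len(X_train) + 1, 10):
    h = DecisionTreeClassifier(random_state=0).fit(X_train[:m], y_train[:m])
    print(m, h.score(X_test, y_test))
```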
Summary
• Learning needed for unknown environments,
lazy designers
• Learning agent = performance element +
learning element
• For supervised learning, the aim is to find a
simple hypothesis approximately consistent with
training examples
• Decision tree learning using information gain
• Learning performance = prediction accuracy
measured on test set