Introduction To Machine Learning - Unit 15 - Week 12

The document outlines an assignment for the NPTEL course 'Introduction to Machine Learning,' detailing the submission status and scores for various questions related to probability, hypothesis functions, and reinforcement learning. It includes specific questions and accepted answers, along with feedback on correctness. The assignment was submitted on April 16, 2025, and the document also provides information about the course structure and learning objectives.



Week 12 : Assignment 12

The due date for submitting this assignment has passed.
Due on 2025-04-16, 23:59 IST.
Assignment submitted on 2025-04-16, 11:37 IST
1) Let P(A_i) = 2^(−i). Calculate the upper bound for P(⋃_{i=1}^{4} A_i) using the union bound (rounded to 3 decimal places). 2 points

0.875
0.937
0.984
1

Yes, the answer is correct.
Score: 2
Accepted Answers:
0.937
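As a quick check, the union bound just sums the individual probabilities: P(⋃ A_i) ≤ Σ_{i=1}^{4} 2^(−i) = 15/16 = 0.9375, which matches the accepted option.

```python
# Union bound: P(A_1 ∪ ... ∪ A_4) <= P(A_1) + ... + P(A_4)
probs = [2 ** -i for i in range(1, 5)]  # P(A_i) = 2^-i, for i = 1..4
print(sum(probs))  # 0.9375 -> the accepted option 0.937
```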

2) Given 50 hypothesis functions, each trained with 10^5 samples, what is the lower bound on the probability that there does not exist a hypothesis function with error greater than 0.1? 2 points

1 − 100e^(−2·10^3)
1 − 100e^(−10^3)
1 − 50e^(−2·10^3)
1 − 50e^(−10^3)

No, the answer is incorrect.
Score: 0
Accepted Answers:
1 − 100e^(−2·10^3)
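The accepted option has the shape of the standard Hoeffding-plus-union-bound guarantee: P(some hypothesis has error > ε) ≤ 2M·e^(−2ε²n), so the probability that no such hypothesis exists is at least 1 − 2M·e^(−2ε²n). With M = 50, n = 10^5, and ε = 0.1, the coefficient is 100 and the exponent is −2·10^3. A quick check of those two numbers:

```python
M, n, eps = 50, 10**5, 0.1
# Hoeffding + union bound: P(some h has error > eps) <= 2*M*exp(-2*eps^2*n),
# so the lower bound asked for is 1 - 2*M*exp(-2*eps^2*n).
coeff = 2 * M
exponent = -2 * eps**2 * n
print(coeff, round(exponent))  # 100 -2000  ->  1 - 100*e^(-2*10^3)
```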

3) The VC dimension of a pair of squares is: 1 point

3
4
5
6

No, the answer is incorrect.
Score: 0
Accepted Answers:
5
4) In games like Chess or Ludo, the transition function is known to us. But what about Counter-Strike, Mortal Kombat, or Super Mario? In games where we do not know T, we can only query the game simulator with the current state and action, and it returns the next state. 1 point
This means we cannot directly argmax or argmin over V(T(S, a)). Therefore, learning the value function V is not sufficient to construct a policy.
Which of these could we do to overcome this? (more than one may apply)
Assume there exists a method to do each option. You have to judge whether doing it solves the stated problem.

Directly learn the policy.
Learn a different function which stores value for state-action pairs (instead of only state like V does).
Learn T along with V.
Run a random agent repeatedly till it wins. Use this as the winning policy.

Yes, the answer is correct.
Score: 1
Accepted Answers:
Directly learn the policy.
Learn a different function which stores value for state-action pairs (instead of only state like V does).
Learn T along with V.
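The second accepted option is the idea behind action-value (Q) functions: once Q(s, a) is learnt, a greedy policy needs no transition model at all. A minimal sketch; the function name, states, and values here are hypothetical:

```python
# Greedy policy from a state-action value table: no transition model T needed.
# Q maps (state, action) -> estimated value; the entries below are made up.
def greedy_action(Q, state, actions):
    return max(actions, key=lambda a: Q[(state, a)])

Q = {("Start", "left"): -0.5, ("Start", "right"): 0.9}
print(greedy_action(Q, "Start", ["left", "right"]))  # right
```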

For the rest of the questions, we will follow a simplistic game and see how a Reinforcement Learning agent can learn to behave optimally.

This is our game:

[Figure: the game board, a row of cells with LE at the left end, RE at the right end, and the states X1 to X4 and Start in between.]
At the start of the game, the agent is on the Start state and can choose to move left or right at each turn.

If it reaches the right end RE, it wins and if it reaches the left end LE, it loses.

Because we love maths so much, instead of saying the agent wins or loses,

we will say that the agent gets a reward of +1 at RE and a reward of -1 at LE.

Then the objective of the agent is simply to maximize the reward it obtains!

5) For each state, we define a variable that will store its value. The value of the state will help the agent determine how to behave later. First we will learn this value. 1 point

Let V be the mapping from state to its value.


Initially,
V(LE) = -1
V(X1) = V(X2) = V(X3) = V(X4) = V(Start) = 0
V(RE) = +1

For each state S ∈ {X1, X2, X3, X4, Start}, with S_L being the state to its immediate left and S_R being the state to its immediate right, repeat:
V(S) = 0.9 × max(V(S_L), V(S_R))

Till V converges (does not change for any state).

What is V(X4) after one application of the given formula?

1
0.9
0.81
0

Yes, the answer is correct.


Score: 1
Accepted Answers:
0.9
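To make the update concrete, here is a minimal Python sketch of the procedure. The chain layout (LE, X1, X2, Start, X3, X4, RE) is an assumption, since the board figure is not reproduced here; with it, the first synchronous sweep already gives V(X4) = 0.9 and V(X1) = 0, matching Questions 5 and 6.

```python
# Value iteration for the chain game, using the quiz's update rule:
#   V(S) = 0.9 * max(V(S_L), V(S_R))   for every non-terminal state S.
# The board layout is an assumption; the original figure is not reproduced.
chain = ["LE", "X1", "X2", "Start", "X3", "X4", "RE"]
V = dict.fromkeys(chain, 0.0)
V["LE"], V["RE"] = -1.0, 1.0  # terminal values stay fixed

while True:
    # Synchronous sweep: every state reads the previous sweep's values.
    new_V = dict(V)
    for i in range(1, len(chain) - 1):  # skip the terminal ends
        new_V[chain[i]] = 0.9 * max(V[chain[i - 1]], V[chain[i + 1]])
    if new_V == V:  # converged: no state changed
        break
    V = new_V

print(V["X4"])  # 0.9 for this layout (X4 sits next to RE)
```

Note that the converged numbers depend on the assumed layout, so treat them as indicative rather than as the quiz's exact values.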

6) What is V(X1) after one application of the given formula? 1 point

-1
-0.9
-0.81
0

Yes, the answer is correct.


Score: 1
Accepted Answers:
0

7) What is V(X1) after V converges? 1 point

0.54
-0.9
0.63
0

No, the answer is incorrect.


Score: 0
Accepted Answers:
0.54

8) The behavior of an agent is called a policy. Formally, a policy is a mapping from states to actions. In our case, we have two actions: left and right. We will denote the action for our policy as A. 1 point
Clearly, the optimal policy would be to choose action right in every state. Which of the following can we use to mathematically describe our optimal policy using the learnt V?

For options (c) and (d), T is the transition function, defined as T(state, action) = next_state. (more than one option may apply)

(a) A = Left if V(S_L) > V(S_R), otherwise Right
(b) A = Left if V(S_R) > V(S_L), otherwise Right
(c) A = argmax_a V(T(S, a))
(d) A = argmin_a V(T(S, a))

No, the answer is incorrect.
Score: 0
Accepted Answers:
(a) A = Left if V(S_L) > V(S_R), otherwise Right
(c) A = argmax_a V(T(S, a))
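Option (c) can be turned into code directly. Below is a hedged sketch reusing the assumed chain from the earlier value-iteration example; the V entries are illustrative monotone values (increasing toward RE), not the quiz's exact converged numbers:

```python
# Option (c): greedy policy through the transition function T,
#   A(S) = argmax over a of V(T(S, a)).
chain = ["LE", "X1", "X2", "Start", "X3", "X4", "RE"]  # assumed layout
V = {"LE": -1.0, "X1": 0.2, "X2": 0.4, "Start": 0.6,
     "X3": 0.7, "X4": 0.9, "RE": 1.0}  # illustrative values only

def T(state, action):
    """Deterministic transition: move to the left or right neighbour."""
    i = chain.index(state)
    return chain[i - 1] if action == "left" else chain[i + 1]

def policy(state):
    return max(("left", "right"), key=lambda a: V[T(state, a)])

print({s: policy(s) for s in chain[1:-1]})  # 'right' everywhere, as expected
```

Because V increases toward RE, the argmax picks "right" in every non-terminal state, which is exactly the optimal policy described in the question.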
