H.T. No:                                                  Course Code: 201AM7E04
ADITYA ENGINEERING COLLEGE (A)
REINFORCEMENT LEARNING
(Artificial Intelligence and Machine Learning)
Time: 3 hours Max. Marks: 70
Answer ONE question from each unit
All Questions Carry Equal Marks
All parts of a question must be answered in one place only
UNIT – I
1 a Define Reinforcement Learning. Explain it with various examples. L2 CO1 [7M]
b Explain the k-armed bandit problem with an example. L2 CO1 [7M]
OR
2 a Explain optimistic initial values and the gradient bandit algorithm L2 CO1 [7M]
with an example.
b Explain incremental implementation. Explain tracking a nonstationary L2 CO1 [7M]
problem.
UNIT – II
3 a Discuss the Agent–Environment Interface with examples. L2 CO2 [7M]
b Discuss Goals and Rewards with examples. L2 CO2 [7M]
OR
4 a Define Dynamic Programming. Explain Policy Evaluation. L2 CO2 [7M]
b Explain Value Iteration and Asynchronous Dynamic Programming. L2 CO2 [7M]
UNIT – III
5 a Define Monte Carlo Prediction. Explain Monte Carlo Estimation of L2 CO3 [7M]
Action Values with examples.
b Explain Monte Carlo Control and Monte Carlo Control without L2 CO3 [7M]
Exploring Starts with examples.
OR
6 a Explain the unifying algorithm n-step Q(σ) with an example. L4 CO3 [7M]
b Explain Discounting-aware Importance Sampling with L2 CO3 [7M]
examples.
UNIT – IV
7 a Explain Off-policy Divergence with examples. L2 CO4 [7M]
b Define Semi-gradient Methods and the Deadly Triad with examples. L2 CO4 [7M]
OR
8 a Explain why the Bellman Error is not learnable. L2 CO4 [7M]
b Explain Dutch Traces in i) Monte Carlo Learning ii) Variable λ and γ, L2 CO4 [7M]
with examples.
UNIT – V
9 a Explain Policy Approximation and its advantages. L2 CO5 [7M]
b Explain the Policy Gradient Theorem. L3 CO5 [7M]
OR
10 a Explain REINFORCE: Monte Carlo Policy Gradient with an example. L2 CO5 [7M]
b Discuss Watson's Daily-Double Wagering and Optimizing L2 CO5 [7M]
Memory Control with examples.
*****