0% found this document useful (0 votes)

4 views19 pages

RL Presentation2

Uploaded by

mzg.ghskhanpurbaggasher

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

4 views19 pages

RL Presentation2

Uploaded by

mzg.ghskhanpurbaggasher

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 19

PRESENTATION

• Topic: Reinforcement Learning

• Prepared and Presented by:

Shahbaz Saeed
Muzahir Mehdi
Touqeer Awan
REINFORCEMENT LEARNING

• Reinforcement Learning is a feedback-based machine learning

approach here an agent learns to which actions to perform by
looking at the environment and the results of actions.

• For each correct action, the agent gets positive feedback, and
for each incorrect action, the agent gets negative feedback or
penalty.
Elements of Reinforcement Learning

• The agent or the learner

• The environment the

agent interacts with

• The policy that the agent

follows to take actions

• The reward signal that the

agent observes upon
taking actions
REINFORCEMENT LEARNING
• The agent interact with environment and identifies the possible actions.

• The primary goal of an agent in reinforcement learning is to perform actions by

looking at the environment and get the maximum positive reward.

• In reinforcement learning, the agent learns automatically sing feedbacks without

any labeled data, unlike supervised learning.

• Since there is no labeled data, so the agent is bound to learn by its experience
only.
REINFORCEMENT LEARNING
• Not just blind search, try to be smart about it.

• Reinforce learning is used to solve specific type of problem where decision making
is sequential, and the goal is long-term, such as game-playing, robotics etc.

Why do we need reinforcement learning?

• 1. To solve complex problems in uncertain environments
• 2. To enable agents to learn from their own experiences
• 3.To develop agents that can adapt to new situations.
Types of Reinforcement learning
• Positive Reinforcement Learning
 is a recurrence of behavior due to positive rewards.

 Positive rewards increase strength and the frequency of a specific behavior.

 This encourages to execute similar actions that yield maximum reward.

• Negative Reinforcement Learning

 negative rewards are used as a deterrent to weaken the behavior and to avoid it.

 Negative rewards decreases strength and the frequency of a specific behavior.

How Does Reinforcement Learning
Works?
• To understand the working process of RL, we need to consider two main
things:

Environment: It can be anything such as room, maze, football ground etc.

Agent: An intelligent agent such as AI robot.

• This maze is considering of an
S6 block, which is a wall, S8 a
fire pit, and S4 a diamond
block.

• The agent cannot cross the S6

block, as it is a solid wall.

• If the agent reaches S4 block,

then get the +1 reward; if it
reaches the fire pit, then gets
-1 reward point.

• It can take four actions: move

up, move down, move left
and move right.
• It will be the difficult
condition for the agent
whether he should go up or
down as each block has the
same value.

• So the above approach is not

suitable for the agent to
reach the destination.

• Hence to solve the problem,

we will use the Bellman
equation, which is the main
concept behind
reinforcement learning.
Model-Based vs Model-Free learning algorithms

• There are two main types of Reinforcement Learning algorithms:

• 1. Model-Based Algorithms
• 2. Model-Free Algorithms
Model-Based Algorithms
• They are used in scenarios where we have complete knowledge of the
environment and how it reacts to different actions.

• In Model-based Reinforcement Learning the agent has access to the model of the
environment i.e., action required to be performed to go from one state to
another, probabilities attached, and corresponding rewards attached.

• They allow the reinforcement learning agent to plan ahead by thinking ahead.

• For static/fixed environments, Model-based Reinforcement Learning is more

suitable.
Model-Free Algorithms
• Model-free algorithms find the optimal policy with very limited knowledge of the
dynamics of the environment.

• They estimate the optimal policy directly from experience i.e., interaction between
agent and environment without having any hint of the reward function.

• Model-free Reinforcement Learning should be applied in scenarios involving

incomplete information of the environment.

• In real-world, we don't have a fixed environment. Self-driving cars have a dynamic

environment with changing traffic conditions, route diversions etc. In such
scenarios, Model-free algorithms outperform other techniques
Common Mathematical and Algorithmic
Frameworks

• Markov Decision Process (MDP)

• Bellman Equations
• Dynamic Programming
• Value Iteration
• Policy Iteration
• Q-learning
Markov Decision Process (MDP)

• The components involved in a Markov Decision Process (MDP) is a

decision maker called an agent that interacts with the environment it is
placed in.

• These interactions occur sequentially overtime.

• In each timestamp, the agent will get some representation of the

environment state. Given this representation, the agent selects an action
to make. The environment is then transitioned into some new state and
the agent is given a reward as a consequence of its previous action.
Bellman
Equations
The value of a given state (s)
is determined by taking a
maximum of the actions we
can take in the state the
agent is in. The aim of the
agent is to pick the action that
is going to maximize the
value.
Q Learning

it’s a value-based model free

approach for supplying
information to intimate which
action an agent should
perform. It revolves around
the notion of updating Q
values which shows the value
of doing action A in state S.
Value update rule is the main
aspect of the Q-learning
algorithm.
Applications of deep Reinforcement Learning
• Industrial Manufacturing
Reinforcement Learning is very commonly applied in Robotics.
• Self-driving cars
The algorithms learn to recognize pedestrians, roads, traffic, detect street signs in the
environment and act accordingly.
• Trading and Finance
An RL agent can select whether to hold, buy or sell a share, it is assessed using market
benchmark standards.
• Natural Language Processing
NLP tasks like question-answering, summarization, chatbot implementation can
be done by a Reinforcement Learning agent.
• Healthcare
RL Bots trained to perform surgeries and in better diagnosis of diseases

Machine Learning Basics for Students
100% (1)
Machine Learning Basics for Students
55 pages
Reinforcement Learning Cheat Sheet: Return
No ratings yet
Reinforcement Learning Cheat Sheet: Return
7 pages
Reinforcement Learning
100% (1)
Reinforcement Learning
25 pages
Reinforcement Learning: Nazia Bibi
100% (1)
Reinforcement Learning: Nazia Bibi
61 pages
Reinforcement Learning Guide
No ratings yet
Reinforcement Learning Guide
64 pages
RL Vishnu Sankar
No ratings yet
RL Vishnu Sankar
26 pages
Reinforcement
No ratings yet
Reinforcement
9 pages
Unit-5 ML Notes
No ratings yet
Unit-5 ML Notes
31 pages
Unit 5
No ratings yet
Unit 5
45 pages
Reinforcement Learning 2012
No ratings yet
Reinforcement Learning 2012
653 pages
Artificial Intelligence: Computer Science & Engineering, Khulna University
No ratings yet
Artificial Intelligence: Computer Science & Engineering, Khulna University
30 pages
Midterm 1: CS 188 Summer 2019 Introduction To Artificial Intelligence
No ratings yet
Midterm 1: CS 188 Summer 2019 Introduction To Artificial Intelligence
13 pages
Unit 5 ML 3year
No ratings yet
Unit 5 ML 3year
17 pages
Types of Data:: Reference Website
No ratings yet
Types of Data:: Reference Website
15 pages
Unit 5
No ratings yet
Unit 5
10 pages
114021
No ratings yet
114021
55 pages
Optimal Control
No ratings yet
Optimal Control
51 pages
Reinforced Learning
No ratings yet
Reinforced Learning
25 pages
Reinforcement Learning
No ratings yet
Reinforcement Learning
11 pages
Reinforcement Learning: Csci 5512: Artificial Intelligence Ii
No ratings yet
Reinforcement Learning: Csci 5512: Artificial Intelligence Ii
30 pages
Lecture 5
No ratings yet
Lecture 5
28 pages
Lecture RL
No ratings yet
Lecture RL
37 pages
Lecture 29 RL
No ratings yet
Lecture 29 RL
38 pages
Bellman, Lee - Functional Equations in Dynamic Programming
No ratings yet
Bellman, Lee - Functional Equations in Dynamic Programming
18 pages
Unit V Reinforcement Learning and Genetic Algorithm
No ratings yet
Unit V Reinforcement Learning and Genetic Algorithm
40 pages
L-14 - Reinforcement-L-d-07062024-111949am
No ratings yet
L-14 - Reinforcement-L-d-07062024-111949am
22 pages
Unit-5 (AI)
No ratings yet
Unit-5 (AI)
21 pages
Unit 5-1
No ratings yet
Unit 5-1
8 pages
What Is Reinforcement Learning
No ratings yet
What Is Reinforcement Learning
15 pages
Sara Reinforcement Learning
No ratings yet
Sara Reinforcement Learning
69 pages
Lecture Week12
No ratings yet
Lecture Week12
37 pages
No-Regret Learning in AI
No ratings yet
No-Regret Learning in AI
14 pages
Unit 5 - Reinforcement Learning
No ratings yet
Unit 5 - Reinforcement Learning
15 pages
Cosmo Learning
No ratings yet
Cosmo Learning
891 pages
Sustainable Manufacturing Demand Response
No ratings yet
Sustainable Manufacturing Demand Response
10 pages
cs747 A2020 Quizzes PDF
No ratings yet
cs747 A2020 Quizzes PDF
5 pages
Markov Decision Processes: - The Markov Property - The Markov Decision Process - Partially Observable Mdps
No ratings yet
Markov Decision Processes: - The Markov Property - The Markov Decision Process - Partially Observable Mdps
24 pages
ANSWERS TO 15-381 Final, Spring 2004: Friday May 7, 2004
No ratings yet
ANSWERS TO 15-381 Final, Spring 2004: Friday May 7, 2004
20 pages
RL & DL Notes
No ratings yet
RL & DL Notes
73 pages
ML Assignment 2
No ratings yet
ML Assignment 2
6 pages
What Is Reinforcement Learning
No ratings yet
What Is Reinforcement Learning
5 pages
M.Tech AI & ML Syllabus 2022-23
No ratings yet
M.Tech AI & ML Syllabus 2022-23
130 pages
Unit-5 Mla
No ratings yet
Unit-5 Mla
22 pages
Exp-14 Reinforcement Learning
No ratings yet
Exp-14 Reinforcement Learning
11 pages
4.3 Reinforcement Learning
No ratings yet
4.3 Reinforcement Learning
27 pages
UNIT V Reinforcement Learning
No ratings yet
UNIT V Reinforcement Learning
8 pages
7.reinforcement Learning-Introduction-The Learning Task Q-Learning
No ratings yet
7.reinforcement Learning-Introduction-The Learning Task Q-Learning
34 pages
Intro to Reinforcement Learning
No ratings yet
Intro to Reinforcement Learning
9 pages
Machine Learning Unit-1.2
No ratings yet
Machine Learning Unit-1.2
23 pages
Module 1
No ratings yet
Module 1
72 pages
Reinforcement Learning
No ratings yet
Reinforcement Learning
3 pages
Lecture 9 Reiforcement Learning
No ratings yet
Lecture 9 Reiforcement Learning
29 pages
MLT Unit-5 Notes
No ratings yet
MLT Unit-5 Notes
17 pages
Unit 3
No ratings yet
Unit 3
29 pages
RL & DL Notes
No ratings yet
RL & DL Notes
43 pages
Unit-5 Reinforcemnt and Q Learning
No ratings yet
Unit-5 Reinforcemnt and Q Learning
45 pages
Unit 4
No ratings yet
Unit 4
56 pages
Ai (It) Unit-5
No ratings yet
Ai (It) Unit-5
43 pages
ML Unit 4
No ratings yet
ML Unit 4
9 pages
Reinforcement Learning
No ratings yet
Reinforcement Learning
17 pages
Unit 5
No ratings yet
Unit 5
58 pages
Lect 2
No ratings yet
Lect 2
26 pages
Risk-Based Inspection for Industry
No ratings yet
Risk-Based Inspection for Industry
13 pages
Deep Reinforcement Learning in Cloud-Edge
No ratings yet
Deep Reinforcement Learning in Cloud-Edge
20 pages
6CS4-02 Machine Learning
No ratings yet
6CS4-02 Machine Learning
2 pages
ML 10
No ratings yet
ML 10
9 pages
IntroductiontoRL BR
No ratings yet
IntroductiontoRL BR
22 pages
DLMAIRIL01 Q4-2024 Session1
No ratings yet
DLMAIRIL01 Q4-2024 Session1
84 pages
Applied Machine Learning For Games A Graduate School Course
No ratings yet
Applied Machine Learning For Games A Graduate School Course
9 pages
Mca20 21syllabus
No ratings yet
Mca20 21syllabus
22 pages
RL Dynamic Programming Lecture
No ratings yet
RL Dynamic Programming Lecture
43 pages
Unit-5 MLT
No ratings yet
Unit-5 MLT
13 pages
Unit 5 ML
No ratings yet
Unit 5 ML
15 pages
Reinforcement Learning
No ratings yet
Reinforcement Learning
38 pages
R22ML 5
No ratings yet
R22ML 5
24 pages
Reinforcement Learning-1
No ratings yet
Reinforcement Learning-1
19 pages
Week 4 ML
No ratings yet
Week 4 ML
8 pages
CMPE257 - W10C13 - Reinforcement Learning
No ratings yet
CMPE257 - W10C13 - Reinforcement Learning
161 pages
DRL for 5G Resource Allocation
No ratings yet
DRL for 5G Resource Allocation
4 pages
AI Unit 4 NEW
No ratings yet
AI Unit 4 NEW
60 pages
Reinforcement Learning: Foundations
No ratings yet
Reinforcement Learning: Foundations
276 pages
RL Learning
No ratings yet
RL Learning
9 pages
Crypto Portfolio with Deep Q-Learning
No ratings yet
Crypto Portfolio with Deep Q-Learning
16 pages
LANTERN Learning-Based Routing Policy For Reliable Energy-Harvesting IoT Networks
No ratings yet
LANTERN Learning-Based Routing Policy For Reliable Energy-Harvesting IoT Networks
13 pages
RL Lecture4
No ratings yet
RL Lecture4
16 pages
Reinforcement Learning
No ratings yet
Reinforcement Learning
5 pages
SquirRL - Automating Attack Analysis On Blockchain Incentive Mechanisms With Deep Reinforcement Learning
No ratings yet
SquirRL - Automating Attack Analysis On Blockchain Incentive Mechanisms With Deep Reinforcement Learning
20 pages
Statistical Reinforcement Learning Modern Machine Learning Approaches 1st Edition Masashi Sugiyama Download
100% (3)
Statistical Reinforcement Learning Modern Machine Learning Approaches 1st Edition Masashi Sugiyama Download
61 pages
Reinforcement Learning
No ratings yet
Reinforcement Learning
9 pages

RL Presentation2

Uploaded by

RL Presentation2

Uploaded by

PRESENTATION

• Topic: Reinforcement Learning

• Prepared and Presented by:

• Reinforcement Learning is a feedback-based machine learning

• The agent or the learner

• The environment the

• The policy that the agent

• The reward signal that the

• The primary goal of an agent in reinforcement learning is to perform actions by

• In reinforcement learning, the agent learns automatically sing feedbacks without

Why do we need reinforcement learning?

 Positive rewards increase strength and the frequency of a specific behavior.

 This encourages to execute similar actions that yield maximum reward.

• Negative Reinforcement Learning

 Negative rewards decreases strength and the frequency of a specific behavior.

Environment: It can be anything such as room, maze, football ground etc.

Agent: An intelligent agent such as AI robot.

• The agent cannot cross the S6

• If the agent reaches S4 block,

• It can take four actions: move

• So the above approach is not

• Hence to solve the problem,

• There are two main types of Reinforcement Learning algorithms:

• For static/fixed environments, Model-based Reinforcement Learning is more

• Model-free Reinforcement Learning should be applied in scenarios involving

• In real-world, we don't have a fixed environment. Self-driving cars have a dynamic

• Markov Decision Process (MDP)

• The components involved in a Markov Decision Process (MDP) is a

• These interactions occur sequentially overtime.

• In each timestamp, the agent will get some representation of the

it’s a value-based model free

You might also like