Thanks to visit codestin.com
Credit goes to github.com

Skip to content

Commit 3092089

Browse files
apb7norvig
authored andcommitted
Fix #741: Add learning agent to vacuum_world.ipynb (#742)
* Modified table for TableDrivenVacuumAgent * Add learing agent * Add image for learning agent
1 parent 8dbf924 commit 3092089

File tree

2 files changed

+16
-1
lines changed

2 files changed

+16
-1
lines changed

images/general_learning_agent.jpg

31.8 KB
Loading

vacuum_world.ipynb

Lines changed: 16 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -30,7 +30,8 @@
3030
"* Simple Reflex Agent Program\n",
3131
"* Model-Based Reflex Agent Program\n",
3232
"* Goal-Based Agent Program\n",
33-
"* Utility-Based Agent Program"
33+
"* Utility-Based Agent Program\n",
34+
"* Learning Agent"
3435
]
3536
},
3637
{
@@ -518,6 +519,20 @@
518519
"**Figure 2.14** of the book shows a model-based, utility-based agent:\n",
519520
"<img src=\"files/images/model_utility_based_agent.jpg\">"
520521
]
522+
},
523+
{
524+
"cell_type": "markdown",
525+
"metadata": {},
526+
"source": [
527+
"## LEARNING AGENT\n",
528+
"\n",
529+
"Learning allows the agent to operate in initially unknown environments and to become more competent than its initial knowledge alone might allow. Here, we will breifly introduce the main ideas of learning agents. \n",
530+
"\n",
531+
"A learning agent can be divided into four conceptual components. The **learning element** is responsible for making improvements. It uses the feedback from the **critic** on how the agent is doing and determines how the performance element should be modified to do better in the future. The **performance element** is responsible for selecting external actions for the agent: it takes in percepts and decides on actions. The critic tells the learning element how well the agent is doing with respect to a fixed performance standard. It is necesaary because the percepts themselves provide no indication of the agent's success. The last component of the learning agent is the **problem generator**. It is responsible for suggesting actions that will lead to new and informative experiences. \n",
532+
"\n",
533+
"**Figure 2.15** of the book sums up the components and their working: \n",
534+
"<img src=\"files/images/general_learning_agent.jpg\">"
535+
]
521536
}
522537
],
523538
"metadata": {

0 commit comments

Comments
 (0)