AJDERS/RL_grid

RL_grid

Train an agent to navigate a grid containing traps and walls, with no prior knowledge of the grid layout. The grid size is variable. The agent is trained using TD-learning, with the Monte Carlo method. The agent receives a positive reward when it enters the finishing state, and a negative reward when it enters a state with a trap or navigates to a previously visited state.
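
As a rough illustration of the reward scheme described above, here is a minimal sketch. It is not taken from this repository: the function names, the reward magnitudes, the `(row, col)` state encoding, and the `alpha` and `gamma` defaults are all assumptions chosen for the example. The update shown is a constant-step-size, every-visit Monte Carlo update of state values, which is one common way to combine episode returns with an incremental learning rate.

```python
# Hypothetical reward magnitudes; the README does not state the actual values.
REWARD_FINISH = 1.0
REWARD_TRAP = -1.0
REWARD_REVISIT = -0.1


def reward(state, finish, traps, visited):
    """Return the reward for entering `state`.

    Positive for the finishing state, negative for a trap or for a state
    that was already visited in this episode, zero otherwise.
    """
    if state == finish:
        return REWARD_FINISH
    if state in traps:
        return REWARD_TRAP
    if state in visited:
        return REWARD_REVISIT
    return 0.0


def monte_carlo_update(values, episode, alpha=0.1, gamma=0.9):
    """Update state values from one finished episode.

    `episode` is a list of (state, reward) pairs, where `reward` is the
    reward received after acting from `state`. The list is walked backwards
    so the discounted return can be accumulated in a single pass.
    """
    g = 0.0  # discounted return from the current step onwards
    for state, r in reversed(episode):
        g = r + gamma * g
        old = values.get(state, 0.0)
        # Move the estimate a fraction `alpha` towards the observed return.
        values[state] = old + alpha * (g - old)
    return values
```

A usage example under the same assumptions: build the episode by calling `reward` each time the agent enters a new cell, then pass the recorded `(state, reward)` pairs to `monte_carlo_update` once the episode ends.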

Example Simulation

TODO

  • Variable learning rate.
  • Variable exploration rate.
  • Indicate from which game onward the agent wins.
