Thanks to visit codestin.com
Credit goes to github.com

Skip to content
/ tf-a2c Public

Minimal TensorFlow implementation of the Advantage Actor-Critic model for Atari games

License

Notifications You must be signed in to change notification settings

ppyht2/tf-a2c

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

20 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Ms Pacman Breakout Space Invaders

Advantage Actor-Critic

Minimal TensorFlow implementation of the Advantage Actor-Critic model for Atari games.

As an alternative to the asynchronous implementation, researchers found you can write asynchronous, deterministic implementation that waits for each actor to finish its segment of experience before performing an update, averaging over all of the actors. One advantage of this method is that it can more effectively use of GPUs, which perform best with large batch sizes. This algorithm is naturally called A2C, short for advantage actor-critic.

The gym environment wrappers used are from Open AI baseline

About

Minimal TensorFlow implementation of the Advantage Actor-Critic model for Atari games

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages