Thanks to visit codestin.com
Credit goes to github.com

Skip to content

Used https://github.com/DanielSlater/AlphaToe, further developed his environment to feature DQN, user playing interface, TD-learning and most importantly various methods for two agents learning by playing against each other.

License

Notifications You must be signed in to change notification settings

TobiasBrx/AlphaToe

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

62 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

AlphaToe

Applying the deep learning techniques from Alpha Go to play tic-tac-toe

These are the code examples to with my talk, the slide for which are in AlphaToe.pdf

As well as the slides, the file script/policy_gradient.py is a good starting point for the project. All networks are built using TensorFlow.

SetUp

To get running start by creating a virtual env/conda env with tensorFlow installed. Current instructions for this are at: https://www.tensorflow.org/versions/r0.11/get_started/os_setup.html#anaconda-installation

I've also found this useful: https://anaconda.org/jjhelmus/tensorflow

Then run the file file policy_gradient.py

This has been tested with python 2.7 and 3.5

About

Used https://github.com/DanielSlater/AlphaToe, further developed his environment to feature DQN, user playing interface, TD-learning and most importantly various methods for two agents learning by playing against each other.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •  

Languages