Codestin Search App

Chess RL

Chess Environment & Encoding [done]
- Board stepping [done]
- Observataion encoding [done]
- Action encoding [done]
Neural Nework Model [done]
- Shared resnet with policy and value head [done]
Monte Carlo Tree search [done]
Self play loop [done]
- Play [done]
- Store data [done]
- Export data as PyTorch dataset-compatible format [done]
Training pipeline [done]
Evaluation and Rating TODO
Optimization and Scaling TODO

Interresting Papers:

Monte-Carlo tree search as regularized policy optimization https://arxiv.org/abs/2007.12509

Improvement of MCTS for low Nsim values

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
buffer		buffer
chess_env		chess_env
configs		configs
lichess		lichess
model		model
self_play		self_play
train		train
utils		utils
.gitignore		.gitignore
README.md		README.md
chatgpt_roadmap.md		chatgpt_roadmap.md
run_launch.py		run_launch.py
test.ipynb		test.ipynb