2017210698/Reinforcement_Learning_Notes

Reinforcement Learning Notes

The notes so far cover bandit algorithms, Markov decision processes (MDPs), model-free methods, value function approximation, and policy optimization.

Going further would require a lot more reading. Because of course finals and projects, I have to stop here for now. Any advice is appreciated.

About

A naive version.
