Created Deep Recurrent Q-Network example #85

Douglas-Cho · 2018-12-25T06:03:09Z

This shows the way to implement Deep Recurrent Q-Network (DRQN) model for the Cartpole case. I had to expand the state input to include a few number of past state data and created a meaningful sequential input stream for Long and Short-Term Memory (LSTM) model. Otherwise, it did not work with just current state information. This sounds like violating the Markov property assumption but this does the job.

Create cartpole-drqn.py

the graph for drqn

saved weights for drqn

Douglas-Cho added 6 commits December 25, 2018 13:18

Merge pull request #1 from Douglas-Cho/Douglas-Cho-drqn-1

25b7598

Create cartpole-drqn.py

the graph for drqn

e9b27c1

Merge pull request #2 from Douglas-Cho/Douglas-Cho-drqn-2

a622919

the graph for drqn

saved weights for drqn

83f7e65

Merge pull request #3 from Douglas-Cho/Douglas-Cho-drqn-3

1e43fe0

saved weights for drqn

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Created Deep Recurrent Q-Network example #85

Created Deep Recurrent Q-Network example #85

Uh oh!

Douglas-Cho commented Dec 25, 2018

Uh oh!

Uh oh!

Created Deep Recurrent Q-Network example #85

Are you sure you want to change the base?

Created Deep Recurrent Q-Network example #85

Uh oh!

Conversation

Douglas-Cho commented Dec 25, 2018

Uh oh!

Uh oh!