Commit Graph

4 Commits

Author SHA1 Message Date
vik
c21a0681d0 popleft bug buffer fixed and double deep q learning added 2017-11-06 16:17:50 +01:00
vik
86341c51ab more stable learning due to target network 2017-11-06 11:01:38 +01:00
Ritchie
7e7d931adc Q algorithm learns 2017-11-04 13:17:22 +01:00
Ritchie
40dcf31329 policy network openai gym flagpole 2017-10-31 22:30:20 +01:00