Commit Graph

3 Commits

Author SHA1 Message Date
vik
86341c51ab more stable learning due to target network 2017-11-06 11:01:38 +01:00
Ritchie
7e7d931adc Q algorithm learns 2017-11-04 13:17:22 +01:00
Ritchie
40dcf31329 policy network openai gym flagpole 2017-10-31 22:30:20 +01:00