This website requires JavaScript.
Explore
Help
Sign In
daviddoji
/
vanilla-machine-learning
Watch
1
Star
0
Fork
0
You've already forked vanilla-machine-learning
Code
Issues
Pull Requests
Packages
Projects
Releases
Wiki
Activity
Files
c7d6fea511149970718f13491d4d11f0ac9a1277
vanilla-machine-learning
/
reinforcement_learning
History
ritchie
c7d6fea511
fixed reward function -> less deflection is rewarding
2017-12-16 15:26:44 +01:00
..
deep_Q_bridge.ipynb
popleft bug buffer fixed and double deep q learning added
2017-11-06 16:17:50 +01:00
her_deep_Q_bridge.ipynb
fixed reward function -> less deflection is rewarding
2017-12-16 15:26:44 +01:00
policy_gradients_flagpole.ipynb
policy network openai gym flagpole
2017-10-31 22:30:20 +01:00