id: 12443    nodeId: 12443    type: General    point: 127.0    linkPoint: .0    maker: cella    permission: linkable    made at: 2018.07.30 07:26    edited at: 2018.09.03 00:08
Deep Q-networks (DQN) code samples
https://gist.github.com/viswanathgs/ca9788020cfcc7849b9181d9239a2ef4
https://medium.com/@tuzzer/follow-up-cart-pole-balancing-with-q-network-976d13f88d2f
https://github.com/Microsoft/CNTK/blob/9231ece8cec2fa25b4ca06018854cfe3ffb8ea53/Tutorials/CNTK_203_Reinforcement_Learning_Basics.ipynb
