Media Summary: Modified reward function: 1.0 * ( -abs(Pole Angle) + 0.21 ) Batch size: 16 Structure shape: [128128128] -- this is overkill. Can we train an AI to complete it's objective in a video game world without needing to build a model of the world before hand? Deep Q Learning Network (DQN) Cartpole Game
Deep Q Learning Vs Cartpole - Detailed Analysis & Overview
Modified reward function: 1.0 * ( -abs(Pole Angle) + 0.21 ) Batch size: 16 Structure shape: [128128128] -- this is overkill. Can we train an AI to complete it's objective in a video game world without needing to build a model of the world before hand? Deep Q Learning Network (DQN) Cartpole Game Balancing a typical inverted pendulum with Temporal Difference methods. Done as a personal hobby project. Controlled with aย ... 42. Deep Q Learning in Tensorflow for CartPole