Using Reinforcement Learning to Beat ATARI Pong and Balance Cart-Pole

Beating ATARI Pong with Policy Gradients

Using a simple Policy Gradient Reinforcement Learning model in the SignalPop AI Designer, we quickly trained an agent on an ATARI gym (which uses the AleControl) to beat the ATARI Pong game. The new MyCaffeTrainerRL seamlessly trains the model to learn the game dynamics using MyCaffe.

Balancing Cart-Pole using Policy Gradients

Using a simple Policy Gradient Reinforcement Learning model in the SignalPop AI Designer, we quickly trained an agent on a Cart-Pole gym to balance a pole. The new MyCaffeTrainerRL seamlessly trains the model to learn the balancing dynamics using MyCaffe.
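
For readers new to policy gradients, the core bookkeeping step can be sketched in a few lines of Python. This is an illustrative sketch only, not the MyCaffeTrainerRL implementation: the function names, the discount factor, and the assumption that the gym pays a reward of 1 for every step the pole stays up are ours. It shows how per-step rewards from one episode are commonly turned into discounted, standardized returns that weight each action's contribution to the gradient.

```python
def discounted_returns(rewards, gamma=0.99):
    """Compute G_t = r_t + gamma * G_{t+1} for every step of one episode.

    `gamma` is a hypothetical discount factor chosen for illustration.
    """
    returns = [0.0] * len(rewards)
    running = 0.0
    # Walk backwards so each step accumulates the discounted future reward.
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running
        returns[t] = running
    return returns


def standardize(values, eps=1e-8):
    """Shift values to zero mean and scale them to (roughly) unit variance."""
    if not values:
        return []
    n = len(values)
    mean = sum(values) / n
    variance = sum((v - mean) ** 2 for v in values) / n
    std = variance ** 0.5
    # eps guards against division by zero when all values are equal.
    return [(v - mean) / (std + eps) for v in values]


# Example: a short episode, assuming a reward of 1 per surviving step.
episode_rewards = [1.0, 1.0, 1.0]
weights = standardize(discounted_returns(episode_rewards, gamma=0.5))
```

In a policy gradient update, each action's log-probability would then be scaled by its weight, so actions followed by more reward are reinforced more strongly than actions taken just before the episode ended.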

To try out these models and train them yourself, see our Tutorials for step-by-step instructions that will get you started quickly. For other example videos, see our Examples page.