An interactive demonstration of some of the techniques described in Sutton and Barto's Reinforcement Learning: An Introduction (the in-progress second edition).

The specific algorithm implemented is "Differential Semi-Gradient Sarsa for Control", found in Section 10.3.
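For readers who want to see the update rule in code, here is a minimal Python sketch of differential semi-gradient Sarsa with a linear action-value estimate. It is not the demo's own source. The two-state task, the one-hot features, the step sizes, and every function name (`step`, `features`, `q_hat`, `differential_sarsa`) are assumptions chosen for illustration only.

```python
import random

# Hypothetical toy task, NOT the environment used by the demo.
N_STATES, N_ACTIONS = 2, 2


def step(state, action):
    """Continuing task: action 0 stays put, action 1 switches state.

    Staying pays 0.1 in state 0 and 1.0 in state 1; switching pays 0.
    Returns (reward, next_state).
    """
    if action == 1:
        return 0.0, 1 - state
    return (0.1 if state == 0 else 1.0), state


def features(state, action):
    """One-hot feature vector x(s, a) over all (state, action) pairs."""
    x = [0.0] * (N_STATES * N_ACTIONS)
    x[state * N_ACTIONS + action] = 1.0
    return x


def q_hat(w, state, action):
    """Linear estimate q(s, a, w) = w . x(s, a)."""
    return sum(wi * xi for wi, xi in zip(w, features(state, action)))


def greedy(w, state):
    """Action with the highest estimated value (ties go to the lowest index)."""
    return max(range(N_ACTIONS), key=lambda a: q_hat(w, state, a))


def epsilon_greedy(w, state, epsilon, rng):
    """Random action with probability epsilon, otherwise the greedy one."""
    if rng.random() < epsilon:
        return rng.randrange(N_ACTIONS)
    return greedy(w, state)


def differential_sarsa(steps=20000, alpha=0.1, beta=0.01, epsilon=0.1, seed=0):
    """Run the control loop; returns (weights, average-reward estimate)."""
    rng = random.Random(seed)
    w = [0.0] * (N_STATES * N_ACTIONS)
    avg_reward = 0.0  # running estimate of the average reward per step
    state = 0
    action = epsilon_greedy(w, state, epsilon, rng)
    for _ in range(steps):
        reward, next_state = step(state, action)
        next_action = epsilon_greedy(w, next_state, epsilon, rng)
        # Differential TD error: reward measured relative to the average.
        delta = (reward - avg_reward
                 + q_hat(w, next_state, next_action)
                 - q_hat(w, state, action))
        avg_reward += beta * delta
        # Semi-gradient step: for a linear estimate the gradient is x(s, a).
        grad = features(state, action)
        w = [wi + alpha * delta * gi for wi, gi in zip(w, grad)]
        state, action = next_state, next_action
    return w, avg_reward
```

Calling `differential_sarsa()` returns the learned weights and the average-reward estimate; `greedy(w, s)` then reads off the preferred action in state `s`. One-hot features make this equivalent to a tabular method, which keeps the sketch short. A tile coder or another feature map could replace `features` without changing the update loop.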

The current action-value functions
Reward history