TD-Gammon: Reinforcement Learning Plays Backgammon
Gerald Tesauro created TD-Gammon, a neural network that learned to play backgammon at expert level through self-play using temporal difference reinforcement learning. It discovered novel strategies that surprised human experts.
Gerald TesauroIBM