Reinforcement Learning

5 milestones in AI history

1992ResearchSecond AI Winter

TD-Gammon: Reinforcement Learning Plays Backgammon

Gerald Tesauro created TD-Gammon, a neural network that learned to play backgammon at expert level through self-play using temporal difference reinforcement learning. It discovered novel strategies that surprised human experts.

Gerald TesauroIBM

2013.12ResearchDeep Learning Breakthrough

DeepMind's DQN Masters Atari Games

DeepMind demonstrated a deep reinforcement learning agent (Deep Q-Network) that learned to play Atari 2600 games directly from pixel inputs, achieving superhuman performance on many games with no task-specific engineering. Google acquired DeepMind for ~$500 million shortly after.

Volodymyr MnihDemis HassabisDeepMind

2016.03CompetitionDeep Learning Breakthrough

AlphaGo Defeats Lee Sedol

DeepMind's AlphaGo defeated Lee Sedol, one of the greatest Go players ever, 4-1 in a five-game match in Seoul. Go has more possible positions than atoms in the universe — brute force was impossible. AlphaGo used deep reinforcement learning and Monte Carlo tree search. In Game 2, AlphaGo played Move 37 — a move so creative that experts called it 'beautiful' and 'not a human move.'

Demis HassabisDavid SilverDeepMindGoogle

2017.10ResearchDeep Learning Breakthrough

AlphaGo Zero: Learning From Scratch

AlphaGo Zero achieved superhuman Go performance with ZERO human knowledge — no training data from human games, no hand-crafted features. It learned entirely through self-play, and within 40 days surpassed all previous versions, including the one that beat Lee Sedol.

David SilverDeepMind

2019.01CompetitionThe Transformer Era

AlphaStar Masters StarCraft II

DeepMind's AlphaStar reached Grandmaster level in StarCraft II, a real-time strategy game requiring long-term planning, deception, and split-second tactics with incomplete information — far more complex than Go or chess.

DeepMind

TD-Gammon: Reinforcement Learning Plays Backgammon

DeepMind's DQN Masters Atari Games

AlphaGo Defeats Lee Sedol

AlphaGo Zero: Learning From Scratch

AlphaStar Masters StarCraft II

Related Topics