4 milestones in AI history
BERT: Bidirectional Language Understanding
Google published BERT (Bidirectional Encoder Representations from Transformers), which could understand language context from both directions simultaneously. BERT shattered records on 11 NLP benchmarks. Google integrated it into Search, affecting 10% of all queries.
Gemini: Google's Multimodal Response
Google launched Gemini, its most capable AI model family, natively multimodal across text, code, images, audio, and video. Gemini Ultra matched or exceeded GPT-4 on many benchmarks. It marked Google DeepMind's full response to OpenAI's dominance.
Gemini 1.5 Pro: Million-Token Context
Google released Gemini 1.5 Pro with a 1 million token context window (later extended to 2M) — able to process entire codebases, books, or hours of video in a single prompt. It could find a needle in a haystack across millions of tokens with near-perfect recall.
Gemini 2.0: Google's Agent Platform
Google launched Gemini 2.0, designed from the ground up for the agentic era — with native tool use, code execution, and multi-step reasoning. Deeply integrated into Google's ecosystem (Search, Workspace, Android), it brought AI agent capabilities to billions of users.