Google Gemini AI model logo

Gemini: Google's Multimodal Response

What Happened

Google launched Gemini, its most capable AI model family, natively multimodal across text, code, images, audio, and video. Gemini Ultra matched or exceeded GPT-4 on many benchmarks. It marked Google DeepMind's full response to OpenAI's dominance.

Why It Mattered

Established the multimodal AI race. Google showed it could compete at the frontier. The merging of Google Brain and DeepMind into Google DeepMind signaled the company's all-in bet on AI.

Key People

Organizations

Tags

Related Milestones

Anthropic logo, creators of Claude 3
Product

Claude 3: Approaching Human-Level

Anthropic launched the Claude 3 family (Haiku, Sonnet, Opus), with Claude 3 Opus matching or exceeding GPT-4 on most benchmarks. It featured a 200K token context window, strong reasoning, nuanced instruction-following, and a 'personality' that users found distinctively thoughtful and careful.

Anthropic
OpenAI logo
Product

GPT-4o: Omni Model

OpenAI released GPT-4o ('omni'), a unified model that natively processed text, audio, images, and video with near-instant response times. It could hold natural voice conversations with emotional expression, sing, laugh, and respond to visual input in real time.

OpenAI
Google Gemini logo
Product

Gemini 2.0: Google's Agent Platform

Google launched Gemini 2.0, designed from the ground up for the agentic era — with native tool use, code execution, and multi-step reasoning. Deeply integrated into Google's ecosystem (Search, Workspace, Android), it brought AI agent capabilities to billions of users.

Google DeepMind
Google Gemini logo
Research

Gemini 1.5 Pro: Million-Token Context

Google released Gemini 1.5 Pro with a 1 million token context window (later extended to 2M) — able to process entire codebases, books, or hours of video in a single prompt. It could find a needle in a haystack across millions of tokens with near-perfect recall.

Google DeepMind
OpenAI logo, creators of ChatGPT
Product

ChatGPT: AI Goes Mainstream

OpenAI released ChatGPT, a conversational AI based on GPT-3.5 fine-tuned with RLHF (Reinforcement Learning from Human Feedback). It reached 1 million users in 5 days and 100 million in 2 months — the fastest-growing consumer application in history. People used it to write emails, debug code, brainstorm ideas, and a thousand other tasks.

Sam AltmanOpenAI

Get the latest AI milestones as they happen

Join the newsletter. No spam, just signal.