The Rise of AI Agents

What Happened

By 2025, frontier models were being wrapped in systems that could browse the web, call tools, edit files, execute code, manage state, and carry multi-step tasks forward with limited supervision. Claude Code, OpenAI's Operator, Google's Project Mariner, OpenClaw, and a wave of agent frameworks turned 'AI agent' from a research label into a practical product category.

Why It Mattered

AI agents shifted the paradigm from AI-as-chatbot to AI-as-operator. The important change was not just better conversation, but systems that could observe, plan, and act across real software environments. That raised sharper questions about permissions, oversight, reliability, and the future of knowledge work.
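
As a rough illustration of the observe, plan, act loop described above, here is a minimal sketch in Python. Everything in it is hypothetical: the Agent class, the tool names, and the scripted planner standing in for a model are assumptions made for this example, not the API of any product named on this page. It shows one way a permission check can sit between planning and acting.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional, Set, Tuple

# Hypothetical types for illustration only; not any vendor's API.
Action = Tuple[str, str]  # (tool name, argument)


@dataclass
class Agent:
    tools: Dict[str, Callable[[str], str]]  # tool name -> callable
    allowed: Set[str]                       # tools the operator has approved
    history: List[str] = field(default_factory=list)

    def run(
        self,
        plan: Callable[[List[str]], Optional[Action]],
        max_steps: int = 5,
    ) -> List[str]:
        for _ in range(max_steps):
            # Plan: pick the next action from what has been observed so far.
            action = plan(self.history)
            if action is None:
                break
            name, arg = action
            # Permission check: refuse tools outside the allowlist.
            if name not in self.allowed:
                self.history.append(f"denied:{name}")
                continue
            # Act, then record the result as the next observation.
            result = self.tools[name](arg)
            self.history.append(f"{name}:{result}")
        return self.history


def scripted_planner(history: List[str]) -> Optional[Action]:
    # Stand-in for a model: replays a fixed script, one action per step.
    script = [("read_file", "notes.txt"), ("delete_file", "notes.txt")]
    return script[len(history)] if len(history) < len(script) else None


agent = Agent(
    tools={
        "read_file": lambda path: f"contents of {path}",
        "delete_file": lambda path: "deleted",
    },
    allowed={"read_file"},
)
log = agent.run(scripted_planner)
```

In this sketch the read is carried out and the delete is refused, because only read_file is on the allowlist. The max_steps cap is a simple stand-in for the oversight and reliability limits a real deployment would need.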

Related Milestones

Product

AI Coding Agents Transform Software Development

AI coding agents like Claude Code, Cursor, GitHub Copilot's agentic workflows, and OpenClaw-linked remote coding loops pushed beyond autocomplete into delegated engineering work. These systems could inspect repositories, run tests, edit files, use terminals and browsers, and iterate on tasks over multiple turns.

Anthropic, Cursor
Product

Claude 3.5 Sonnet: A Leading Coding Model

Anthropic's Claude 3.5 Sonnet emerged as one of the strongest widely used models for coding tasks, with developers praising its code generation, debugging, and software engineering capabilities. It powered tools like Claude Code, enabling AI to work directly inside developer environments.

Anthropic
Product

Claude 4 / Opus 4: Frontier Reasoning

Anthropic released Claude Opus 4, a model with significantly enhanced reasoning, extended thinking capabilities, and the ability to sustain complex multi-step problem-solving over long contexts. It excelled at agentic tasks, code generation, and nuanced analysis.

Anthropic
Product

OpenAI o3: Advanced Reasoning at Scale

OpenAI released o3, the successor to o1, with markedly improved reasoning capabilities. It posted state-of-the-art results on many math and coding benchmarks and handled problems that previously required expert-level multi-step analysis.

OpenAI
Product

Gemini 2.0: Google's Agent Platform

Google launched Gemini 2.0, designed for the agentic era, with native tool use, code execution, and multi-step reasoning. Integrated across Google's ecosystem (Search, Workspace, Android), it brought agent capabilities to products used by billions of people.

Google DeepMind
