DeepSeek AI logo

DeepSeek R1: Open-Source Reasoning

What Happened

Chinese AI lab DeepSeek released R1, an openly released reasoning model that approached OpenAI's o1-class performance at a fraction of the cost. Trained with reportedly modest compute budgets, it challenged the assumption that frontier reasoning required the largest Western-scale investment programs.

Why It Mattered

Sent shockwaves through the AI industry. NVIDIA stock dropped temporarily. Forced a reassessment of the 'scaling requires billions' narrative. Demonstrated that compute efficiency and algorithmic innovation could rival brute-force scaling.

Organizations

Tags

Related Milestones

Mistral AI logo, creators of Mixtral
Open Source

Mixtral 8x7B: Efficient Mixture of Experts

French startup Mistral AI released Mixtral 8x7B, a mixture-of-experts model that matched or beat GPT-3.5 while using a fraction of the compute per token. It demonstrated that clever architecture could compete with brute-force scaling.

Mistral AI
OpenClaw GitHub organization avatar
Open Source

OpenClaw: The Personal AI Assistant Goes Open Source

The `openclaw/openclaw` repository launched on GitHub, framing itself as 'your own personal AI assistant' that ran on users' own devices across the channels they already used, from WhatsApp and Telegram to Slack, Discord, and iMessage. Instead of keeping the assistant trapped in a single app, OpenClaw combined messaging integrations, voice, tools, browser control, local skills, and device-side control into an always-on personal agent.

Peter SteinbergerOpenClaw
Meta AI logo
Open Source

Llama 2: Meta Opens the Floodgates

Meta released Llama 2, a family of widely available large language models (7B, 13B, 70B parameters) distributed as open weights under a custom license that allowed broad commercial use. While not open-source in the strict OSI sense, it gave companies and researchers access to a frontier-quality model they could run, customize, and deploy themselves.

Mark ZuckerbergMeta
Meta AI logo
Open Source

Llama 3: Open-Source Catches Up

Meta released Llama 3 (8B and 70B, later 405B), closing the gap with closed frontier models. The 405B release put near-frontier open-weight models into more developers' hands, even though Meta's licensing still sat outside a strict open-source definition.

Meta
Astronaut riding a horse, iconic Stable Diffusion generated image
Open Source

Stable Diffusion: Open-Source Image Generation

Stable Diffusion was released as a widely available text-to-image model that could run on consumer hardware, with model weights distributed under an open release rather than an API-only product. Unlike DALL-E, anyone could download it, run it locally, and build on top of it. An explosion of community modifications, fine-tunes, and applications followed.

Emad MostaqueStability AICompVis (LMU Munich)

Get the latest AI milestones as they happen

Join the newsletter. No spam, just signal.