AI in 2023
6 milestones in artificial intelligence
GPT-4: Multimodal Intelligence
OpenAI released GPT-4, a multimodal model that could understand both text and images. It passed the bar exam (90th percentile), scored 1410 on the SAT, and demonstrated remarkably nuanced reasoning. It was a massive leap from GPT-3.5 in accuracy, safety, and capability.
Claude: Constitutional AI
Anthropic released Claude, an AI assistant built with Constitutional AI (CAI) — a novel approach where the model is trained to follow a set of principles rather than just optimizing for human preference ratings. Anthropic, founded by former OpenAI researchers, positioned Claude as the safety-focused alternative.
Llama 2: Meta Opens the Floodgates
Meta released Llama 2, a family of widely available large language models (7B, 13B, 70B parameters) distributed as open weights under a custom license that allowed broad commercial use. While not open-source in the strict OSI sense, it gave companies and researchers access to a frontier-quality model they could run, customize, and deploy themselves.
Midjourney V5: Photorealistic AI Art
Midjourney V5 produced images so photorealistic that AI-generated photos went viral and were mistaken for real photographs — including a fake image of the Pope in a puffer jacket and fake photos of Trump's arrest. The line between AI-generated and real imagery effectively dissolved.
Mixtral 8x7B: Efficient Mixture of Experts
French startup Mistral AI released Mixtral 8x7B, a mixture-of-experts model that matched or beat GPT-3.5 while using a fraction of the compute per token. It demonstrated that clever architecture could compete with brute-force scaling.
Gemini: Google's Multimodal Response
Google launched Gemini, its most capable AI model family, natively multimodal across text, code, images, audio, and video. Gemini Ultra matched or exceeded GPT-4 on many benchmarks. It marked Google DeepMind's full response to OpenAI's dominance.