Anthropic, OpenAI, Mistral: what actually shipped in Q4 2025

October to December 2025 was the busiest AI model release quarter since the market began. Here’s what we kept, filtering marketing from real change for businesses.

Anthropic: Claude 4.6 and the arrival of agents

Claude 4.6 Opus shipped in November, with clear improvements on multi-step reasoning and very long context handling (1M tokens). Quieter but more important: Claude for Agents, a variant optimized for agentic workflows with better tool control and fewer infinite loops.

For businesses: Sonnet remains the sweet spot for most tasks. Opus for complex analysis. Claude for Agents for multi-step production workflows.

OpenAI: GPT-5 and the price war

GPT-5 launched in October. Impressive capabilities, especially in reasoning and code. But the most strategic move was the 60% price cut on GPT-4o and GPT-4o-mini in November. It’s now the cheapest option for high-volume tasks.

Mistral: Large 3 and the European angle

Mistral Large 3 shipped in December. Performance close to Claude Sonnet on French and European languages. European hosting, natively GDPR-compliant. Interesting for businesses with Law 25 constraints and multi-jurisdiction operations.

Google: Gemini 2 Pro

Gemini 2 Pro launched in November. Excellent on multimodal (video, audio, images). If your use case involves non-textual media, it’s the first option to test.

Our recommendation by use case

For a French-speaking B2C chatbot: Claude Sonnet, hosted on AWS Canada.

For legal or financial document analysis: Claude Opus.

For high-volume classification (millions of calls/month): GPT-4o-mini.

For multimedia processing: Gemini 2 Pro.

For a business with European operations or strict compliance: Mistral Large 3.

The rule hasn’t changed: no good model in absolute terms, just a good model for your case. But the bench has widened, and the price-performance ratio has clearly improved across the board.

Anthropic: Claude 4.6 and the arrival of agents

OpenAI: GPT-5 and the price war

Mistral: Large 3 and the European angle

Google: Gemini 2 Pro

Our recommendation by use case

Have an AI project?