October to December 2025 was the busiest AI model release quarter since the market began. Here’s what we kept, filtering marketing from real change for businesses.
Anthropic: Claude 4.6 and the arrival of agents
Claude 4.6 Opus shipped in November, with clear improvements on multi-step reasoning and very long context handling (1M tokens). Quieter but more important: Claude for Agents, a variant optimized for agentic workflows with better tool control and fewer infinite loops.
For businesses: Sonnet remains the sweet spot for most tasks. Opus for complex analysis. Claude for Agents for multi-step production workflows.
OpenAI: GPT-5 and the price war
GPT-5 launched in October. Impressive capabilities, especially in reasoning and code. But the most strategic move was the 60% price cut on GPT-4o and GPT-4o-mini in November. It’s now the cheapest option for high-volume tasks.
Mistral: Large 3 and the European angle
Mistral Large 3 shipped in December. Performance close to Claude Sonnet on French and European languages. European hosting, natively GDPR-compliant. Interesting for businesses with Law 25 constraints and multi-jurisdiction operations.
Google: Gemini 2 Pro
Gemini 2 Pro launched in November. Excellent on multimodal (video, audio, images). If your use case involves non-textual media, it’s the first option to test.
Our recommendation by use case
For a French-speaking B2C chatbot: Claude Sonnet, hosted on AWS Canada.
For legal or financial document analysis: Claude Opus.
For high-volume classification (millions of calls/month): GPT-4o-mini.
For multimedia processing: Gemini 2 Pro.
For a business with European operations or strict compliance: Mistral Large 3.
The rule hasn’t changed: no good model in absolute terms, just a good model for your case. But the bench has widened, and the price-performance ratio has clearly improved across the board.