Why o3-mini had to be free: the coming DeepSeek R1, 2.0 Flash, and Sky-T1 Price War

2025's biggest surprise so far: Reasoning is less of a moat than anyone thought.

Jan 24, 2025

∙ Paid

Unlike Meta executives, we are truly on the fence about whether or not DeepSeek’s “$5.5m model trained with box of scrap GPUs with no MCTS/PRM” is a psyop 1. However, we do believe that the price-intelligence Pareto frontier is both closely watched and predictive, as we saw with the Gemini Pro price cut last September, and our updates with new models offer more clarity where the cost of reasoning is headed:

alos and prices imputed by swyx if missing. Tweet with credit here: note how distillation research decreased the slope of the parete frontier in 2024-2025

While it was expected that o3-mini due to launch “in ~a couple of weeks”, a surprise announcement after the R1 release this week (not counting Stargate and Operator) was that o3-mini would also be launched in free ChatGPT:

With Noam Shazeer back at Google actively shipping updates to Gemini 2.0 Flash Thinking - which is available for free with no pricing announced2, and with DeepSeek launching o1-competitive models at 27x cheaper than o1, the price pressure is on.

The Incredible Deflationary Impact of Reasoning Distillation

However, we believe the pressure is EVEN MORE intense for the -mini models than for the full reasoner models. The biggest surprise of the DeepSeek R1 paper wasn’t the performance of R1 itself, which was already preannounced in November.

Keep reading with a 7-day free trial

Subscribe to Latent.Space to keep reading this post and get 7 days of free access to the full post archives.

Why o3-mini *had* to be free: the coming DeepSeek R1, 2.0 Flash, and Sky-T1 Price War

2025's biggest surprise so far: Reasoning is less of a moat than anyone thought.

The Incredible Deflationary Impact of Reasoning Distillation

Keep reading with a 7-day free trial

Why o3-mini had to be free: the coming DeepSeek R1, 2.0 Flash, and Sky-T1 Price War