The Unbearable Cheapness of Open Weight Models

The Unbearable Cheapness of Open Weight Models – James O'Claire

Today I was setting up Hermes to see how it does with web research. I chose DeepSeek V4 because I know it is cheap, but seeing it’s pricing next to Anthropic and OpenAI ‘frontier’ models is crazy. Nearly a 50x price increase based on tokens alone, let alone how much pondering any of their models might fall into (using more tokens for the same task).

What worries me about this is that Anthropic and OpenAI seem to have backed themselves into a corner of high costs. Can they reasonably decrease their prices by 20-50x to compete with DeepSeek or Xiaomi’s Mimo?

Open Weight vs Low Cost

Are these models cheap because they are open weight and having hundreds or people stress test running them on different hardware helped to lower the cost? Or is it that they are being provided as loss leaders to drive the prices down?

How do you keep prices high for commodity products?

You manufacture scarcity. You sell luxury and premium branding. This is what OpenAI and Anthropic seem to be doing by gating ‘frontier’ model usage behind higher walls.

This is how luxury brands have sold cars and hand bags forever. They clubs and status symbols for the rich and not meant to be widely distributed.

Will Anthropic & OpenAI lean on China fears to push bans on open weight models?

This has been my fear for a few months now and each week that goes by seems to support this. How do you manufacture scarcity? One easy way is to fear monger and get the government to help restrict access to competition.

Why not compete?

The US used to be such a champion of open source, and I would hope that serious open source competition can come out of the US to prove that open weight and open source models are ultimately the future.

Google Gemma 4 was released in April 2026

Meta had llama which hasn’t had a rerease

OpenAI last released open weight gpt models in 2025

Anthropic to my knowledge has never released any open weight model

True Open Source vs Open Weight

I think the leap frog scenario for Open Source will be the true Open Source models where the data pipeline for training is also open sourced.

https://allenai.org/olmo -> You can download these models now and they’re seeing increasing popularity. That being said, they are a bit out of date, with data cutoffs in Dec 2024

Looking to the future, the US NSF partnered with Nvidia to enable Allen AI to develop a true fully open AI:<br>https://www.nsf.gov/news/nsf-nvidia-partnership-enables-ai2-develop-fully-open-ai

Bonus:

Curious to dig more into Claude / ChatGPT tech stacks? Check out the tools they used to build their iOS and Android apps:

Claude Android<br>ChatGPT Android

You can navigate to SDKs to view even more detailed breakdowns of specific parts as well as unmapped SDK paths.

The Unbearable Cheapness of Open Weight Models

Related Articles

US Government directive to suspend access to Fable 5 and Mythos 5

Is AI ruining our skills? Early results are in – and they're not good

The Anatomy of an AI-Native Org

Apertus – Open Foundation Model for Sovereign AI

Britain Became as Poor as Mississippi