The Unbearable Cheapness of Open Weight Models – James O'Claire
Today I was setting up Hermes to see how it does with web research. I chose DeepSeek V4 because I know it is cheap, but seeing its pricing next to Anthropic and OpenAI ‘frontier’ models is crazy. The frontier models cost nearly 50x more per token, and that is before counting how much extra pondering any of their models might fall into (using more tokens for the same task).
What worries me about this is that Anthropic and OpenAI seem to have backed themselves into a corner of high costs. Can they reasonably decrease their prices by 20-50x to compete with DeepSeek or Xiaomi’s Mimo?
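To make the arithmetic concrete, here is a rough sketch in Python of how a per-token price gap combines with extra token usage. The prices and token counts are made-up placeholders chosen to give a 50x gap, not real price lists, and the function names are hypothetical.

```python
def task_cost(price_per_million_tokens: float, tokens_used: int) -> float:
    """Dollar cost of one task at a flat per-token price."""
    return price_per_million_tokens * tokens_used / 1_000_000


def cost_ratio(expensive_price: float, cheap_price: float,
               expensive_tokens: int, cheap_tokens: int) -> float:
    """How many times more the expensive model costs for the same task."""
    return (task_cost(expensive_price, expensive_tokens)
            / task_cost(cheap_price, cheap_tokens))


# Placeholder numbers for illustration only (assumptions, not real prices).
CHEAP_PRICE = 1.00       # dollars per million tokens, hypothetical
FRONTIER_PRICE = 50.00   # dollars per million tokens, hypothetical
BASE_TOKENS = 1_000_000  # tokens the cheap model uses for the task, hypothetical

# Same token usage on both models: the gap is just the price gap.
same_tokens_ratio = cost_ratio(FRONTIER_PRICE, CHEAP_PRICE,
                               BASE_TOKENS, BASE_TOKENS)

# If the pricier model "ponders" and uses twice the tokens, the gap doubles.
pondering_ratio = cost_ratio(FRONTIER_PRICE, CHEAP_PRICE,
                             2 * BASE_TOKENS, BASE_TOKENS)

print(f"same tokens: {same_tokens_ratio:.0f}x, with pondering: {pondering_ratio:.0f}x")
```

The point of the sketch is only that the two effects multiply: a price gap and a token-usage gap compound rather than add.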
Open Weight vs Low Cost
Are these models cheap because they are open weight, and having hundreds of people stress test them on different hardware helped to lower the cost? Or is it that they are being provided as loss leaders to drive prices down?
How do you keep prices high for commodity products?
You manufacture scarcity. You sell luxury and premium branding. This is what OpenAI and Anthropic seem to be doing by gating ‘frontier’ model usage behind higher walls.
This is how luxury brands have sold cars and handbags forever. They are clubs and status symbols for the rich, not meant to be widely distributed.
Will Anthropic & OpenAI lean on China fears to push bans on open weight models?
This has been my fear for a few months now, and each week that goes by seems to support it. How do you manufacture scarcity? One easy way is to fearmonger and get the government to help restrict access to the competition.
Why not compete?
The US used to be such a champion of open source, and I would hope that serious open source competition can come out of the US to prove that open weight and open source models are ultimately the future.
Google Gemma 4 was released in April 2026
Meta had Llama, which hasn’t had a new release
OpenAI last released open weight GPT models in 2025
Anthropic, to my knowledge, has never released any open weight model
True Open Source vs Open Weight
I think the leapfrog scenario for open source will be the truly open source models, where the data pipeline for training is also open sourced.
https://allenai.org/olmo -> You can download these models now, and they’re seeing increasing popularity. That said, they are a bit out of date, with data cutoffs in December 2024.
Looking to the future, the US NSF partnered with Nvidia to enable Allen AI to develop a truly, fully open AI:
https://www.nsf.gov/news/nsf-nvidia-partnership-enables-ai2-develop-fully-open-ai
Bonus:
Curious to dig more into Claude / ChatGPT tech stacks? Check out the tools they used to build their iOS and Android apps:
Claude Android
ChatGPT Android
You can navigate to SDKs to view even more detailed breakdowns of specific parts as well as unmapped SDK paths.