Return on Intelligence, Part 3: Moats | rebecca powell
Go back<br>Return on Intelligence, Part 3: Moats<br>X Facebook Y Combinator LinkedIn Reddit Email WhatsApp Medium Threads Bluesky<br>Published: 16 May, 2026 | at 02:45 PM
Suggest Changes
Series guide
Return on Intelligence<br>AI and the Programmable Firm<br>Part 3 of 8. Intelligence alone is not the moat. Durable moats may be context, distribution, workflow, identity, permissions, governance, execution rights, data and systems of record.
Open preface
Previous
Continue
Paradigm lifecycle
Invention Wonder Speculation Overbuild Belief break Crash Consolidation Mature infrastructure Extraction
Part 3: Moats
Part 2 made the distinction between utility and capture. This chapter applies that distinction to the strongest current ownership claim in AI: that a few centralized model providers will remain the permanent home of intelligence. I think that story is much weaker than it looks, and it sets up the SaaS consequences explored in Part 4.
Intelligence alone is not the moat.
Intelligence will flow through a small number of centralized model providers because they have the best models and the infrastructure to run them.
That sounds reasonable today. It may even be true today, but I do not think it is the final form.
OpenAI, Anthropic and the other frontier labs are betting on a world where intelligence remains scarce, expensive, centralized and hard to reproduce. They have the models. They have the talent. They have the infrastructure relationships. They have the APIs. They have the mindshare. They have the brand.
That is a powerful position, but it is not automatically a durable moat.
A model lead is not a moat. It is a lead, and leads decay.
Scarcity is the current business model
The economics of the frontier labs are built around scarcity.
The best models are expensive to train. They are expensive to serve. They require huge amounts of compute, power, memory, networking and operational expertise. They sit behind APIs because very few companies can afford to train or run them at scale.
That creates the feeling of a moat.
If everyone needs intelligence and only a few companies can provide it, then those companies should capture the value.
That is the simple version of the story. It is also the version I am most suspicious of.
Technology attacks scarcity .
That is what it does.
Every layer of the AI stack is currently under pressure to become cheaper, smaller, faster and more distributed. Model architecture improves. Inference improves. Quantization improves. Distillation improves. Hardware improves. Memory improves. Edge devices improve. Developer tooling improves. Open models improve. Specialized models improve.
The question is not whether frontier models will remain impressive. They will.
The question is how many tasks actually require the frontier.
That number may be much smaller than the market currently assumes.
Yesterday’s frontier becomes tomorrow’s local model
Computing history repeatedly moves capability from centralized infrastructure to local devices.
Mainframes mattered. Then personal computers mattered. Then servers and cloud mattered. Then mobile and edge mattered. The pattern is not a clean replacement. It is a migration of the default.
The same thing is likely to happen with AI.
Today’s expensive cloud capability becomes tomorrow’s cheap local capability. Not all of it. Not the moving frontier. But enough of it to change the economics.
A local model does not need to be the smartest model in the world to be valuable.
It needs to be good enough for the task, cheap enough to run continuously, private enough to trust, fast enough to feel ambient and integrated enough to disappear into the workflow.
That is a different design target from the frontier leaderboard.
Most everyday AI tasks are not grand acts of genius.
They are repetitive, contextual and close to the user:
summarize this thread;
rewrite this paragraph;
classify this email;
extract these fields;
explain this error;
draft this reply;
organize these notes;
search my files;
help with this form;
suggest the next step;
generate a small script;
automate this local workflow;
check this document against a known policy.
Those are not all frontier tasks.
They are context tasks.
The advantage goes to whoever owns the context, the device, the operating system, the workflow, the permissions and the user relationship.
That is often not the model lab.
The future is hybrid, not API-only
I am not arguing that cloud AI disappears.
That would be too simplistic.
The frontier will matter. There will be tasks where the best available model is worth paying for. Complex reasoning, scientific work, high-end coding, multimodal generation, long-context synthesis, difficult planning, regulated review, specialist analysis and heavy agentic workflows may all require cloud-scale models for a long time.
But the default will shift.
The mature AI stack is likely to be...