Microsoft and Uber Are Running into an AI Cost Problem

Microsoft and Uber Are Running Into an AI Cost Problem - Firethering

By Mohit Geryani

May 26, 2026

Last updated: May 26, 2026

The pitch was impressive. AI tools would make developers faster, reduce headcount costs, and pay for themselves many times over. Companies that moved early would have a structural advantage over those that waited.

Microsoft believed it. So did Uber. Both pushed hard on AI coding tool adoption across their engineering teams. Both are now dealing with the same problem: the faster their employees embraced the tools, the faster the bills grew. In some cases those bills have started to exceed what the same work would have cost with human labor.

The problem is what happens to the economics when thousands of employees use something that charges per unit of thought.

The token trap nobody planned for

AI models charge per token, the basic unit of text the model processes and generates.

When Uber’s CTO disclosed that the company had burned through its entire 2026 AI coding budget in four months, the detail that got less attention was how it happened. Uber had been actively pushing adoption, running internal leaderboards to rank teams by AI tool usage. More encouragement meant more usage. More usage meant more tokens. More tokens meant more compute. The budget math that looked reasonable in January looked catastrophic by April.
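To make the budget math concrete, here is a minimal sketch of the run rate implied by spending a full annual budget in four months. It assumes spend was spread evenly across those months, which the reporting does not confirm; the function name and parameters are illustrative, not anything Uber has published.

```python
def annualized_spend_multiple(budget_fraction_spent: float, months_elapsed: float) -> float:
    """Projected full-year spend as a multiple of the annual budget.

    Assumption (not from the article): spend continues at the same
    flat monthly rate for the rest of the year.
    """
    if months_elapsed <= 0:
        raise ValueError("months_elapsed must be positive")
    monthly_rate = budget_fraction_spent / months_elapsed
    return monthly_rate * 12


# The whole budget (1.0) gone in 4 months implies roughly 3x the
# planned annual spend if nothing changes.
multiple = annualized_spend_multiple(1.0, 4)
print(f"Projected spend: {multiple:.1f}x the annual budget")
```

If usage was still accelerating in April, as the adoption leaderboards suggest it might have been, the flat-rate assumption would understate the overrun.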

Amazon has been telling staff to "tokenmaxx," meaning use as many tokens as possible. Meta built an internal tracking tool called Claudeonomics to monitor which employees were using AI most heavily. These are companies treating token consumption as a metric to maximize, which is exactly backwards if the goal is cost efficiency.

The paradox is structural. Agentic AI systems, the ones that work autonomously across multiple steps, consume more tokens per task than standard models. Goldman Sachs forecasts a 24-fold increase in enterprise token consumption by 2030 as agentic deployments scale. Gartner projects that inference costs will fall nearly 90% by the same year. But Gartner also warned that cheaper tokens will not produce cheaper bills, because consumption growth will outpace price declines and AI providers are unlikely to pass through the full benefit of cost reductions to business customers.

Cheaper per token. Higher total bill. The more you use it, the worse the math gets.
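A rough sketch of that arithmetic, combining the two forecasts quoted above. It rests on assumptions the sources do not state: that both forecasts share the same baseline, that "nearly 90%" can be treated as exactly 0.90, and that the entire cost decline reaches customers as lower prices (Gartner suggests it will not, which would make the bill larger still). The function names are illustrative.

```python
def bill_multiplier(consumption_growth: float, price_decline: float) -> float:
    """Total bill relative to today: (tokens used) x (price per token).

    consumption_growth: future tokens / current tokens (e.g. 24 for 24-fold)
    price_decline: fractional drop in price per token (e.g. 0.90 for 90%)
    """
    if not 0.0 <= price_decline <= 1.0:
        raise ValueError("price_decline must be between 0 and 1")
    return consumption_growth * (1.0 - price_decline)


def break_even_price_decline(consumption_growth: float) -> float:
    """Price decline needed for the total bill to stay flat."""
    if consumption_growth <= 0:
        raise ValueError("consumption_growth must be positive")
    return 1.0 - 1.0 / consumption_growth


# Illustrative inputs taken from the forecasts quoted in the article.
growth = 24.0   # Goldman Sachs: 24-fold token consumption by 2030
decline = 0.90  # Gartner: "nearly 90%", rounded here as an assumption

print(f"Bill multiplier: {bill_multiplier(growth, decline):.1f}x")
print(f"Break-even price decline: {break_even_price_decline(growth):.1%}")
```

Under these assumptions the total bill lands at roughly 2.4 times today's, and prices would need to fall by about 96% just to hold spending flat against 24-fold consumption.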

When compute costs more than the employee

The most uncomfortable acknowledgment of where this is heading came from Bryan Catanzaro, Vice President of applied deep learning at Nvidia, the company that supplies the chips powering essentially all of this infrastructure.

"For my team, the cost of compute is far beyond the costs of the employees," he said.

That statement carries weight because of who said it. Nvidia has more financial interest in AI compute spending than almost any other company on earth. When its own executive acknowledges that compute costs are exceeding labor costs for his team, it is not a bearish take on AI. It is an honest description of the current economics from someone with no incentive to understate them.

Microsoft’s situation illustrates the same point from a different angle. The company cancelled most of its direct Claude Code licences after thousands of employees adopted the tool faster than anyone anticipated. The move doesn’t touch Microsoft’s $5 billion investment in Anthropic or its commercial relationship with the company. It’s a pure cost control decision on a tool its own engineers had grown to depend on. When the company that built GitHub Copilot, owns the dominant AI coding platform, and made one of the largest AI bets in the industry pulls back on AI coding spend, the economics are the only explanation that makes sense.

Where the math actually works

MIT research found AI is only economically viable in a limited number of job roles at current pricing. The tasks where it clears the bar tend to share common characteristics:...
