Microsoft and Uber Are Running Into an AI Cost Problem - Firethering
back to top
Home
Softwares
AI Tools
DevTools
3D Tools
Design Tools
Image Editors
Video Editors
Productivity
Utilities
Apps
Android Apps
iOS Apps
Games
Windows Games
macOS Games
Android Games
iOS Games
Tech
Picks
AI Picks
AI Models
Trends
Search
Tuesday, May 26, 2026
Home
Softwares
AI Tools
DevTools
3D Tools
Design Tools
Image Editors
Video Editors
Productivity
Utilities
Apps
Android Apps
iOS Apps
Games
Windows Games
macOS Games
Android Games
iOS Games
Tech
Picks
AI Picks
AI Models
Trends
Facebook<br>Instagram<br>Twitter<br>Vimeo<br>Youtube
Home
Softwares
AI Tools
DevTools
3D Tools
Design Tools
Image Editors
Video Editors
Productivity
Utilities
Apps
Android Apps
iOS Apps
Games
Windows Games
macOS Games
Android Games
iOS Games
Tech
Picks
AI Picks
AI Models
Trends
Search
HomeTechMicrosoft and Uber Are Running Into an AI Cost Problem
Microsoft and Uber Are Running Into an AI Cost Problem
By Mohit Geryani
May 26, 2026
Last updated: May 26, 2026
Share
- Advertisement -
The pitch was impressive. AI tools would make developers faster, reduce headcount costs, and pay for themselves many times over. Companies that moved early would have a structural advantage over those that waited.
Microsoft believed it. So did Uber. Both pushed hard on AI coding tool adoption across their engineering teams. Both are now dealing with same problem: the faster their employees embraced the tools, the faster the bills grew. In some cases those bills have started exceeding what the same work would have cost with human labor.
The problem is what happens to the economics when thousands of employees use something that charges per unit of thought.
Table of Contents
The token trap nobody planned for
AI models charge per token, the basic unit of text the model processes and generates.
When Uber’s CTO disclosed that the company had burned through its entire 2026 AI coding budget in four months, the detail that got less attention was how it happened. Uber had been actively pushing adoption, running internal leaderboards to rank teams by AI tool usage. More encouragement meant more usage. More usage meant more tokens. More tokens meant more compute. The budget math that looked reasonable in January looked catastrophic by April.
Amazon has been telling staff to "tokenmaxx," meaning use as many tokens as possible. Meta built an internal tracking tool called Claudeonomics to monitor which employees were using AI most heavily. These are companies treating token consumption as a metric to maximize, which is exactly backwards if the goal is cost efficiency.
The paradox is structural. Agentic AI systems, the ones that work autonomously across multiple steps consume more tokens per task than standard models. Goldman Sachs forecasts a 24-fold increase in enterprise token consumption by 2030 as agentic deployments scale. Gartner projects that inference costs will fall nearly 90% by the same year. But Gartner also warned that cheaper tokens will not produce cheaper bills, because consumption growth will outpace price declines and AI providers are unlikely to pass through the full benefit of cost reductions to business customers.
Cheaper per token. Higher total bill. The more you use it the worse the math gets.
When compute costs more than the employee
The most uncomfortable acknowledgment of where this is heading came from Bryan Catanzaro, Vice President of applied deep learning at Nvidia, the company that supplies the chips powering essentially all of this infrastructure.
"For my team, the cost of compute is far beyond the costs of the employees," he said.
That statement carries weight because of who said it. Nvidia has more financial interest in AI compute spending than almost any other company on earth. When its own executive acknowledges that compute costs are exceeding labor costs for his team, it is not a bearish take on AI. It is an honest description of the current economics from someone with no incentive to understate them.
Microsoft’s situation illustrates the same point from a different angle. The company cancelled most of its direct Claude Code licences after thousands of employees adopted the tool faster than anyone anticipated. The move doesn’t touch Microsoft’s $5 billion investment in Anthropic or its commercial relationship with the company. It’s a pure cost control decision on a tool its own engineers had grown to depend on. When the company that built GitHub Copilot, owns the dominant AI coding platform, and made one of the largest AI bets in the industry pulls back on AI coding spend, the economics are the only explanation that makes sense.
Where the math actually works
MIT research found AI is only economically viable in a limited number of job roles at current pricing. The tasks where it clears the bar tend to share common characteristics:...