I Spent $47,000 on Claude Code in 90 Days. My CTO Asked Me One Question and I Couldn’t Answer It. | by Devrim Ozcay — Production Engineering | Jun, 2026 | MediumSitemapOpen in appSign up<br>Sign in
Medium Logo
Get app<br>Write
Search
Sign up<br>Sign in
Press enter or click to view image in full size
I Spent $47,000 on Claude Code in 90 Days. My CTO Asked Me One Question and I Couldn’t Answer It.
We were shipping faster than ever. The board didn’t care. They wanted to know if we could ever ship without it.
Devrim Ozcay — Production Engineering
6 min read·<br>4 days ago
Listen
Share
The bill came in on a Friday.<br>I was in the kitchen at the office refilling coffee when our finance lead walked over and asked if I had a minute. He was holding a printout. Finance people in 2026 do not walk up to engineers holding printouts unless something is on fire.<br>The number on the printout was $47,213. Three months of API spend. One team. Eight engineers.<br>I told him it was fine. Velocity was up. Story points were up. The CTO had cited us in the last all-hands as an example of “AI-leveraged execution.” I had a slide somewhere with a chart that bent up and to the right.<br>He nodded, said okay, and walked away. He did not look reassured.<br>The next Monday I was in the CTO’s office.<br>The setup<br>We bought into Claude Code in January. The pitch made sense. Our roadmap was packed, hiring was frozen, and the agentic coding wave was real. I had benchmarked it on three internal projects in December. The numbers were good. Not magic. Good enough that I signed off without losing sleep.<br>By April we had eight engineers using it daily. Most of them had stopped writing functions from scratch. The good ones used it as leverage. The okay ones used it as a crutch. The new hires used it as a teacher, which is the worst of the three but I will get to that later.<br>Our sprint velocity numbers looked like a company that had quietly doubled headcount. Senior leadership loved the graphs. I loved the graphs. The graphs were the problem.<br>The meeting<br>The CTO did not open with the bill.<br>He opened by asking how the team was doing. I gave him the usual update. Shipping on schedule. Two big projects ahead of plan. A junior who was crushing it. He nodded through all of it.<br>Then he said: “If we paused our Claude Code spend tomorrow, how long until our velocity matched what it was in December?”<br>I opened my mouth to answer. Then I closed it.<br>I tried again. I started saying something about team adjustment periods. He waited. I could hear the HVAC.<br>The honest answer was that I did not know. I had not measured it. I had not built a single piece of internal infrastructure that worked independently of the agent. I had not asked any of my engineers to ship a non-trivial feature without it in six weeks.<br>I gave him a number. I said four to six weeks of recalibration. He wrote it down. He did not believe me. I did not believe me either.<br>The follow-up question<br>He let the four-to-six-weeks answer sit. Then he asked the question I have been thinking about ever since.<br>“Devrim, if the price doubles next quarter, what’s our move?”<br>This is the question nobody on engineering Twitter is talking about. Everyone is arguing about whether agents replace engineers. Nobody is talking about what happens when the agent becomes the dependency and your entire roadmap has been quietly priced against a vendor’s gross margin assumptions.<br>Claude Code at $47K a quarter was a line item. Claude Code at $94K a quarter was a hiring decision. Claude Code at $188K a quarter was a strategic crisis.<br>I did not have a contingency plan. I had not negotiated a cap with the vendor. I had not built any internal tooling that would let us downshift gracefully to a cheaper model. I had not stress-tested the team’s ability to function on raw Cursor without the high-end inference loop.<br>What I had was a velocity chart. Velocity charts are what engineering leaders show their CTOs when they do not understand what they actually built.<br>The thing nobody admits<br>After the meeting I went back to my desk and pulled the audit logs.<br>What I found was worse than the bill.<br>The spend was distributed unevenly. Two engineers were responsible for 61% of it. One was a senior engineer I respected enormously. The other was a new hire who had been with us for eleven weeks.<br>The senior engineer was using the agent for code review and architecture exploration. Heavy reasoning loops. Long context windows. The kind of usage where you are actually delegating thinking, not typing.<br>The new hire was using it to write code he could not have written on his own. Not in a “stretching beyond his level” way. In a “he genuinely did not understand the code that landed in our repo with his name on it” way.<br>I had been measuring his velocity. I had not been measuring his comprehension.<br>When I asked him to walk me through a function he had shipped two weeks earlier, he stared at it for ninety seconds and said it was probably from one of the refactor PRs the...